• Title/Summary/Keyword: 선택적 문자 인식

Search Result 47, Processing Time 0.024 seconds

A Rotary-type Virtual Keyboard System for Keylogging Prevention (키로깅 방지를 위한 회전형 가상키보드 시스템)

  • Baik, Geum Ok;Lim, Cheol Ho;Shon, Jin Gon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2010.04a
    • /
    • pp.774-777
    • /
    • 2010
  • 키로깅(Keylogging) 방지를 위한 입력방식은 무작위로 배열된 숫자나 문자를 마우스로 선택하는 가상 키보드가 주로 사용되고 있다. 그런데 무작위로 배열된 숫자나 문자는 순차적인 배열에 비해 가시성이 떨어지므로 사용자의 입력시간이 지연되어 사용하기 불편하다는 단점이 있다. 이에 본 논문에서는 숫자나 문자를 순차적으로 배열하여 사용자가 쉽게 인식할 수 있는 시각적 추상화 방법을 기반으로 하는 회전형 가상키보드 시스템(Rotary-type Virtual Keyboard System; R-VKS)을 제안한다. 제안하는 R-VKS는 기술적 측면에서 키로깅이나 마우스 커서 위치추적 등의 악성코드로부터 안전한 특성을 갖고, 공간 지각적 측면에서 사용자의 가시성을 높여 입력시간을 단축하는 효과가 있다.

Character Extraction from Color Map Image Using Interactive Clustering (대화식 클러스터링 기법을 이용한 칼라 지도의 문자 영역 추출에 관한 연구)

  • Ahn, Chang;Park, Chan-Jung;Rhee, Sang-Burm
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.1
    • /
    • pp.270-279
    • /
    • 1997
  • The conversion of printed maps into computerized databases is an enormous task. Thus the automation of the conversion process is essential. Efficient computer representation of printed maps and line drawings depends on codes assigned to characters, symbols, and vector representation of the graphics. In many cases, maps are constructed in a number of layers, where each layer is printed in a distinct color, and it represents a subset of the map information. In order to properly represent the character layer from color map images, an interactive clustering and character extraction technique is proposed. Character is usually separated from graphics by extracting and classifying connected components in the image. But this procedure fails, when characters touch or overlap lines-something that occurs often in land register maps. By vectorizing line segments, the touched characters and numbers are extracted. The algorithm proposed in this paper is intended to contribute towards the solution of the color image clustering and touched character problem.

  • PDF

A Quality Evaluation System of a Handwriting String by Global and Local Features (지역특징과 지역특징을 통한 필기문자열의 품질평가시스템)

  • Kim Gye-Young
    • Journal of Internet Computing and Services
    • /
    • v.5 no.6
    • /
    • pp.121-128
    • /
    • 2004
  • This paper proposes a quality evaluation system of a handwriting string written by electronic pen. For the purpose of the system, this paper describes how to retrieve reference data from a database, how to evaluate the quality of a handwiting string using global and local features. Also, it explains how to optionally recognize a grade of a handwriting string at using global and how to diagnose stroke order at using local. The quality can be evaluated in the case of different language between reference and input by the system. Therefore, we expect that the system is very useful not only for training on handwriting but also for learning a language.

  • PDF

Automatic gasometer reading system using selective optical character recognition (관심 문자열 인식 기술을 이용한 가스계량기 자동 검침 시스템)

  • Lee, Kyohyuk;Kim, Taeyeon;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.2
    • /
    • pp.1-25
    • /
    • 2020
  • In this paper, we suggest an application system architecture which provides accurate, fast and efficient automatic gasometer reading function. The system captures gasometer image using mobile device camera, transmits the image to a cloud server on top of private LTE network, and analyzes the image to extract character information of device ID and gas usage amount by selective optical character recognition based on deep learning technology. In general, there are many types of character in an image and optical character recognition technology extracts all character information in an image. But some applications need to ignore non-of-interest types of character and only have to focus on some specific types of characters. For an example of the application, automatic gasometer reading system only need to extract device ID and gas usage amount character information from gasometer images to send bill to users. Non-of-interest character strings, such as device type, manufacturer, manufacturing date, specification and etc., are not valuable information to the application. Thus, the application have to analyze point of interest region and specific types of characters to extract valuable information only. We adopted CNN (Convolutional Neural Network) based object detection and CRNN (Convolutional Recurrent Neural Network) technology for selective optical character recognition which only analyze point of interest region for selective character information extraction. We build up 3 neural networks for the application system. The first is a convolutional neural network which detects point of interest region of gas usage amount and device ID information character strings, the second is another convolutional neural network which transforms spatial information of point of interest region to spatial sequential feature vectors, and the third is bi-directional long short term memory network which converts spatial sequential information to character strings using time-series analysis mapping from feature vectors to character strings. In this research, point of interest character strings are device ID and gas usage amount. Device ID consists of 12 arabic character strings and gas usage amount consists of 4 ~ 5 arabic character strings. All system components are implemented in Amazon Web Service Cloud with Intel Zeon E5-2686 v4 CPU and NVidia TESLA V100 GPU. The system architecture adopts master-lave processing structure for efficient and fast parallel processing coping with about 700,000 requests per day. Mobile device captures gasometer image and transmits to master process in AWS cloud. Master process runs on Intel Zeon CPU and pushes reading request from mobile device to an input queue with FIFO (First In First Out) structure. Slave process consists of 3 types of deep neural networks which conduct character recognition process and runs on NVidia GPU module. Slave process is always polling the input queue to get recognition request. If there are some requests from master process in the input queue, slave process converts the image in the input queue to device ID character string, gas usage amount character string and position information of the strings, returns the information to output queue, and switch to idle mode to poll the input queue. Master process gets final information form the output queue and delivers the information to the mobile device. We used total 27,120 gasometer images for training, validation and testing of 3 types of deep neural network. 22,985 images were used for training and validation, 4,135 images were used for testing. We randomly splitted 22,985 images with 8:2 ratio for training and validation respectively for each training epoch. 4,135 test image were categorized into 5 types (Normal, noise, reflex, scale and slant). Normal data is clean image data, noise means image with noise signal, relfex means image with light reflection in gasometer region, scale means images with small object size due to long-distance capturing and slant means images which is not horizontally flat. Final character string recognition accuracies for device ID and gas usage amount of normal data are 0.960 and 0.864 respectively.

Region Analysis of Takbon Images (탁본영상의 영역분석)

  • Hwang, Jae-Ho
    • Proceedings of the KIEE Conference
    • /
    • 2006.04a
    • /
    • pp.141-143
    • /
    • 2006
  • 한국을 비롯한 동양 금석학 정보 인식의 중요한 매체인 탁본을 디지털 영상데이터로 변환하여 영상 특성을 분석하고 수학적 모델을 구현한다. 이를 위해 역사적으로 유명한 대표적 탁본을 포함한 50여개의 탁본영상 샘플을 작위로 선택하였고, 샘플영상 속에 내재되어 있는 영역특성을 중심으로 통계분석을 시도하였다. 탁본 원영상은 흑백의 두 영역으로 분할되는 완벽한 이진영상인데 반하여, 관측영상은 탁본뜨기 수작업과정을 거치면서 영역간 색도의 혼재와 얼룩무늬와 문양이 전체 영상에 분포한다. 본래의 두 영역은 정보영역과 바탕영역으로 구분되나 이들 얼룩무늬들은 또 다른 영역들로 치부되어 주로 바탕영역에 산발적으로 분포되어 영상인식을 저해하는 요인으로 작용한다. 관측영상 속에 내재되어 있는 영역 본래의 특성과 본뜨기 수작업 과정에서 새로 생성되는 영역들 사이의 기하학적 차이를 통계적으로 분류 처리함으로 관측 탁본영상의 영역 특성의 추이를 추론할 수 있다. 분석 결과, 탁본영상은 영역간 극단적인 확률적 차이를 보였으며, 이 양극성은 곧 탁본 원영상의 속성이 수작업과 관측이라는 훼손 과정을 거치면서도 보존됨을 의미한다. 이를 근거로 영역 특성과 훼손 과정을 수학적으로 모델링하였고 정보영역 추출의 일차적 개연성을 제시하였다.

  • PDF

Tangible Interaction : Application for A New Interface Method for Mobile Device -Focused on development of virtual keyboard using camera input - (체감형 인터랙션 : 모바일 기기의 새로운 인터페이스 방법으로서의 활용 -카메라 인식에 의한 가상 키보드입력 방식의 개발을 중심으로 -)

  • 변재형;김명석
    • Archives of design research
    • /
    • v.17 no.3
    • /
    • pp.441-448
    • /
    • 2004
  • Mobile devices such as mobile phones or PDAs are considered as main interlace tools in ubiquitous computing environment. For searching information in mobile device, it should be possible for user to input some text as well as to control cursor for navigation. So, we should find efficient interlace method for text input in limited dimension of mobile devices. This study intends to suggest a new approach to mobile interaction using camera based virtual keyboard for text input in mobile devices. We developed a camera based virtual keyboard prototype using a PC camera and a small size LCD display. User can move the prototype in the air to control the cursor over keyboard layout in screen and input text by pressing a button. The new interaction method in this study is evaluated as competitive compared to mobile phone keypad in left input efficiency. And the new method can be operated by one hand and make it possible to design smaller device by eliminating keyboard part. The new interaction method can be applied to text input method for mobile devices requiring especially small dimension. And this method can be modified to selection and navigation method for wireless internet contents on small screen devices.

  • PDF

A Passport Recognition and face Verification Using Enhanced fuzzy ART Based RBF Network and PCA Algorithm (개선된 퍼지 ART 기반 RBF 네트워크와 PCA 알고리즘을 이용한 여권 인식 및 얼굴 인증)

  • Kim Kwang-Baek
    • Journal of Intelligence and Information Systems
    • /
    • v.12 no.1
    • /
    • pp.17-31
    • /
    • 2006
  • In this paper, passport recognition and face verification methods which can automatically recognize passport codes and discriminate forgery passports to improve efficiency and systematic control of immigration management are proposed. Adjusting the slant is very important for recognition of characters and face verification since slanted passport images can bring various unwanted effects to the recognition of individual codes and faces. Therefore, after smearing the passport image, the longest extracted string of characters is selected. The angle adjustment can be conducted by using the slant of the straight and horizontal line that connects the center of thickness between left and right parts of the string. Extracting passport codes is done by Sobel operator, horizontal smearing, and 8-neighborhood contour tracking algorithm. The string of codes can be transformed into binary format by applying repeating binary method to the area of the extracted passport code strings. The string codes are restored by applying CDM mask to the binary string area and individual codes are extracted by 8-neighborhood contour tracking algerian. The proposed RBF network is applied to the middle layer of RBF network by using the fuzzy logic connection operator and proposing the enhanced fuzzy ART algorithm that dynamically controls the vigilance parameter. The face is authenticated by measuring the similarity between the feature vector of the facial image from the passport and feature vector of the facial image from the database that is constructed with PCA algorithm. After several tests using a forged passport and the passport with slanted images, the proposed method was proven to be effective in recognizing passport codes and verifying facial images.

  • PDF

Efficient From Document Classification Large using Partial Matching Method (부분 매칭 방법을 이용한 효율적인 서식 문서 분류)

  • Byeon, Yeong-Cheol;Choe, Yeong-U;Kim, Gyeong-Hwan;Lee, Il-Byeong
    • The KIPS Transactions:PartB
    • /
    • v.8B no.1
    • /
    • pp.1-9
    • /
    • 2001
  • 본 논문에서는 서식 문서를 짧은 처리 시간에 정확히 분류함으로써 실제 환경에서 응용할 수 있는 서식 분류 방법을 제안한다. 제안하는 방법은 서식 문서 이미지 전체를 다루기보다는 처리하고자 하는 서식 문서에서 서식 구조가 많이 다른 곳을 찾아서 매칭 영역으로 결정하고, 그 영역들에 대해서만 비교를 수행함으로써 계산 시간을 줄이고 인식률을 높인다. 선분 추출 시 오류를 고려하기 위하여 기존 인쇄 문자와 채워진 데이터, 그리고 매칭 영역의 크기 정보를 페널티 함수로 반영하여 매칭 영역 선택 시 고려한다. 본 방법은 구조적으로 많이 다르고, 양질의 특징을 포함하는 적은 수의 매칭 영역을 선택함으로써 처리 시간을 줄일 수 있음은 물론 높은 서식 분류율을 얻을 수 있다.

  • PDF

A Design on Mechanism for Recognize Simultaneous multiple-input in Multi-touch Screen Environment (멀티터치 스크린 환경에서 동시 다중입력 인식을 위한 메커니즘 설계)

  • Ju, Seung-Hwan;Seo, Hee Suk
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2011.11a
    • /
    • pp.1017-1020
    • /
    • 2011
  • 터치스크린은 키보드나 마우스와 같은 입력장치를 사용하지 않고, 스크린에 손가락, 펜 등을 접촉하여 입력하는 방식이다. 터치스크린은 정확성, 입력속도, 문자입력 등에서는 개선이 필요하나, 누구나 쉽게 입력할 수 있는 장점으로 인해 기존에는 현금인출기, 키오스크(Kiosk) 등 공공분야에서 주로 많이 사용되어 왔다. 터치스크린은 손가락을 접촉하는 것만으로 컴퓨터, 모바일 기기 등을 직관적으로 쉽게 사용할 수 있다는 것이 가장 큰 장점이다. 현재는 단순 터치스크린 환경을 넘어 동시 다중 입력이 가능한 멀티터치 환경의 터치스크린이 상용화 되면서 동시 다중 입력을 통한 어플리케이션들이 주목받고 있다. 본 논문에서는 동시에 입력한 여러 개의 아이템을 인식하는 방법을 다중 입력 패스워드를 통해 시뮬레이션 해보았다. 선택된 아이템으로 리스트를 구성하고 이를 동시 입력 단위로 사용함으로써 사용자 입장에서의 다중 입력의 정확성과 시스템 입장에서의 인식률을 높이고자 하였다.

Deep Learning OCR based document processing platform and its application in financial domain (금융 특화 딥러닝 광학문자인식 기반 문서 처리 플랫폼 구축 및 금융권 내 활용)

  • Dongyoung Kim;Doohyung Kim;Myungsung Kwak;Hyunsoo Son;Dongwon Sohn;Mingi Lim;Yeji Shin;Hyeonjung Lee;Chandong Park;Mihyang Kim;Dongwon Choi
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.143-174
    • /
    • 2023
  • With the development of deep learning technologies, Artificial Intelligence powered Optical Character Recognition (AI-OCR) has evolved to read multiple languages from various forms of images accurately. For the financial industry, where a large number of diverse documents are processed through manpower, the potential for using AI-OCR is great. In this study, we present a configuration and a design of an AI-OCR modality for use in the financial industry and discuss the platform construction with application cases. Since the use of financial domain data is prohibited under the Personal Information Protection Act, we developed a deep learning-based data generation approach and used it to train the AI-OCR models. The AI-OCR models are trained for image preprocessing, text recognition, and language processing and are configured as a microservice architected platform to process a broad variety of documents. We have demonstrated the AI-OCR platform by applying it to financial domain tasks of document sorting, document verification, and typing assistance The demonstrations confirm the increasing work efficiency and conveniences.