• Title/Summary/Keyword: 문자 인식

Search Result 1,163, Processing Time 0.027 seconds

A Method of Detecting of OCR error using Morphological Analysis (형태소 분석을 이용한 문자인식 에러의 검출)

  • Kim, Yun-Ho;Lee, Jong-Kuk;Kim, Hang-Joon;Lee, Sang-Jo
    • Annual Conference on Human and Language Technology
    • /
    • 1992.10a
    • /
    • pp.545-553
    • /
    • 1992
  • 문자인식에 있어서 인식율을 높이기 위한 후처리의 한 방법으로서, 문법 정보를 이용하는 후처리를 제안하고자 한다. 즉, 문자 인식 시스템에 의해 인식된 국어문에 대해서 오인식된 문자를 포함하는 어절을 검출하고, 오인식된 문자의 적절한 후보를 선정하여 그에 따라 자동수정을 행하는 것을 전채 후처리 과정으로 전제한다. 본 논문에서는 형태소 분석을 통해 오인식된 부분을 검출하는 과정을 보임으로써 문자인식에 있어서 문법 정보를 이용하는 후처리의 가능성과 그 유효성을 보이는 것을 목적으로 한다.

  • PDF

Character Recognition of Vehicle Number Plate Using Feature Based Neural Network (특징 추출에 기반한 신경망 시스템을 이용한 차량 번호판 문자인식)

  • 이현숙;김희승
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2000.10b
    • /
    • pp.383-385
    • /
    • 2000
  • 차량 번호판 문자영상으로부터 여러 가지 특징 추출 방법을 조합하여 입력특징소를 재구성하고, 신경망을 이용하여 문자를 인식한다. 속도 개선을 위해 특별한 전처리 과정없이 이치화와 크기 정규화만을 수행한 후 그물망 방법과 BLT 방법, 정규화된 투영값 특정 방법을 조합하여 입력특징소를 구성한다. 본 연구에서는 숫자 인식에서 그물망 방법과 BLT 방법을 이용하여 잡음으로 인한 유사 문자의 오인식을 해결하였고, 문자 인식에서는 정규화된 투영값 특징을 이용하여 문자의 유형을 분류한 후 자소를 개별적으로 인식하였다. 이로써 모음 인식 경우에 중요한 역할을 하는 작은 획의 영역에 BLT 방법을 사용함으로 기존 연구에서의 모음 오인식 문제를 해결하였다.

  • PDF

Preprocessing Technique Using a Feature of Character′s Stroke (다문자 획의 특성을 이용한 전처리 기법)

  • 이수봉;김우생
    • Proceedings of the Korea Multimedia Society Conference
    • /
    • 2004.05a
    • /
    • pp.758-761
    • /
    • 2004
  • 온라인 문자 인식 기술은 PDA, 테블릿 PC등 많은 새로운 응용에서 사용되고 있으나, 인식 기술은 아직 이러한 첨단 도구들을 자연스럽게 이용하기에는 못 미치는 실정이다. 따라서 본 논문에서는 인식률을 높이기 위해 전처리 과정에서 문자를 구성하는 획수를 통해 인식 시 해당 HMM 모델들에게만 적용하여 인식 시간을 줄이고 동시에 오류도 줄이고자 한다 제안하는 방법들의 타당성은 실험을 통해서 검증하였다.

  • PDF

Extraction text-region's pixel on caption of video (동영상에 삽입된 자막 내 문자영역화소추출)

  • An, Kwon-Jae;Kim, Gye-Young
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2011.01a
    • /
    • pp.43-45
    • /
    • 2011
  • 본 논문은 동영상 내 삽입된 자막을 문자인식이 가능하도록 문자영역을 이루는 화소를 추출하는 방법을 제안한다. 최초 자막영상을 통계학적 방법을 이용하여 색상극성을 결정한다. 이 후 색상극성에 따른 잡음제거 방법을 명암값기반과 형태학적기반으로 달리한다. 제안된 방법은 각 색상결정에 따른 적합한 잡음제거를 수행함으로서 추출된 화소들이 이루는 문자영역의 영상을 이용하여 문자인식을 수행하였을 때 기존방법보다 높은 문자인식률을 보였다.

  • PDF

Video character recognition improvement by support vector machines and regularized discriminant analysis (서포트벡터머신과 정칙화판별함수를 이용한 비디오 문자인식의 분류 성능 개선)

  • Lim, Su-Yeol;Baek, Jang-Sun;Kim, Min-Soo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.4
    • /
    • pp.689-697
    • /
    • 2010
  • In this study, we propose a new procedure for improving the character recognition of text area extracted from video images. The recognition of strings extracted from video, which are mixed with Hangul, English, numbers and special characters, etc., is more difficult than general character recognition because of various fonts and size, graphic forms of letters tilted image, disconnection, miscellaneous videos, tangency, characters of low definition, etc. We improved the recognition rate by taking commonly used letters and leaving out the barely used ones instead of recognizing all of the letters, and then using SVM and RDA character recognition methods. Our numerical results indicate that combining SVM and RDA performs better than other methods.

A Survey on the Off-line of Handwritten Korean Characters (필기 한글 문자의 오프라인 인식에 관한 사례 연구)

  • 김수형;정선화;오일석
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1998.10c
    • /
    • pp.396-398
    • /
    • 1998
  • 본 논문에서는 필기 한글 문자의 오프라인 인식 기술에 관련하여 최근 5년 동안 발표된 연구 사례를 종합하여 향후의 연구자들이 연구 방향을 설정하고 방법론을 개발하는데 도움이 되도록 함음 물론, 당 분야 연구의 발전 방향을 모색하고자 한다. 사례 조사의 범위는 필기 한글 문자 인식에 관련된 문자 데이터베이스, 낱자 인식, 단어 인식의 세 가지 핵심 요소 기술로만 국한하였으며 이들 각각에 대한 향후 연구의 방향을 제시하였다.

  • PDF

An Implementation Method of the Character Recognizer for the Sorting Rate Improvement of an Automatic Postal Envelope Sorting Machine (우편물 자동구분기의 구분율 향상을 위한 문자인식기의 구현 방법)

  • Lim, Kil-Taek;Jeong, Seon-Hwa;Jang, Seung-Ick;Kim, Ho-Yon
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.12 no.4
    • /
    • pp.15-24
    • /
    • 2007
  • The recognition of postal address images is indispensable for the automatic sorting of postal envelopes. The process of the address image recognition is composed of three steps-address image preprocessing, character recognition, address interpretation. The extracted character images from the preprocessing step are forwarded to the character recognition step, in which multiple candidate characters with reliability scores are obtained for each character image extracted. aracters with reliability scores are obtained for each character image extracted. Utilizing those character candidates with scores, we obtain the final valid address for the input envelope image through the address interpretation step. The envelope sorting rate depends on the performance of all three steps, among which character recognition step could be said to be very important. The good character recognizer would be the one which could produce valid candidates with very reliable scores to help the address interpretation step go easy. In this paper, we propose the method of generating character candidates with reliable recognition scores. We utilize the existing MLP(multilayered perceptrons) neural network of the address recognition system in the current automatic postal envelope sorters, as the classifier for the each image from the preprocessing step. The MLP is well known to be one of the best classifiers in terms of processing speed and recognition rate. The false alarm problem, however, might be occurred in recognition results, which made the address interpretation hard. To make address interpretation easy and improve the envelope sorting rate, we propose promising methods to reestimate the recognition score (confidence) of the existing MLP classifier: the generation method of the statistical recognition properties of the classifier and the method of the combination of the MLP and the subspace classifier which roles as a reestimator of the confidence. To confirm the superiority of the proposed method, we have used the character images of the real postal envelopes from the sorters in the post office. The experimental results show that the proposed method produces high reliability in terms of error and rejection for individual characters and non-characters.

  • PDF

A Recognition Method of Container ISO-code for Vision & Information System in Harbors (항만 영상정보시스템 구축을 위한 컨테이너 식별자 인식)

  • Koo, Kyung-Mo;Cha, Eui-Young
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2007.06a
    • /
    • pp.721-723
    • /
    • 2007
  • Recently, the size and location of the acquired container image while the container is loading and unloading in Harbors is not fixed. And it is difficult to get a good image for recognition because of the variation of external environment as those the size of container and where the yard-tractor stop is. In this paper, we estimate where the container ISO-code set is using Top-hat transform from realtime images and get an image to recognize container ISO-code using PAN/TILT/ZOOM camera. We extract the container ISO-code using Top-hat transform and Histogram projection. After binarization, we extract each character from complex background using labeling. We use BP(Backpropagation Network) to recognize extracted characters.

  • PDF

A Study on the Hangeul confusion Character Recognition Using Fractal Dimensions and Attactors (프랙탈 차원과 어트랙트를 이용한 한글 혼동 문자 인식에 관한 연구)

  • Son, Yeong-U
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.7
    • /
    • pp.1825-1831
    • /
    • 1999
  • In this paper, to reduce misrecognized characters, we propose the new method that extract features from character to apply to the character recognition using features from character to apply to the character recognition using fractal dimensions and attractors. Firstly, to reduce the load of recognizer we classify the characters. For the classified character, we extract the features for Box-counting dimensions. Natural Measures, Information dimensions then recognize characters. With histogram, we generate attractors and calculate dimensions from attractors. Then we recognize characters with dimensions of characters and attractors. An experimental result that the overall recognition rates for the training data and testing data are 96.03% and 91.74% respectively. This result shows the effectiveness of proposed method.

  • PDF

An On-line Speech and Character Combined Recognition System for Multimodal Interfaces (멀티모달 인터페이스를 위한 음성 및 문자 공용 인식시스템의 구현)

  • 석수영;김민정;김광수;정호열;정현열
    • Journal of Korea Multimedia Society
    • /
    • v.6 no.2
    • /
    • pp.216-223
    • /
    • 2003
  • In this paper, we present SCCRS(Speech and Character Combined Recognition System) for speaker /writer independent. on-line multimodal interfaces. In general, it has been known that the CHMM(Continuous Hidden Markov Mode] ) is very useful method for speech recognition and on-line character recognition, respectively. In the proposed method, the same CHMM is applied to both speech and character recognition, so as to construct a combined system. For such a purpose, 115 CHMM having 3 states and 9 transitions are constructed using MLE(Maximum Likelihood Estimation) algorithm. Different features are extracted for speech and character recognition: MFCC(Mel Frequency Cepstrum Coefficient) Is used for speech in the preprocessing, while position parameter is utilized for cursive character At recognition step, the proposed SCCRS employs OPDP (One Pass Dynamic Programming), so as to be a practical combined recognition system. Experimental results show that the recognition rates for voice phoneme, voice word, cursive character grapheme, and cursive character word are 51.65%, 88.6%, 85.3%, and 85.6%, respectively, when not using any language models. It demonstrates the efficiency of the proposed system.

  • PDF