• Title/Summary/Keyword: 혼합 문자 인식

Search Result 17, Processing Time 0.025 seconds

Statistical Approach to the Automatic Korean-English String Conversion (통계적 기법에 의한 한-영 문자열의 자동 전환)

  • Ahn, Young-Hoon;Kang, Seung-Shik
    • Annual Conference on Human and Language Technology
    • /
    • 2001.10d
    • /
    • pp.205-208
    • /
    • 2001
  • 한글 혹은 영어 문자열을 입력할 때 입력 모드를 수동으로 전환하지 않더라도 입력된 문자열이 한글인지, 영어인지를 자동으로 판단하여 해당 문자열로 변환하는 방법을 제안한다. 한글 문자열일 확률을 계산하기 위해 음절 구성 요건과 음절 빈도 정보를 이용하고, 영어 문자열일 확률을 계산하기 위해 영어 bigram 및 trigram 정보를 이용한다. 또한, 한글과 영어가 혼합된 문자열은 한글일 확률과 영어일 확률이 교차되는 경계 위치를 인식함으로써 혼합 문자열을 생성한다.

  • PDF

A Gerber-Character Recognition System with Multiple Recognizers and a Verifier (다중 인식기 및 검증기를 갖는 거버문자 인식 시스템)

  • Oh, Hye-Won;Park, Tae-Hyoung
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.14 no.1
    • /
    • pp.20-27
    • /
    • 2004
  • We propose the character recognition system for Gerber files. The Gerber file is the vector-formatted drawing file for PCB manufacturing, which includes various symbols, figures and characters. Also, the characters are written in horizontal, vertical, and reverse-vortical directions. In this paper, we newly propose the Gerber-character recognition system to recognize all of component names located in PCB. To improve the performance, we develop the multiple recognizers by neural networks and the verifier considering the structural features. The developed system has been installed to the auto-programming software for PCB assembly and inspection machines.

A Variable Parameter Model based on SSMS for an On-line Speech and Character Combined Recognition System (음성 문자 공용인식기를 위한 SSMS 기반 가변 파라미터 모델)

  • 석수영;정호열;정현열
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.7
    • /
    • pp.528-538
    • /
    • 2003
  • A SCCRS (Speech and Character Combined Recognition System) is developed for working on mobile devices such as PDA (Personal Digital Assistants). In SCCRS, the feature extraction is separately carried out for speech and for hand-written character, but the recognition is performed in a common engine. The recognition engine employs essentially CHMM (Continuous Hidden Markov Model), which consists of variable parameter topology in order to minimize the number of model parameters and to reduce recognition time. For generating contort independent variable parameter model, we propose the SSMS(Successive State and Mixture Splitting), which gives appropriate numbers of mixture and of states through splitting in mixture domain and in time domain. The recognition results show that the proposed SSMS method can reduce the total number of GOPDD (Gaussian Output Probability Density Distribution) up to 40.0% compared to the conventional method with fixed parameter model, at the same recognition performance in speech recognition system.

The Modified ART1 Network using Multiresolution Mergence : Mixed Character Recognition (다중 해상도 병합을 이용한 수정된 적응 공명 이론 신경망: 혼합 문자 인식 적용)

  • Choi, Gyung-Hyun;Kim, Min-Je
    • The KIPS Transactions:PartB
    • /
    • v.14B no.3 s.113
    • /
    • pp.215-222
    • /
    • 2007
  • As Information Technology growing, the character recognition application plays an important role in the ubiquitous environment. In this paper, we propose the Modified ART1 network using Multiresolution Mergence to the problems of the character recognition. The approach is based on the unsupervised neural network and multiresolution. In order to decrease noises and to increase the classification rate of the characters, we propose the multiresolution mergence strategy using both high resolution and low resolution information. Also, to maximize the effect of multiresolution mergence, we use a modified ART1 method with a different similarity measure. Our experimental results show that the classification rate of character is quite increased as well as the performance of the propose algorithm in conjunction with the similarity measure is improved comparing to the conventional ART1 algorithm in this application.

A Study on the Recognition of Handwritten Mixed Documents (필기체 혼합 문서 인식에 관한 연구)

  • 심동규;김인권;함영국;박래홍;이창범;김상중;윤병남
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.6
    • /
    • pp.1126-1139
    • /
    • 1994
  • This paper proposes an effective recognition system which recognizes the mixed document consisting of handwritten korean/alphanumeric texts and graphic images. In the preprocessing step, an input image is binarized by the proposed thresholding scheme, then graphic and character regions are separated by using connected components and chain codes. Separated Korean characters are merged based on partial recognition and their character types and sized. In the character recognition step, we use the branch and bound algorithm based on DP matching costs to recognize Korean characters. Also we recognize alphanumeric characters using several robust features. Finally we use a dictionary and information of a recognition step to correct wrong recognition results. Computer simulation with several test documents shows what the proposed algorithm recognized effectively handwritten mixed texts.

  • PDF

Phoneme Extraction from Freely Hand Written Han Gul (자유 필기체 한글에서의 자모 추출)

  • Oh, Weon-Geun;Shin, Young-Geon;Ahn, Young-Kyung
    • Annual Conference on Human and Language Technology
    • /
    • 1989.10a
    • /
    • pp.142-147
    • /
    • 1989
  • 필기체 문자는 인쇄체 문자와는 달리, 복잡한 변형이 따르므로, 인식 하는데 많은 문제점이 따른다. 그렇기 때문에 일반적인 필기체 인식에 있어서는 필기 자체에 대한 제한을 두어 변형을 적게한 문자를 인식 대상으로 삼고 있다. 이러한 문자는, 설정된 조건만 확실하게 만족한다면, 비교적 간단하게 인식 할 수 있다. 반면에, 자유 필기체 문자는, 제한 필기체 문자와는 달리 변형이 크기 때문에, 그 인식에는 많은 연구가 필요하다. 본 연구에서는, 자유 필기체 한글의 자모를 추출하는데 있어 두개의 parameter space method를 이용했다. 화상내에서의 혼합은, 기본적으로 5 개의 element ($\mid,\;\setminus,\;/,\;-,\;o$)로 구성되어 있고, 이 element를 정의하는데는 최소한 4 개의 parameter, 즉 element의 위치 [x, y], 크기 [1] 및 type [T] 등이 필요하다. 입력 화상에서 추출된 직선 및 원의 성분은 [x, y, l] 과 [x, y, T]의 2 개의 3-D parameter space 에 누적되고, parameter space 상에서의 병합 분할 과정을 거쳐, element 가 형성된다. 추출된 element 들은, parameter space 상에서의 방향성 및 상호 위치 관계에 의한 조합 형태로서, 미리 기술되어진 자모 모델과 비교되어 인식된다. 본 방법의 특정은, 문자의 크기에 무관하고, 해석방법에 의해서는, 끊어진 element나 불필요한 element 등의 왜곡된 element 들의 처리가 가능한 점, 4 차원 parameter space를 두개의 3 차원 parameter space로 분리, 처리시간과 기억용량의 절약을 기한점 등을 들 수 있다.

  • PDF

Improved Text Recognition using Analysis of Illumination Component in Color Images (컬러 영상의 조명성분 분석을 통한 문자인식 성능 향상)

  • Choi, Mi-Young;Kim, Gye-Young;Choi, Hyung-Il
    • Journal of the Korea Society of Computer and Information
    • /
    • v.12 no.3
    • /
    • pp.131-136
    • /
    • 2007
  • This paper proposes a new approach to eliminate the reflectance component for the detection of text in color images. Color images, printed by color printing technology, normally have an illumination component as well as a reflectance component. It is well known that a reflectance component usually obstructs the task of detecting and recognizing objects like texts in the scene, since it blurs out an overall image. We have developed an approach that efficiently removes reflectance components while preserving illumination components. We decided whether an input image hits Normal or Polarized for determining the light environment, using the histogram which consisted of a red component. We were able to go ahead through the ability to extract by reducing the blur phenomenon of text by light because reflection component by an illumination change and removed it and extracted text. The experimental results have shown a superior performance even when an image has a complex background. Text detection and recognition performance is influenced by changing the illumination condition. Our method is robust to the images with different illumination conditions.

  • PDF

Video character recognition improvement by support vector machines and regularized discriminant analysis (서포트벡터머신과 정칙화판별함수를 이용한 비디오 문자인식의 분류 성능 개선)

  • Lim, Su-Yeol;Baek, Jang-Sun;Kim, Min-Soo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.4
    • /
    • pp.689-697
    • /
    • 2010
  • In this study, we propose a new procedure for improving the character recognition of text area extracted from video images. The recognition of strings extracted from video, which are mixed with Hangul, English, numbers and special characters, etc., is more difficult than general character recognition because of various fonts and size, graphic forms of letters tilted image, disconnection, miscellaneous videos, tangency, characters of low definition, etc. We improved the recognition rate by taking commonly used letters and leaving out the barely used ones instead of recognizing all of the letters, and then using SVM and RDA character recognition methods. Our numerical results indicate that combining SVM and RDA performs better than other methods.

A study on the segmentation and extraction of the pictures and characters in korean document (한글 문서 인식을 위한 문서 영상에서의 문자와 그림의 분리 추출)

  • Lee, In-Dong;Ho, Kang-Tae;Kwon, Oh-Seok;Kim, Tae-Kyun
    • Annual Conference on Human and Language Technology
    • /
    • 1989.10a
    • /
    • pp.50-53
    • /
    • 1989
  • 한글 문서를 인식하기 위하여 문서 영상에서 문자와 그림을 분리 추출하기 위한 방법에 대하여 논하였다. 분리 추출 방법으로는 실시간으로 입력되는 영상 데이타로부터 문자와 그림 의 경계 위치를 알아내는 방법을 사용하였다. 한글, 영문, 한자, 기호 등의 문자와 그림이 혼합된 A4 크기의 문서 영상을 300 DPI의 해상도로 입력받아 실험하였다. 단 한번의 주사만으로 모든 문자와 그림이 정보 gm름의 순서에 따라 분리 추출되었다. 실험 결과 본 방법은 최소한의 시간과 최소한의 기억 용량으로 완벽한 분리 추출이 가능함을 보였다.

  • PDF

A Comparison of Pre-Service Teachers' and Students' Understanding of the Concept of Parameters as Means of Generalization (일반화 수단으로서 매개변수의 인식과 오류에 대한 연구 -중학교 2학년 학생들과 예비교사들의 인식과 오류를 중심으로-)

  • Jee, Young Myong;Yoo, Yun Joo
    • School Mathematics
    • /
    • v.16 no.4
    • /
    • pp.803-825
    • /
    • 2014
  • From the early stages of learning algebra, literal symbols are used to represent algebraic objects such as variables and parameters. The concept of parameters contains both indeterminacy and fixity resulting in confusion and errors in understanding. The purpose of this research is to compare the beginners of algebra and pre-service teachers who completed secondary mathematics education in terms of understanding this paradoxical nature of parameters. We recruited 35 middle school students in eight grade and 73 pre-service teachers enrolled in a undergraduate course at one university. Using them we conducted a survey on the perception of the nature of parameters asking if one considers parameters suggested in a problem as variables or constants. We analyzed the collected data using the mixed method of qualitative and quantitative approaches. From the analysis results, we identified several difficulties in understanding of parameters from both groups. Especially, our statistical analysis revealed that the proportions of subjects with limited understanding of the concept of parameters do not differ much in two groups. This suggests that learning algebra in secondary mathematics education does not improve the understanding of the nature of parameters significantly.

  • PDF