Development of a korean Text Recognition System

한글 문서 인식 시스템 개발 연구

  • 고견 (연세대학교 전산과학과 인공지능 연구실) ;
  • 이일병
  • Published : 1989.03.01

Abstract

This paper reports on the development of a recognition system for Korean character,numbers and punctuation marks by syntactic approach after extracting a character or punctuation mark from a page of text.First,using the projection profile(Masudaet.al.1985,Pavlidin 1981)method, we segment a page into different regions of column or row major and then extracts lines of characters from it.Considering the height,width and connectivity of character block,we proceed to extract syllables from the extracted lines.Basically we distinguish syables into six types of formal pattern(남궁재찬 1982,이주근등 1981)following the research of lee and others,and the punctuation marks and numbers into two kinds of formal patterns,and discriminate the surface structure of the extracted syllables.By Index-Removal algorithm,we subdivide them into 44 kinds of basic korean subpattern and special characters (numbers,punctuation marks)and recognize them by syntactic method(이주근등 1981.)

Keywords