A Study on the Korean Character Segmentation and Picture Extraction from a Document

한국어 문서로부터 문자분리 및 도형추출에 관한 연구

  • ;
  • 류황빈 (광운대학교 전자계산기공학과) ;
  • Published : 1988.09.01

Abstract

In this paper, a method to segment each character and extract figure from Korean documents is proposed. At first, each character string is extracted by means of iterative horizontal propagation, shrink algorithm and run-length algorithm. Individual character region is extracted by iterative horizontal and vertical manipulation. Next, characters of right pitch are searched. Each character is segmented by the position information. Overlapped character is segmented on the ground of the width of already extracted character. The rest are extracted as special characters of half pitch. Using 9 data input in the form of 840 X 600 from Korean monthly magazine, experiment was simulated. Extraction rate of character is 100%, and that of individual character is 98%. Judging from these results, efficiency on extracting character region and segmenting individual character is proved.

Keywords