• 제목/요약/키워드: grapheme extraction

검색결과 6건 처리시간 0.021초

A Study on Grapheme and Grapheme Recognition Using Connected Components Grapheme for Machine-Printed Korean Character Recognition

  • Lee, Kyong-Ho
    • 한국컴퓨터정보학회논문지
    • /
    • 제21권9호
    • /
    • pp.27-36
    • /
    • 2016
  • Recognition of grapheme is a very important process in the recognition within 'Hangul(Korean written language)' letters using phoneme recognition. It is because the success or failure in the recognition of phoneme greatly affects the recognition of letters. For this reason, it is reported that separation of phonemes is the biggest difficulty in the phoneme recognition study. The current study separates and suggests the new phonemes that used the connective elements that are helpful for dividing phonemes, recommends the features for recognition of such suggested phonemes, databases this, and carried out a set of experiments of recognizing phonemes using the suggested features. The current study used 350 letters in the experiment of phoneme separation and recognition. In this particular kind of letters, there were 1,125 phonemes suggested. In the phoneme separation experiment, the phonemes were divided in the rate of 100%, and the phoneme recognition experiment showed the recognition rate of 98% in recognizing only 14 phonemes into different ones.

Graphemes Segmentation for Arabic Online Handwriting Modeling

  • Boubaker, Houcine;Tagougui, Najiba;El Abed, Haikal;Kherallah, Monji;Alimi, Adel M.
    • Journal of Information Processing Systems
    • /
    • 제10권4호
    • /
    • pp.503-522
    • /
    • 2014
  • In the cursive handwriting recognition process, script trajectory segmentation and modeling represent an important task for large or open lexicon context that becomes more complicated in multi-writer applications. In this paper, we will present a developed system of Arabic online handwriting modeling based on graphemes segmentation and the extraction of its geometric features. The main contribution consists of adapting the Fourier descriptors to model the open trajectory of the segmented graphemes. To segment the trajectory of the handwriting, the system proceeds by first detecting its baseline by checking combined geometric and logic conditions. Then, the detected baseline is used as a topologic reference for the extraction of particular points that delimit the graphemes' trajectories. Each segmented grapheme is then represented by a set of relevant geometric features that include the vector of the Fourier descriptors for trajectory shape modeling, normalized metric parameters that model the grapheme dimensions, its position in respect to the baseline, and codes for the description of its associated diacritics.

A study on Machine-Printed Korean Character Recognition by the Character Composition form Information of the Graphemes and Graphemes using the Connection Ingredient and by the Vertical Detection Information in the Weight Center of Graphemes

  • Lee, Kyong-Ho
    • 한국컴퓨터정보학회논문지
    • /
    • 제22권3호
    • /
    • pp.97-105
    • /
    • 2017
  • This study is the realization study recognizing the Korean gothic printing letter. This study defined the new grapheme by using the connection ingredient and had the graphemes recognized by means of the feature dots of the isolated dot, end dot, 2-line gathering dots, more than 3 lines gathering dots, and classified the characters by means of the arrangement information of the graphemes and the layers that the graphemes form within the characters, and made the character database for the recognition by using them. The layers and the arrangement information of the graphemes consisting in the characters were presumed by using the weight center position information of the graphemes extracted from the characters to recognize and the information of the graphemes obtained by vertically exploring from the weight center of each grapheme, and it recognized the characters by judging and comparing the character groups of the database by means of the information which was secured this way. 350 characters were used for the character recognition test and about 97% recognition result was obtained by recognizing 338 characters.

이웃 각도 히스토그램 및 변형된 하우스도르프 거리를 이용한 'ㅁ', 'ㅇ' 자소 인식 (The Recognition of Grapheme 'ㅁ', 'ㅇ' Using Neighbor Angle Histogram and Modified Hausdorff Distance)

  • 장원두;김하영;차의영;김도현
    • 한국멀티미디어학회논문지
    • /
    • 제8권2호
    • /
    • pp.181-191
    • /
    • 2005
  • 한글 문자 인식에 있어서 ' ㅁ '과 ' ㅇ '의 오인식은 전반적인 인석성능의 저하를 가져오는 요소가 되고 있으나 이에 대한 연구가 미흡한 실정이다. 따라서, 본 논문에서는 'ㅁ'과 'ㅇ'을 효과적으로 인식하기 위한 새로운 특징 추출 방법을 제안하였다. 제안하는 방법은 변형된 하우스도르프 거리를 이용한 최적의 이웃 반경을 설정하고, 이 반경에 의해 이웃 픽셀과의 각도를 추출하여 두 자소를 구분하는 특징으로 사용하였다 실험을 통하여 분석한 결과 제안하는 특징 추출 방법은 기존의 방법들보다 적은 특징 개수를 사용하여 효율적으로 패턴을 인식할 수 있었으며 우수한 일반성 및 안정성을 나타내었다.

  • PDF

A Study on Automation about Painting the Letters to Road Surface

  • Lee, Kyong-Ho
    • 한국컴퓨터정보학회논문지
    • /
    • 제23권1호
    • /
    • pp.75-84
    • /
    • 2018
  • In this study, the researchers attempted to automate the process of painting the characters on the road surface, which is currently done by manual labor, by using the information and communication technology. Here are the descriptions of how we put in our efforts to achieve such a goal. First, we familiarized ourselves with the current regulations about painting letters or characters on the road, with reference to Road Mark Installation Management Manual of the National Police Agency. Regarding the graphemes, we adopted a new one using connection components, in Gothic print characters which was within the range of acceptance according to the aforementioned manual. We also made it possible for the automated program to recognize the graphemes by means of the feature dots of the isolated dots, end dots, 2-line gathering dots, and gathering dots of 3 lines or more. Regarding the database, we built graphemes database for plotting information, classified the characters by means of the arrangement information of the graphemes and the layers that the graphemes form within the characters, and last but not least, made the character shape information database for character plotting by using such data. We measured the layers and the arrangement information of the graphemes consisting the characters by using the information of: 1) the information of the position of the center of gravity, and 2) the information of the graphemes that was acquired through vertical exploration from the center of gravity in each grapheme. We identified and compared the group to which each character of the database belonged, and recognized the characters through the use of the information gathered using this method. We analyzed the input characters using the aforementioned analysis method and database, and then converted into plotting information. It was shown that the plotting was performed after the correction.

한글 모음의 구조적 특징을 이용한 문자영역 검출 기법 (Character Region Detection Using Structural Features of Hangul Vowel)

  • 박종천;이근왕;박형근
    • 한국산학기술학회논문지
    • /
    • 제13권2호
    • /
    • pp.872-877
    • /
    • 2012
  • 본 논문은 한글 모음의 구조적 특징을 이용하여 자연영상에 포함된 한글 문자영역을 검출하는 기법을 제안하였다. 자연 영상을 명도영상으로 변환하고 에지 및 연결요소 기반 방법으로 특징값을 추출하며, 추출된 특징값은 필터링을 수행하여 한글 문자의 특징에 맞지 않는 특징값을 제거하여 한글 문자영역 병합을 위한 후보를 선정한다. 선정된 후보 특징값은 한글 자소 병합 알고리즘으로 하나의 문자로 병합하여 후보 문자영역으로 검출하고, 한글 문자 유형 판별 알고리즘으로 한글 문자영역 여부를 판별함으로서 최종적인 한글 문자영역을 검출한다. 실험결과, 복잡한 배경을 갖고 다양한 환경에서 촬영된 영상에서 한글 문자영역을 효과적으로 검출하였고, 제안한 문자영역 검출 방법은 향상된 검출 결과를 보여 주었다.