• Title/Summary/Keyword: Handwritten Chinese

Search Result 12, Processing Time 0.02 seconds

Character Segmentation in Chinese Handwritten Text Based on Gap and Character Construction Estimation

  • Zhang, Cheng Dong;Lee, Guee-Sang
    • International Journal of Contents
    • /
    • v.8 no.1
    • /
    • pp.39-46
    • /
    • 2012
  • Character segmentation is a preprocessing step in many offline handwriting recognition systems. In this paper, Chinese characters are categorized into seven different structures. In each structure, the character size with the range of variations is estimated considering typical handwritten samples. The component removal and merge criteria are presented to remove punctuation symbols or to merge small components which are part of a character. Finally, the criteria for segmenting the adjacent characters concerning each other or overlapped are proposed.

Improved Handwritten Hangeul Recognition using Deep Learning based on GoogLenet (GoogLenet 기반의 딥 러닝을 이용한 향상된 한글 필기체 인식)

  • Kim, Hyunwoo;Chung, Yoojin
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.7
    • /
    • pp.495-502
    • /
    • 2018
  • The advent of deep learning technology has made rapid progress in handwritten letter recognition in many languages. Handwritten Chinese recognition has improved to 97.2% accuracy while handwritten Japanese recognition approached 99.53% percent accuracy. Hanguel handwritten letters have many similar characters due to the characteristics of Hangeul, so it was difficult to recognize the letters because the number of data was small. In the handwritten Hanguel recognition using Hybrid Learning, it used a low layer model based on lenet and showed 96.34% accuracy in handwritten Hanguel database PE92. In this paper, 98.64% accuracy was obtained by organizing deep CNN (Convolution Neural Network) in handwritten Hangeul recognition. We designed a new network for handwritten Hangeul data based on GoogLenet without using the data augmentation or the multitasking techniques used in Hybrid learning.

Performance Improvement Strategies on Minimum Distance Classification for Large-Set handwritten Character Recognition (대용량 필기 문자인식을 위한 최소거리 분류법의 성능 개선 전략)

  • Kim, Soo-Hyung
    • The Transactions of the Korea Information Processing Society
    • /
    • v.5 no.10
    • /
    • pp.2600-2608
    • /
    • 1998
  • This paper proposes an algorithm for off line recognition of handwritten characters, especially effective for large-set characters such as Korean and Chinese characters. The algorithm is based on a minimum distance dlassification method which is simple and easy to implement but suffers from low recognition performance. Two strategies have been developed to improve its performance; one is multi-stage pre-classification and the other is candicate reordering. Effectiveness of the algorithm has been proven by and experimet with the samples of 574 classes in a handwritten Korean character catabase named PE02, where 86.0% of recognition accuracy and 15 characters per second of processing speed have been obtained.

  • PDF

Dissemination of the Tale of meifeizhuan to Korea and its Translation Practice (《매비전(梅妃傳)》의 국내유입과 번역양상)

  • Yoo, Hee June;Min, Kuan dong
    • Cross-Cultural Studies
    • /
    • v.27
    • /
    • pp.255-289
    • /
    • 2012
  • In the course of completing a National Research Foundation project, I recently found that a handwritten Korean manuscript of The Tale of Mei Fei is kept in the Adan Collection, which is a significant scholarly discovery given that no relevant research is available. The editions of the Tale of Mei Fei available in Korea include ${\ll}$藝苑?華${\gg}$ edition, ${\ll}$說?${\gg}$ edition, and the handwritten manuscript in Korean collected in the Adan Collection. Being the only handwritten Korean translation of the work, the Tale of Mei Fei in the Adan Collection was appended by the translations of ${\ll}$한셩뎨됴비연합덕젼${\gg}$ and ${\ll}$당고종무후뎐${\gg}$. As for the practice of translation of the work, literal "word to word" translation was done for the most part of the text; some sentences were occasionally translated liberally. Also, as for the poems in the text, pronunciation of each Chinese character was provided along with the translated text.

The Main Character and Evaluation of China's New Electronic Signature Legislation (중국 전자서명법의 주요내용 및 평가와 전망)

  • Han, Sang-Hyun
    • The Journal of Information Technology
    • /
    • v.9 no.3
    • /
    • pp.1-14
    • /
    • 2006
  • China has recently (28 August 2004) adopted a new act legalizing the electronic signature. This new act provides electronic signatures with the same legal status as handwritten signatures, and states that on-line certification providers will have to be created in order to ensure the security of on-line operations made using said signatures. This new act is intended to increase Chinese electronic business, and thus to raise the revenue China can expect from said business. And the law grants electronic signatures the same legal effect as handwritten signatures and seals in business transactions, and sets up the market access system for online certification providers to ensure the security of e-commerce. As Internet trade requires a reliable third party to identify the signers, the credibility of online certifying organizations is significant for the transaction security. So, considering the weakness of China's social credibility system, the law regulates that the online signatures certification providers should be approved and administered by governments.

  • PDF

Result Verification of On-line Handwritten Chinese Character Recognition using Structural Information (구조적 정보를 이용한 온라인 필기 한자 인식 결과 검증)

  • Yoon, Byoung-Hoon;Ha, Jin-Young
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2006.10b
    • /
    • pp.274-277
    • /
    • 2006
  • 본 논문에서는 온라인 필기 한자 인식 과정에 있어 비슷한 한자끼리 혼동되어 오인식되는 한자들을 선별하여 이들 한자 인식과정의 후처리에 적용 할 수 있는 결과검증 방법에 대해서 소개한다. 결과검증 방법은 온라인 필기 한자인식기가 최종 인식결과를 산출하기 전에 후보한자들 중 혼동한자가 있으면 그 혼동 한자에 대해서 미리 정해놓은 조건을 만족하는지 검사하여 점수를 내고 이를 인식 후처리에 이용한다. 인식 후처리에 쓰이는 결과검증 방법에서는 그 한자만이 가지고 있는 구조적인 정보를 다른 한자들과 구별하기 위해 휴리스틱으로 파악하여 조건화 시킨다. 구조적인 정보는 획의 좌표, 방향코드, 순서, 길이 등으로 판단되며 다양한 휴리스틱방법이 고려 될 수 있다. 인식 후처리에 적용되는 결과검증 방법을 통해 혼동되는 온라인 필기 한자를 구별하는데 도움이 되는 것을 실험을 통해 확인하였다.

  • PDF

Performance Evaluation of Chain-code Sequence and Structure-code Sequence for On-line Handwritten Chinese Character Recognition (온라인 필기 한자인식을 위한 체인코드열과 구조코드열의 성능평가)

  • Kim, Hyung-Tai;Ha, Jin-Young
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2006.10b
    • /
    • pp.402-407
    • /
    • 2006
  • 본 논문에서는 보다 실용적인 온라인 한자인식기 개발을 위하여 한자 검정 능력 1급 쓰기 수준을 모두 포함하는 한자 필기 데이터로부터 16방향의 체인코드열과 부분획의 구조를 반영하는 구조코드열을 만들어 성능평가를 하였다. 인식 방법으로는 DP 매칭 방법과 HMM을 사용하여 2,362 종류의 한자에 대해 인식 실험을 하였다. 그 결과 체인코드열을 사용한 DP 매칭 방법에 의한 결과가 96.54%로 가장 높은 인식률을 보였으며, 구조코드열을 사용하여 HMM에 의한 인식실험 결과가 95.65%로 그 뒤를 이었다. 인식 속도면에서는 체인코드 보다 코드열의 길이가 짧은 구조코드열을 사용한 방법이 상대적으로 유리했고, 클래스 당 1개의 모델을 사용한 HMM에 의한 방법이 클래스 당 복수개의 모델을 사용한 DP 매칭 방법에 비해 모델의 개수가 훨씬 적기 때문에 속도 면에서 월등히 유리해 더 효율적인 인식 성능을 보인다는 결론을 내릴 수 있었다.

  • PDF

Clustering Method based on Structure Code and HMM for Huge Class On-line Handwritten Chinese Character Recognition (대용량 온라인 필기 한자 인식을 위한 구조 코드 및 HMM 기반의 클러스터링 방법)

  • Kim, Kwang-Seob;Ha, Jin-Young
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2008.06c
    • /
    • pp.472-477
    • /
    • 2008
  • 본 논문에서는 은닉 마르코프 모델(HMM)을 기반한 대용량의 필기 한자 인식의 문제점인 시스템 리소스의 한계와 인식에 소요되는 많은 시간을 단축하기 위해 구조코드와 HMM에 최적화 된 클러스터링 알고리즘을 제안한다. 제안하는 클러스터링 알고리즘의 기본 개념은 훈련된 HMM를 대상으로 하고, HMM의 파라미터 수가 동일한 클래스에 대해서 클러스터를 구성하는 것이다. 또한 인식에 소요되는 시간을 줄이기 위해 2단계 클러스터모델 구조를 사용한다. 총 98,639 종류의 일본 한자를 대상으로 한 실험에서 평균 0.92 sec/char 인식 속도와 30순위 후보인식률 96.03%를 보임으로서 대용량 필기 한자 인식을 위한 좋은 방안이 될 것이라 기대한다.

  • PDF

A Hangul Script Matching Algorithm for PDA (PDA상에서의 한글 필기체 매칭 알고리즘)

  • Cho, Mi-Gyung;Cho, Hwan-Gue
    • Journal of KIISE:Software and Applications
    • /
    • v.29 no.10
    • /
    • pp.684-693
    • /
    • 2002
  • Electronic Ink is a stored data in the form of the handwritten text or the script without converting it into ASCII by handwritten recognition on the pen-based computers and Personal Digital Assistants(PDAs) for supporting natural and convenient data input. One of the most Important issue is to search the electronic ink in order to use it. We proposed and implemented a script matching algorithm for the electronic ink. Proposed matching algorithm separated the input stroke into a set of primitive stroke using the curvature of the stroke curve. After determining the type of separated strokes, it produced a stroke feature vector. And then it calculated the distance between the stroke feature vector of input strokes and one of strokes in the database using the dynamic programming technique. We did various experiments and our algorithm showed high matching rate over 97.7% for only the Korean script and 94% for the data mixed Korean with the Chinese character.

Kim Youngjak(金永爵) and the new material, 『A Collector of Correspondence from Chinese Intellectuals (中朝學士書翰錄)』 (금영작(金永爵)과 한중 척독교류의 새 자료 『중조학사서한록(中朝學士書翰錄)』)

  • QIAN, JINMEI
    • (The)Study of the Eastern Classic
    • /
    • no.34
    • /
    • pp.167-206
    • /
    • 2009
  • This paper discovers and introduces the collection of letters, "A Collector of Correspondence from Chinese Intellectuals (中朝學士書翰錄)" which was made by Kim Youngjak(1802~1868) who had collected the letters from Chinese intellectuals. "A Collector of Correspondence from Chinese Intellectuals (中朝學士書翰錄)" is a collector which contains handwritten letters to Kim Youngjak from Chinese people such as cheng gong shou(程恭壽), weng xue han(翁學涵), zhang bing yan(張丙炎), shao yan han(少言翰), and li wen yuan(李文源). Kim Youngjak had frequent meetings with Chinese intellectuals not only directly but also indirectly. He had exchanged letters with li bo heng(李伯衡), shuai fang wei(帥方蔚) for 30 years. In 1858, he went to Beijing and met Chinese intellectuals ye ming li(葉名澧), zhang bing yan(張丙炎), wu kun tian(吳昆田), cheng gong shou(程恭壽), and zhao guang(趙光). After coming back to Chos?n, he continued to exchange letters with them. "A Collector of Correspondence from Chinese Intellectuals (中朝學士書翰錄)" contains autograph letters by Kim Youngjak and Chinese intellectuals. It has ten letters for Kim Youngjak written by cheng gong shou(程恭壽), weng xue han(翁學涵), zhang bing yan(張丙炎), shao yan han(少言翰) and so on. One letter and five poems which zhao ting huang(趙廷璜) wrote to the son of Kim are also contained. The letters by zhao ting huang(趙廷璜) shows a sincere friendship with Kim Youngjak. The relationship between li bo heng(李伯衡) (who had exchanged letters with Kim for 30 years) and his son li wen yuan(李文源) proves that the cultural interchange between Korea and China had lasted successively. Kim Youngjak has not been widely known in academic circles yet but should not be ignored for the study in the cultural interchange between Korea and China. He proposed to have a relationship with li bo heng(李伯衡) and shuai fang wei(帥方蔚) first and they sent back positively. Therefore, they had a literal and private relationship by only exchanging letters each other. Also considering the fact that Kim Youngjak, as a man of high birth, had a close relationship with Chinese intelletuals, we can notice that Chinese and Korean intellectuals had open minds based on sincerity and trust. This was possible because many intellectuals before him like Hong Daeyong made a basis of the tradition of companionship. At this point, the relationship between Kim Youngjak and Chinese intellectuals and "A Collector of Correspondence from Chinese Intellectuals (中朝學士書翰錄)" have an important value. The collections of Kim Youngjak's works contain only several letters and poems which he sent to Chinese intellectuals. Accordingly, the letters in "A Collector of Correspondence from Chinese Intellectuals (中朝學士書翰錄)" are important to understand the aspects of their interchange.