Search | Korea Science

A Study on Classification into Hangeul and Hanja in Text Area of Printed Document (인쇄체 문서의 문자영역에서 한글과 한자의 구별에 관한 연구)

심상원;이성범;남궁재찬
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.18 no.6
- /
- pp.802-814
- /
- 1993
This paper propose an algorithm for preprocessing of character recognition, which classify characters into Hangeul and Hanja. In this study, we use the 9 structural chacteristics of Hanja which isn't affected by deformation of size and style of characters and rates based on character size to classify characters. Firstly, we process the blocking to segment each characters. Secondly, on this segmented characters, we apply algorithm proposed in this paper to classify Hangeul and Hanja. Finally, we classify characters into Hangeul and Hanja, respectively. An experiment with 2350 Hangeul and 4888 Hanja printed Gothic and Mincho style of KS-C 5601 are carried out. We experiment on typeface sample book, newspapers, academic society's papers, magazines, textbooks and documents written out word processor to obtain the classifying rates of 98.8%, 92%, 96%, 98% and 98%, respectively.
PDF

Support on Ideograph Characters Search of Unicode Based Information System (정보 시스템의 유니코드 기반 한자 검색 지원)

Yoon, So-Young
- Journal of the Korean Society for information Management
- /
- v.24 no.4
- /
- pp.375-391
- /
- 2007
Unicode Han ideograph character set differed from the our principle of the phonetic value ordering in that it followed the principle of KangXi radical-stroke ordering of the characters. Therefore, information system should support ideograph search on precise analysis of materials which consist of korean character (hangul) and ideograph character (hanja). History Information system has been maintaining Hanja(Chinese Character) to Hangul Dictionary, Terminology Dictionary for composition, borrowing, non-ideographic principles, Variant Forms Dictionary, and Recently discovered Chinese Characters List.
https://doi.org/10.3743/KOSIM.2007.24.4.375 인용 PDF

A Study on CC Processor for NAVTEX System

Fuwen Pang;Wenli Sun;Se-Mo Chung;Tchang-Hee Hong
- Journal of the Korean Institute of Navigation
- /
- v.22 no.3
- /
- pp.1-8
- /
- 1998
한자는 복잡한 상형문자로서 자모문자와 비교하여 볼 때, 문자의 처리, 프로그램의 작성 및 전송에 있어서 많은 차이점이 있다. 지금까지 NAVTEX프로그램중 한자 전송의 관건적인 기술을 해결하지 못하고 있다. 따라서, 본 논문에서는 NAVTEX프로그램중의 한자처리 방법을 모색 하고자 하였으며, 이러한 프로그램을 이용하면 NAVTEX 단말기에서 바로 한자를 인쇄할 수 있게 될 것으로 기대된다.
PDF

A Processing Method on Telegraphic Code for Chinese NAVTEX Receiver (중국어 NAVTEX 수신기를 위한 Telegraphic Code 처리방법)

Fuwen pang;Wenli Sun;Tchang-Hee Hong
- Journal of the Korean Institute of Navigation
- /
- v.24 no.4
- /
- pp.313-317
- /
- 2000
한자는 복잡한 상형문자로서 표음문자와 비교하여 볼 때, 문자의 처리, 프로그램의 작성 및 전송에 있어서 많은 차이점이 있다. 지금까지 NAVTEX 프로그램중 한자 전송의 핵심적인 기술을 해결하지 못하고 있다. 따라서, 본 논문에서는 NAVTEX 프로그램중 한자처리 방법을 모색을 하고자 하였으며, 이러한 프로그램을 이용하면 NAVTEX단말기에서 바로 한자를 인쇄할 수 있게 될 것으로 기대된다.
PDF

Consideration of CJK Joint Hanja Unicode when is used in AMI/HDB-3 Line Coding (AMI/HDB-3 회선부호화와 한·중·일 한자 유니코드 체계 고찰)

Tai, Dong-Zhen;Hong, Wan Pyo
- The Journal of the Korea institute of electronic communication sciences
- /
- v.8 no.7
- /
- pp.1011-1015
- /
- 2013
This paper analyses the violation rate of CJK joint Chines character Unicode to the source code rule. In the paper, Chinese character 150ea in Chinese Unicode which have relatively a higher frequency in use of a character was chosen to study. The frequency rate in use of the 150ea characters is about 50% of the total frequency rate of the Chinese characters. The study was applied the AMI/HDB-3 line coding/scrambling and HDLC protocol, According to the analyses, the number of violated characters were 77ea of 150 ea, frequency rate in use 29%. Therefore, when the violated 77ea characters are replaced to the matched character codes to the source coding rule, the processing rate of the line coder can be improved about 37%.
https://doi.org/10.13067/JKIECS.2013.8.7.1011 인용 PDF KSCI

Recognition of Handwriting Chinese Characters Based on DP matching (DP 정합을 이용한 필기체 한자 인식)

전상엽;권희용
- Proceedings of the Korea Multimedia Society Conference
- /
- 2004.05a
- /
- pp.285-288
- /
- 2004
온라인 필기체 한자는 동일인의 동일 문자조차도 회수, 획순 및 형태의 변화가 다양할 뿐만 아니라 인식 대상이 방대하여 인식이 매우 어렵다. 또한 한자는 기본 자소의 조합에 의한 글자가 아닌 각각의 글자가 독립적으로 이루어져 있어 연속된 획들 간의 관련도를 파악하기 어렵고 획수도 1획에서 28획까지 다양하게 분포를 한다. 따라서 본 연구에서는 대분류 단계로 시작획 비교를 하고 이어진 세분류 단계에서 문자의 특징으로 방향코드와 특이점을 추출해내고 획수를 고려하여 DP 정합을 하는 2단계 인식 시스템을 제안하였다. 이로써 최적의 속도로 입력한 문자를 찾아낼 수 있도록 하였다.
PDF

High Performance Recognition System for Chinese Character (고성능 한자 인식 시스템)

An, Seong-Ok;Ju, Gi-Ho
- The Journal of Engineering Research
- /
- v.1 no.1
- /
- pp.59-64
- /
- 1997
More than 2,000 different chinese characters are used daily in Korea newspapers and publications. The large repertoire of character pattern are the main difficulties when machine recognition of chinese characters is concerned. The goal of this paper is to conceive, evaluate and refine techniques for high performance Chinese character recognition. A new character classifier was being developed using prototype creation method.
PDF

A Chinese Character(Hanja) Input System Based on Unicode 3.0 (유니코드 3.0 한자 입력시스템)

윤지헌;변정용
- Proceedings of the Korean Information Science Society Conference
- /
- 2000.04b
- /
- pp.375-377
- /
- 2000
인터넷의 급속한 보급은 인간 생활의 많은 부분을 바꾸어 놓고있는데, 가장 대표적인 예로 전자상거래와 온라인 문서를 들 수 있다. 전자상거래와 온라인 문서는 과거 자국의 문자위주 PC통신상에서만 이루어지고 있었지만 현재는 대부분이 인터넷과 연동되어있다. 따라서 전자상거래와 온라인 문서 등을 전세계 사람들이 이용하기 위해서 만국 공통의 코드가 필요하게 되었다. 이러한 요구로 ISO10646 코드가 제정되고 발전하여 현재의 유니코드 3.0에 이르게 되었다. 유니코드 3.0에는 세계각국의 문자가 포함되어있고, 한국, 중국, 일본 등 한자문화권에서 공통적으로 많이 사용하는 한자 2만 7천여자도 포함되어있다. 이것은 과거 국내 표준인 완성형 한자 4천 8백여자와 비교하면 무척 많은 양이라 할 수 있다. 이러한 유니코드의 출현으로 국내외의 고문헌과 법전 등의 한자가 포함된 각종 문서를 인터넷상에서 제공할 수 있지만, 현재 유니코드 한자를 입력하기위한 방법은 MS Word2000의 한자 입력기만 있고 다른 운영체제나 인터넷 환경에서는 거의 전무한 상태이다. 본 논문에서는 운영체제에 독립적으로 작동하는 유니코드 한자입력시스템에 관하여 연구 개발하였다.
PDF

A Study on the Printed Korean and Chinese Character Recognition (인쇄체 한글 및 한자의 인식에 관한 연구)

김정우;이세행
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.17 no.11
- /
- pp.1175-1184
- /
- 1992
A new classification method and recognition algorithms for printed Korean and Chinese character is studied for Korean text which contains both Korean and Chinese characters. The proposed method utilizes structural features of the vertical and horizontal vowel in Korean character. Korean characters are classified into 6 groups. Vowel and consonant are separated by means of different vowel extraction methods applied to each group. Time consuming thinning process is excluded. A modified crossing distance feature is measured to recognize extracted consonant. For Chinese character, an average of stroke crossing number is calculated on every characters, which allows the characters to be classified into several groups. A recognition process is then followed in terms of the stroke crossing number and the black dot rate of character. Classification between Korean and Chinese character was at the rate of 90.5%, and classification rate of Ming-style 2512 Korean characters was 90.0%. The recognition algorithm was applied on 1278 characters. The recognition rate was 92.2%. The densest class after classification of 4585 Chinese characters was found to contain only 124 characters, only 1/40 of total numbers. The recognition rate was 89.2%.
PDF

"삼국시대의 한자음" 펴낸 유창균 교수

Gang, Cheol-Ju
- The Korean Publising Journal, Monthly
- /
- s.80
- /
- pp.4-5
- /
- 1991
훈민정음 창제 이전 우리 문자생활의 주요한 도구였던 한자가 우리나라에 언제쯤 도입됐으며, 한자음이 우리말로 토착음화하는 과정으 어떠했는가에 논의의 초점을 두고 있는 이 책은, 고대한자음에 관련된 이제까지의 학계 통설에 대한 가장 본격적인 비판을 담은 성실한 연구성과라는 점에서 특히 주목을 끈다.
PDF

Search Result 68, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)