• Title/Summary/Keyword: Unicode

Search Result 67, Processing Time 0.024 seconds

Compatibility of UTF-8 Encoding System to HDB-3 Scrambling Method (Unicode의 UTF-8 부호화 방식의 HDB-3 스크램블링 방식과의 적합성)

  • Hong, Wan-Pyo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.8 no.2
    • /
    • pp.227-240
    • /
    • 2013
  • This paper studied how much influence UTF coded data is given to HDB-3 operation efficiency. This paper applied the Hangul Jamo and Compatability Hangul Jamo in Unicode to compare to its UTF-8 coded data. As a result of study, When Unicode is reformatted to UTF-8 code, the data code run counter to the source coding rule was very much increased.

A Development of Unicode-based Multi-lingual Namecard Recognizer (Unicode 기반 다국어 명함인식기 개발)

  • Jang, Dong-Hyeub;Lee, Jae-Hong
    • The KIPS Transactions:PartB
    • /
    • v.16B no.2
    • /
    • pp.117-122
    • /
    • 2009
  • We developed a multi-lingual namecard recognizer for building up a global client management systems. At first, we created the Unicode-based character image database for character recognition and learning of multi languages, and applied many color image processing techniques to get more correct data for namecard images which were acquired by various input devices. And by applying multi-layer perceptron neural network, individual character recognition applied for language types, and post-processing utilizing keyword databases made for individual languages, we increased a recognition rate for multi-lingual namecards.

A study on Code System of Latin Character to Improve Transmission Efficiency in Data Communications (데이터통신 전송효율과 라틴어 부호 체계 고찰)

  • Hong, Wan-Pyo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.7 no.4
    • /
    • pp.761-776
    • /
    • 2012
  • This paper proposes the revised Roman character code system using Unicode 3.0. The background of the paper is whether the Latin character code system th using in the world in Unicode V.3 is proper or not in the side of the transmission efficiency in data communications. In data communications, when the consecutive 4 bits or 8 bits of "0" bit from the information devices input into the line coder, its consecutive "0" bits are scrambled to the predetermined bit patterns to avoid the syncronization loss. The paper was based on the statistical data for the using frequency of the alphabet letter and the proposed rule for characters coding in [1]. The paper was focused to improve of Unicode itself and UTF-8 code system. As a result of the paper, when the proposed coding systems for Latin character in Unicode 3.0 itself and UTF-8 code system, the scrambler efficiency using HDB-3 in the line coder of the data transmission system could be improved about 3645 ~ 31400% and 480 ~ 1700% respectively.

Correction for Hangul Normalization in Unicode (유니코드 환경에서의 올바른 한글 정규화를 위한 수정 방안)

  • Ahn, Dae-Hyuk;Park, Young-Bae
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.2
    • /
    • pp.169-177
    • /
    • 2007
  • Hangul text normalization in current Unicode makes wrong Hangul syllable problems when using with precomposed modern Hangul syllables and composing old Hangul by using conjoining-Hangul Jamo and compatibility Hangul Jamo. This problem comes from allowing incorrect normalization form of compatibility Hangul Jamo and Hangul Symbol and also permitting to use conjoining-Hangul Jamo mixture with precomposed Hangul syllable in Unicode Hangul composing rule. It is caused by lack of consideration of old Hangul and/or insufficient understanding of Hangul code processing when writing specification for normalization forms in Unicode. Therefore on this paper, we study Hangul code in Unicode environment, specifically problems of normalization used for Web and XML, IDN in nowadays. Also we propose modification of Hangul normalization methods and Hangul composing rules for correct processing of Hangul normalization in Unicode.

Problems with Chinese Ideographs Search in Unicode and Solutions to Them (유니코드 한자 검색의 문제점 및 개선방안)

  • Lee, Jeong-hyeon
    • Informatization Policy
    • /
    • v.19 no.3
    • /
    • pp.50-63
    • /
    • 2012
  • This thesis is designed to analyze how the search for Chinese ideographs is done in Koreanology-related domestic databases, domestic library databases, domestic academic databases, and overseas library databases, with a view to identifying problems and suggesting solutions to them. The major reasons that impede Chinese ideographs search in Unicode are classified as 'multicode characters', 'simplified characters', and 'variant characters', and three characters are chosen as samples to describe the current practice. Thirteen Koreanology-related databases, five domestic library databases, five domestic academic databases and two overseas library databases are analyzed in terms of Chinese ideographs search. To support search for multicode characters, the open source of the Unicode consortium must be applied. To improve search for simplified and variant characters, a matching table must be standardized and proposed to the Unicode consortium.

  • PDF

A Phonological Process Code in Unicode for the Standardization of Hangul(Korean Alphabet) Pronunciation

  • Min, Cheong-Sick
    • Journal of the Korea Society of Computer and Information
    • /
    • v.20 no.12
    • /
    • pp.145-152
    • /
    • 2015
  • In this paper, a standardized phonological rule should be embodied for all people who want to use Hangul beyond nationality. This researcher suggested a phonological process protocol in unicode for the purpose of developing a computer system-based program as one of methods to pronounce Hangul correctly. First of all, to solve a problem of standardizing Hangul pronunciation, it is necessary to develop a phonological process system. In other words, when a mobile phone or a keyboard is used, inputted strings should be changed exactly by an automatic device made according to the phonological rule. For this, a phonological process table is the most important and unicode which digitalizes Hangul is the most effective. Therefore, the purpose of this study is to complete a phonological process table, suggest an algorithm by the table, and develop a computer program pursuing correct Hangul pronunciation. For the methodology of this study, the features of unicode was used to solve a problem of coding each syllable by phonemes after strings were inputted from the phonological process program. This study suggested a protocol that sounds of inputted strings were automatically changed via recombination using a phonological process table. The ultimate purpose of the study was to help pronounce Hangul correctly according to the pronunciation rule.

Scrambling Occurrence Frequency in HDB-3 in UTF-8 Coding of UNICODE Hangul Jamo (Unicode 한글낱자의 UTF-8 부호화에 따른 HDB-3 스크램블링 발생빈도)

  • Hong, Wan-Pyo
    • Journal of Advanced Navigation Technology
    • /
    • v.19 no.2
    • /
    • pp.153-160
    • /
    • 2015
  • This paper has studied about the scrambling occurrence frequency in UTF-8 coding system for Unicode Hangul Jamo codes. The scrambling methode applied in the study is HDB-3 in AMI line coding that is international transmission standard. In the study, the source coding rule was applied to analysis the scrambling occurrence. The quantity of the scrambling occurrence was calculated by the number of times and frequency rate of the scrambling occurrence in Hangul Jamo and Compatibity Hangul Jamo. In the case of Hangul Jamo, the number of times and frequency rate in Unicode and UTF-8 were 24times, 52% and 148times, 228% respectively. In the case of Compatibility Hangul Jamo, that were 10times, 14% and 83times, 131% respectively. As a result, when Hangul Jamo and Compatibility Hangul Jamo in UNICODE were transformed to UTF-8, the scrambling frequency rates were increased 340% and 851% respectively.

A Study on the Hangul Syllables of Unicode System considering Data Transmission Efficiency (데이터전송효율을 고려한 유니코드의 한글글자마디에 대한 연구)

  • Hong, Wan-Pyo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.10 no.1
    • /
    • pp.39-46
    • /
    • 2015
  • The paper studied possibility of improvement of efficient of data processing in the line coder when Hangul syllables in Unicode system is used for the source code. The scrambling in the line coder is to solve the problem happened due to the source code. The study is based on the HDB-3 scrambling method in ITU-T standards that is applied to AMI line coder. The referred data of Hangul syllables and its use frequency which are required to analysis was used the data extracted from the source data of the National Korean Language Institute. According to the analysis, the average 24% scrambling was generated in source code of Hangul syllables in Unicode system. When the referred Hangul syllables was applied to Unicode system, the average 27% scrambling was producted. Total 8,924ea Hangul syllables in 11,172ea Hangul syllables in Unicode system were not scrambled. Therefore the referred Hangul syllables 1,540ea were accepted in the unscrambled code areas. As a result, the existing Unicode Hangul syllable codes can't prevent the scrambling, but it is possible to completely remove the 27% scrambling with new source coding system. And then, it can be improved the data processing efficient upto minimum 27% in line coder by software in presentation layer instead of physical layer.

송본(宋本) "상한론(傷寒論)"의 한중(韓中) Code 비교(比較) 비교대어송본(比較對於宋本)"상한론(傷寒論)"적한국화중국지(的韓國和中國之)Code

  • Lee, Byeong-Uk;Sin, Sang-U;Kim, Eun-Ha
    • Journal of Korean Medical classics
    • /
    • v.18 no.4 s.31
    • /
    • pp.83-92
    • /
    • 2005
  • 도금일(到今日), 동양의학재한중일삼국수저자기적특색이연구발전이래(東洋醫學在韓中日三國隨著自己的特色而硏究發展而來). 이차저한중일삼국위료동양의학적세계화(而且這韓中日三國爲了東洋醫學的世界化), 과학화부단지재진행협조화노력(科學化不斷地在進行協助和努力). 유우동양의학이경주상료세계화(由于東洋醫學已經走上了世界化), 저삼국재소용적의학술어화기유적나사문헌자료적교류수요취월래월고료(這三國在所用的醫學述語和旣有的那些文獻資料的交流需要就越來越高了). 가시재문헌교류화의학용어적표준화과정중발현료일개흔대적장애, 저개장애취시(這個障碍就是)unicode. 당연(當然)unicode시위료재국가지간사적저사정보령활교류이제정작성적(是爲了在國家之間使的這些情報靈活交流而制定作成的). 가시(可是)unicode제정지전(制定之前), 각국이경위료각각적수구이연발료적합자기적한자계통(各國已經爲了各各的需求而硏發了適合自己的漢子系統). 현재적(現在的)unicode취시용나사기유적한자(就是用那些旣有的漢子)code제정적(制定的). 유우(由于)unicode피저양제정종이산생불소계통상적모순문제(被這樣制定終而産生不少系統上的矛盾問題). 저사문제불근영향도료계통지외(這些問題不僅影響到了系統之外), 이차우조애료한의학정보적령활교류(而且又阻碍了韓醫學情報的靈活交流). 위료해결저사문제(爲了解決這些問題). 본인이상한론위연구대상래비교료일하한국화중국적한자(本人以傷寒論爲硏究對象來比較了一下韓國和中國的漢子)code차이(差異).

  • PDF

Hangul Encoding Standard based on Unicode (유니코드의 한글 인코딩 표준안)

  • Ahn, Dae-Hyuk;Park, Young-Bae
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.12
    • /
    • pp.1083-1092
    • /
    • 2007
  • In Unicode, two types of Hangul encoding schemes are currently in use, namely, the "precomposed modern Hangul syllables" model and the "conjoining Hangul characters" model. The current Unicode Hangul conjoining rules allow a precomposed Hangul syllable to be a member of a syllable which includes conjoining Hangul characters; this has resulted in a number of different Hangul encoding implementations. This unfortunate problem stems from an incomplete understanding of the Hangul writing system when the normalization and encoding schemes were originally designed. In particular, the extended use of old Hangul was not taken into consideration. As a result, there are different ways to represent Hangul syllables, and this cause problem in the processing of Hangul text, for instance in searching, comparison and sorting functions. In this paper, we discuss the problems with the normalization of current Hangul encodings, and suggest a single efficient rule to correctly process the Hangul encoding in Unicode.