• Title/Summary/Keyword: Chinese characters (한자 문자)


A New Korean Search Pattern of the Operator LIKE (연산자 LIKE의 새로운 한글 탐색 패턴)

  • Park, Sung-Chul;Roh, Eun-Hyang;Park, Young-Chul;Park, Jong-Cheol
    • Journal of KIISE:Databases / v.34 no.3 / pp.244-260 / 2007
  • The operator LIKE of the database language SQL is a string pattern search operator. Given a string pattern, the operator identifies the column values that match it. As a phonetic symbol, each Korean syllable is composed either of a leading sound and a medial sound or of a leading sound, a medial sound, and a trailing sound. In addition to the traditional Korean search pattern of the operator LIKE, this paper proposes a new search pattern based on the leading sounds and medial sounds of Korean. With the new Korean search pattern, Korean syllables having specific leading sounds, specific medial sounds, or both can be found. Formulating predicates equivalent to the new Korean search pattern with existing SQL operators is cumbersome and can cause portability problems for applications, depending on the underlying character set of the DBMS. This paper presents algorithms for executing the operator LIKE with the new Korean search pattern, based on the characters represented in KS X 1001, a Korean standard code for information interchange of Korean and Chinese.
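The idea of matching on leading and medial sounds can be illustrated with a minimal Python sketch. It assumes Unicode precomposed Hangul syllables rather than the KS X 1001 code the paper targets, and the names `decompose`, `syllable_matches`, and `like_leading_medial` are hypothetical; this is not the paper's algorithm or any DBMS API.

```python
# Jamo tables in the order Unicode uses for precomposed syllables (U+AC00..U+D7A3).
LEADS = "ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ"        # 19 leading sounds
MEDIALS = "ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ"  # 21 medial sounds
TRAIL_SLOTS = 28                                   # 27 trailing sounds + "none"


def decompose(ch):
    """Return (leading, medial) for a Hangul syllable, or None for anything else."""
    index = ord(ch) - 0xAC00
    if not 0 <= index < len(LEADS) * len(MEDIALS) * TRAIL_SLOTS:
        return None
    per_lead = len(MEDIALS) * TRAIL_SLOTS          # 588 syllables per leading sound
    return LEADS[index // per_lead], MEDIALS[(index % per_lead) // TRAIL_SLOTS]


def syllable_matches(ch, lead=None, medial=None):
    """True if ch is a syllable with the requested leading and/or medial sound."""
    parts = decompose(ch)
    if parts is None:
        return False
    return (lead is None or parts[0] == lead) and (medial is None or parts[1] == medial)


def like_leading_medial(values, lead=None, medial=None):
    """Keep the values that contain at least one matching syllable (hypothetical helper)."""
    return [v for v in values if any(syllable_matches(ch, lead, medial) for ch in v)]
```

For example, `like_leading_medial(["한글", "문자", "SQL"], lead="ㅎ")` keeps only `"한글"`, while `medial="ㅏ"` keeps both `"한글"` and `"문자"`.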

DaHae: Japanese Morphological Analyzer for Japanese to Korean Machine Translation (DaHae: 일한 기계번역을 위한 일본어 형태소 분석기)

  • Yuh, Sang-Hwa;Jung, Han-Min;Chang, Won;Kim, Tae-Wan;Hwang, Do-Sam;Park, Dong-In
    • Annual Conference on Human and Language Technology / 1995.10a / pp.195-207 / 1995
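  • Japanese uses several kinds of characters, including kanji, hiragana, and katakana, and mixes them so heavily that text stays readable without spaces between words. In existing electronic dictionaries such as the ICOT dictionary, the EDR dictionary, and the ATLAS I/JK dictionary, headwords of mixed character types (excluding kanji+hiragana headwords) account for only 8.8% on average. A change of character type within a sentence can therefore serve as a delimiter between words. The system places a preprocessor before morphological analysis; it splits the input into fragments using character type information, handles exception words and fixed expressions, and proposes a morphological analysis method for each fragment. The morphological analyzer takes the preprocessor's output, analyzes each fragment with the method the preprocessor proposed, and extracts all possible analyses of the input sentence. This approach reduces unnecessary dictionary lookups and connectivity checks, which improves analysis performance.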
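Splitting on character-type changes can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions: the Unicode block ranges stand in for the paper's character type information, the names `char_type` and `split_fragments` are hypothetical, and the sketch splits on every type change, whereas DaHae's preprocessor also handles exception words and fixed expressions.

```python
from itertools import groupby


def char_type(ch):
    """Classify one character by Unicode block (a simplification)."""
    code = ord(ch)
    if 0x3040 <= code <= 0x309F:
        return "hiragana"
    if 0x30A0 <= code <= 0x30FF:
        return "katakana"
    if 0x4E00 <= code <= 0x9FFF:
        return "kanji"
    return "other"


def split_fragments(sentence):
    """Group consecutive characters of the same type into (type, text) fragments."""
    return [(kind, "".join(chars)) for kind, chars in groupby(sentence, key=char_type)]
```

Calling `split_fragments("私はコンピュータを使う")` yields six fragments, with the katakana run `コンピュータ` kept together as one fragment.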


A Study on the Feature Extraction of Strokes using the Maximum Block Methode (최대 블록화 방법을 이용한 묵자획 특징 추출에 관한 연구)

  • Kim, Ui-Jeong;Kim, Tae-Gyun
    • The Transactions of the Korea Information Processing Society / v.4 no.4 / pp.1141-1151 / 1997
  • This paper proposes the Maximum Block Method for extracting stroke features from off-line Chinese characters. The Maximum Block Method enlarges a block starting from the first pixel found and extracts the skeleton and features of the input characters. It is well suited to accurate feature extraction because existing thinning methods make feature extraction difficult owing to distortions caused by partial noise, inflection points, and blemishes. Printed outputs, Chinese books of middle and high school students, and other materials were used for the test. The Maximum Block Method was also found to be effective for extracting skeleton lines and features, a preprocessing step of pattern recognition, for Korean characters and English as well as Chinese characters.


An Adaptive Binarization Algorithm for Degraded Document Images (저화질 문서영상들을 위한 적응적 이진화 알고리즘)

  • Ju, Jae-Hyon;Oh, Jeong-Su
    • The Journal of Korean Institute of Communications and Information Sciences / v.37 no.7A / pp.581-585 / 2012
  • This paper proposes an adaptive binarization algorithm that is highly effective for degraded document images containing printed Hangul and Chinese characters. Because these characters are composed of thin horizontal strokes and thick vertical strokes, conventional algorithms cannot easily extract the horizontal strokes, which have weaker components than the vertical ones in a degraded document image. The proposed algorithm solves this problem by adding a vertical-directional reference adaptive binarization algorithm to an omni-directional reference one. Simulation results show that the proposed algorithm extracts characters well from various degraded document images.
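The abstract gives no window shapes or thresholds, so the following Python sketch is only one possible reading of combining the two references. It assumes a plain local-mean threshold as a stand-in: a square window plays the omni-directional reference and a one-pixel-wide vertical window plays the vertical-directional reference. The names `window_mean` and `binarize` and the parameters `half` and `offset` are hypothetical.

```python
def window_mean(img, r, c, half_h, half_w):
    """Mean gray level in a window centered at (r, c), clipped to the image."""
    rows = range(max(0, r - half_h), min(len(img), r + half_h + 1))
    cols = range(max(0, c - half_w), min(len(img[0]), c + half_w + 1))
    values = [img[i][j] for i in rows for j in cols]
    return sum(values) / len(values)


def binarize(img, half=2, offset=10, use_vertical=True):
    """Mark a pixel as text (1) if it is darker than a local reference by `offset`.

    Assumption: dark text on a light background, gray levels 0..255.
    """
    out = []
    for r, row in enumerate(img):
        out_row = []
        for c, value in enumerate(row):
            # Square window: reference taken from all directions.
            is_text = value < window_mean(img, r, c, half, half) - offset
            if use_vertical and not is_text:
                # Tall one-pixel-wide window: reference taken only from above and below,
                # so a nearby thick vertical stroke does not drag the reference down.
                is_text = value < window_mean(img, r, c, half, 0) - offset
            out_row.append(1 if is_text else 0)
        out.append(out_row)
    return out
```

In a small test image where a faint horizontal stroke (gray 170) runs into a dark vertical stroke (gray 50) on a light background (gray 200), the square window alone misses the faint pixel next to the vertical stroke, and the added vertical window recovers it.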

Grapheme Segmentation Method for Low Quality Printed Hangul Text Recognition (저해상도 인쇄체 한글 영상 인식을 위한 자소 분할 방법)

  • Lee Seong-Hun;Cho Kyu-Tae;Kim Jin-Sik;Kim Jin-Hyung;Jung Cheol-Kon;Kim Sang-Kyun;Moon Young-Su;Kim Ji-Yeun
    • Proceedings of the Korean Information Science Society Conference / 2006.06b / pp.382-384 / 2006
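  • This paper proposes a method for segmenting low-resolution Hangul images into graphemes. In video captions and low-resolution scanned images, strokes of adjacent graphemes touch and the images contain much noise, so existing grapheme segmentation methods reach their limits. The nonlinear segmentation path algorithm, previously used to split Chinese character strings into individual characters, is applied to single Hangul character images to split them into graphemes. To apply the existing segmentation path algorithm effectively to Hangul grapheme segmentation, a dominant point detection algorithm finds the contact points between graphemes, and several candidate grapheme images are generated along the segmentation paths built from those points. Experiments confirmed a high recognition rate when the grapheme images were recognized by a grapheme recognizer.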


Implementation of Industrial Information Display System (산업용 정보표시 시스템 구현)

  • Kim, Whi-Young;Hong, Jung-Hwan;Gang, Uk;Park, Seong-Jun;Kim, Hee-Je
    • Proceedings of the KIEE Conference / 2001.07d / pp.2048-2050 / 2001
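  • This work implements a message display device built on a 40 mm three-color LED dot matrix module with excellent luminance and stability, as a replacement for the information displays used in production-management status boards, fault indicators on industrial panels, machine status displays, elevator information displays, parking-tower guidance signs, bus destination signs, and advertising in hospitals, banks, and similar places. The device is attached to panels and machinery or installed standalone, takes input over a parallel or serial port, and outputs the message corresponding to that input. Messages come in graphic and text forms created by the user; the text message is selected by external input, and the output form is driven by a user-written program. Besides English letters and digits, Hangul and Chinese characters can also be displayed. The characters are larger and more attractive than on a 5×7 LED dot matrix, which improves awareness of the situation on site. The device is easier for ordinary users to program than hex code entry in ASCII and KS-5601, and a mode programming scheme lets users compose messages as graphic symbols or text and select the output form, which improves ease of use. A comparative review confirmed its convenience in use.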


A Study on the Graphic Contents of Hyuk-Wha in the late Chosun Dynasty (조선후기 혁화의 그래픽 콘텐츠 연구)

  • 이명구;남인복
    • Archives of design research / v.16 no.4 / pp.37-46 / 2003
  • Around the 18th century, in the late Chosun dynasty, various kinds of 'Min-Wha' played a significant role and held important meaning in people's lives. As a result, 'Min-Wha' diversified in both material and technique and was mass-produced at that time. Against this background, 'Hyuk-Wha' is considered a unique style of expression. In technique, 'Hyuk-Wha' originated from 'Bibaekseo', which is classified as a style of expression in Oriental drawing and writing art. In substance, 'Hyuk-Wha' is visually differentiated from rough 'Bibaekseo' written with a brush made from the skin of a willow tree or the stem of a kind of reed. In mode, 'Hyuk-Wha' is closely related to the development of 'Min-Wha'. From this point of view, 'Hyuk-Wha' has a deep relationship with the Taoist character painting of 'Gilsang' (an auspicious sign) and the Confucian character painting of 'Hyojae' (filial piety). Accordingly, 'Hyuk-Wha' developed into a character painting with its own creative differentiation. For these reasons, 'Hyuk-Wha', which shapes and carries the meanings of Chinese characters, is also regarded as related to pictography as applied to word marks and brand logotypes in graphic design. 'Hyuk-Wha', once prevalent as home decoration, has disappeared from that use with the appearance of all sorts of decorative articles. 'Hyuk-Wha', which diversified as a part of 'Min-Wha' and developed together with Oriental drawing and writing art and character painting, needs to be brought back to light. 'Hyuk-Wha', which is also in active practical use in Western Europe, deserves to be reconsidered.


A Study on the Characteristics and Preference of the Symbol Mark Modeling Performance in Chinese Regional History Museums (중국 지역 역사 박물관 심벌마크의 조형적 표현 특징 및 선호도 연구)

  • Zeng, Long;Park, Yong-Jin
    • The Journal of the Korea Contents Association / v.22 no.10 / pp.225-238 / 2022
  • The purpose of this study is to explore the performance characteristics and laws of the symbol mark design of representative regional history museums in China, as well as the preferences of Chinese audiences for the symbol marks of different types of Chinese regional history museums. First, the performance theme, performance type, and type performance tendency of symbol mark modeling of the regional history museums among the top 100 museums in China are analyzed. Second, design laws based on the interrelationship of performance theme types and design performance types are explored. Finally, a questionnaire survey is carried out to explore preference in terms of attention, readability, closeness, originality, aesthetics, and comprehensiveness. According to the results, regional history is the most common theme. Among the modeling performance types, the concrete type and the visualization of Chinese characters are the most common. According to the content characteristics of the different performance types, the following model characteristics are formed: expressing the themes of regional history, architecture, and the regional natural ecological environment through the concrete type, expressing the concept through the abstract type, and expressing the concept and implying some building features through the geometric abstract figure. The three forms (the literal type, the concrete type expressing architecture, regional history, and regional natural ecological environment themes, and the abstract type expressing the concept) are combined with each other and expressed, within the mixture type, through the visualization of characters, the mixture of abstract and literal types, the mixture of concrete and abstract types, and the mixture of concrete and abstract literal types. According to the survey results, Chinese audiences have a higher preference for the concrete type among symbol mark performance types and for the regional historical theme in performance content.

A Hangul Script Matching Algorithm for PDA (PDA상에서의 한글 필기체 매칭 알고리즘)

  • Cho, Mi-Gyung;Cho, Hwan-Gue
    • Journal of KIISE:Software and Applications / v.29 no.10 / pp.684-693 / 2002
  • Electronic ink is data stored in the form of handwritten text or script, without conversion into ASCII by handwriting recognition, on pen-based computers and Personal Digital Assistants (PDAs) to support natural and convenient data input. One of the most important issues is searching the electronic ink in order to use it. We propose and implement a script matching algorithm for electronic ink. The proposed matching algorithm separates the input stroke into a set of primitive strokes using the curvature of the stroke curve. After determining the type of each separated stroke, it produces a stroke feature vector. It then calculates the distance between the stroke feature vector of the input strokes and that of strokes in the database using dynamic programming. In various experiments, the algorithm showed a high matching rate of over 97.7% for Korean-only script and 94% for data mixing Korean with Chinese characters.
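The dynamic-programming distance step can be sketched as follows. The abstract does not specify the feature vector or the cost function, so this Python sketch swaps in a plain edit distance over sequences of primitive-stroke type labels; the labels (`"H"`, `"V"`, `"D"`, `"C"`), the costs, and the names `stroke_distance` and `best_match` are all hypothetical.

```python
def stroke_distance(a, b, sub_cost=1.0, gap_cost=1.0):
    """Edit distance between two sequences of primitive-stroke type labels."""
    n, m = len(a), len(b)
    # d[i][j] = cheapest way to align the first i labels of a with the first j of b.
    d = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        d[i][0] = i * gap_cost
    for j in range(1, m + 1):
        d[0][j] = j * gap_cost
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            match = 0.0 if a[i - 1] == b[j - 1] else sub_cost
            d[i][j] = min(
                d[i - 1][j] + gap_cost,      # skip a label in a
                d[i][j - 1] + gap_cost,      # skip a label in b
                d[i - 1][j - 1] + match,     # align the two labels
            )
    return d[n][m]


def best_match(query, database):
    """Return the key of the stored script closest to the query (hypothetical helper)."""
    return min(database, key=lambda key: stroke_distance(query, database[key]))
```

With a database such as `{"a": ["H", "V", "C"], "b": ["D", "D"]}`, the query `["H", "V"]` is closest to entry `"a"`, at distance 1.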

A historical study on the flexibility square-format typeface and the prospects - Focused on the three-pairs fonts of hangeul - (탈네모글꼴에 관한 역사적 연구와 전망 - 세벌식 한글 글꼴을 중심으로 -)

  • Yu, Jeong-Mi
    • Archives of design research / v.19 no.2 s.64 / pp.241-250 / 2006
  • Hangeul, the unique Korean script, was invented according to character-making principles and based on scholars' exhaustive research. While most of the world's scripts evolved naturally, Hangeul was invented based on a precise linguistic analysis of the time, and it is therefore the most scientific and rational among the world's scripts. Nevertheless, Hangeul typeface designs do not seem to inherit this scientific and rational ideology correctly, because square forms have been used intact under the influence of the Chinese characters that prevailed at the time. Designing a single set of square characters requires as many as 11,172 glyphs, which suggests that the advantages of Hangeul are not fully used; Hangeul was invented to visualize every sound through combinations of 28 vowels and consonants. Problems with such square fonts began to be identified in the 1900s, when typewriters were first introduced from the West. Since a typewriter is designed with 28 characters laid out on its keyboard using such combinations, the letters can easily be combined on it. The so-called flexibility square-format typeface was born this way. In particular, the three-pairs fonts among these can combine up to 67 letters including vowels and consonants. The three-pairs font system can help solve the problems arising from conventional square fonts and inherit the original ideology of Hangeul's invention. This study reviews the history of three-pairs font designs facilitated by the mechanical encoding of Hangeul and suggests desirable directions for future Hangeul fonts. Since the flexibility square-format typeface is expected to keep evolving with the development of digital technology, it would serve our information age in terms of both function and convenience. Just as Hunminjongum tried to be literally independent from Chinese characters, flexibility square-format typeface designs would serve to recover the identity of our Hangeul font designs.
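The figure of 11,172 can be checked with a little arithmetic. The Python sketch below assumes the modern Unicode arrangement of Hangul (19 leading consonants, 21 vowels, and 27 trailing consonants plus the "no trailing consonant" case), which the abstract does not spell out; reading the 67 components of a three-set font as 19 + 21 + 27 is likewise an assumption, and `compose` is a hypothetical helper name.

```python
LEAD_COUNT = 19      # leading consonants
MEDIAL_COUNT = 21    # vowels
TRAIL_COUNT = 27     # trailing consonants (a syllable may also have none)

# Square-format design: one glyph per complete syllable.
SQUARE_GLYPHS = LEAD_COUNT * MEDIAL_COUNT * (TRAIL_COUNT + 1)

# Three-set design: one component per leading, medial, and trailing sound.
THREE_SET_COMPONENTS = LEAD_COUNT + MEDIAL_COUNT + TRAIL_COUNT


def compose(lead, medial, trail=0):
    """Build a precomposed syllable from 0-based indices (trail 0 means none)."""
    offset = (lead * MEDIAL_COUNT + medial) * (TRAIL_COUNT + 1) + trail
    return chr(0xAC00 + offset)
```

Here `SQUARE_GLYPHS` evaluates to 11172 and `THREE_SET_COMPONENTS` to 67, and `compose(18, 0, 4)` builds the syllable `한` from its three parts.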
