Segmentation of Korean Compound Nouns Using Semantic Category Analysis of Unregistered Nouns

Kang Yu-Hwan; Seo Young-Hoon

Journal of Information Technology Applications and Management

Volume 11, Issue 4 / Pages 95-102 / 2004 / 1598-6284 (pISSN) / 2508-1209 (eISSN)

Korea Data Strategy Society (한국데이터전략학회)

미등록어의 의미 범주 분석을 이용한 복합명사 분해

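Kang Yu-Hwan (Department of Computer Engineering, Chungbuk National University);
Seo Young-Hoon (School of Electrical, Electronic and Computer Engineering, Chungbuk National University)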

Published : 2004.12.01

Abstract

This paper proposes a method for segmenting compound nouns that contain unregistered nouns into the correct combination of unit nouns, using the characteristics of person names, loanwords, and location names. A Korean person name generally consists of three syllables, only a relatively small number of syllables are used as last names, and the combination of the second and third syllables is somewhat restricted. In addition, many person names appear together with clue words in compound nouns. Most loanwords contain one or more syllables that cannot appear in native Korean words, or have syllable sequences that differ from those of ordinary Korean words. Location names are generally used together with clue words that designate districts in compound nouns. Using these characteristics to analyze compound nouns not only makes segmentation more accurate but also helps natural language systems use the semantic categories of the unregistered nouns. Experimental results show that the precision of our method is approximately 98% on average. The precision of person name recognition and loanword recognition is about 94% and about 92%, respectively.
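The three kinds of cues named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `classify`, the surname set, the clue-word lists, and the loanword marker syllables are all small hypothetical samples chosen for illustration, and the restriction on second and third name syllables is left out.

```python
from typing import Optional, Tuple

# All lists below are tiny illustrative assumptions, not the paper's resources.
SURNAMES = {"김", "이", "박", "최", "정", "강", "서"}
PERSON_CLUES = ("교수", "박사", "대통령", "씨")
LOCATION_CLUES = ("특별시", "광역시", "시", "군", "구", "동", "읍", "면")
# Syllables assumed here to signal a loanword (hypothetical markers).
LOAN_SYLLABLES = {"컴", "퓨", "텔", "킹", "픽"}


def classify(compound: str) -> Optional[Tuple[str, str, str]]:
    """Return (unregistered noun, clue word, category), or None if no cue fires."""
    # Person name: three syllables, known surname first, followed by a clue word.
    for clue in PERSON_CLUES:
        head = compound[: -len(clue)]
        if compound.endswith(clue) and len(head) == 3 and head[0] in SURNAMES:
            return (head, clue, "person")
    # Location name: at least two syllables followed by a district clue word.
    for clue in LOCATION_CLUES:
        head = compound[: -len(clue)]
        if compound.endswith(clue) and len(head) >= 2:
            return (head, clue, "location")
    # Loanword: contains a syllable from the assumed marker set.
    if any(syllable in LOAN_SYLLABLES for syllable in compound):
        return (compound, "", "loanword")
    return None
```

For example, under these assumed lists `classify("강유환교수")` splits the compound into the name `강유환` and the clue word `교수`, while a noun such as `학교` matches no cue and yields `None`. Suffix matching on single-syllable clue words will over-generate on real text, so a practical system would need a dictionary check before applying these rules.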
