A Rule-Based Analysis from Raw Korean Text to Morphologically Annotated Corpora

Lee, Ki-Yong;Markus Schulze;

한국언어정보학회지:언어와정보 (Language and Information)

제6권2호
/
Pages.105-128
/
2002
/
1226-7430(pISSN)

한국언어정보학회 (Korean Society for Language and Information)

A Rule-Based Analysis from Raw Korean Text to Morphologically Annotated Corpora

Lee, Ki-Yong (Korea University) ;
Markus Schulze (Universitat Erlangen-Nurnberg)

발행 : 2002.12.01

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

Morphologically annotated corpora are the basis for many tasks of computational linguistics. Most current approaches use statistically driven methods of morphological analysis, that provide just POS-tags. While this is sufficient for some applications, a rule-based full morphological analysis also yielding lemmatization and segmentation is needed for many others. This work thus aims at 〔1〕 introducing a rule-based Korean morphological analyzer called Kormoran based on the principle of linearity that prohibits any combination of left-to-right or right-to-left analysis or backtracking and then at 〔2〕 showing how it on be used as a POS-tagger by adopting an ordinary technique of preprocessing and also by filtering out irrelevant morpho-syntactic information in analyzed feature structures. It is shown that, besides providing a basis for subsequent syntactic or semantic processing, full morphological analyzers like Kormoran have the greater power of resolving ambiguities than simple POS-taggers. The focus of our present analysis is on Korean text.

한국언어정보학회지:언어와정보 (Language and Information)

A Rule-Based Analysis from Raw Korean Text to Morphologically Annotated Corpora

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)