한국언어정보학회:학술대회논문집 (Proceedings of the Korean Society for Language and Information Conference)
- 한국언어정보학회 2002년도 Language, Information, and Computation Proceedings of The 16th Pacific Asia Conference
- /
- Pages.69-78
- /
- 2002
Penn Korean Treebank: Development and Evaluation
- Han, Chung-hye (Dept. of Linguistics, Simon Fraser University, 8888 University Drive, Burnaby BC V5A 156, Canada) ;
- Han, Na-Rae (Dept. of Linguistics, University of Pennsylvania, 619 Williams Hall, Philadelphia, PA 19104, USA) ;
- Ko, Eon-Suk (Dept. of Linguistics, University of Pennsylvania, 619 Williams Hall, Philadelphia, PA 19104, USA) ;
- Martha Palmer (Dept, of Computer Information and Science, University of Pennsylvani, 256 Moore School, Philadephia, PA 19104, USA) ;
- Heejong Yi (Dept. of Lingistics, University of Delaware, 46E. Delaware Ave., Newark, DE 19716,7SA)
- 발행 : 2002.02.01
초록
This paper discusses issues in building a 54-thousand-word Korean Treebank using a phrase structure annotation, along with developing annotation guidelines based on the morpho-syntactic phenomena represented in the corpus. Various methods that were employed for quality control and the evaluation on the Treebank are also presented.
키워드