Proceedings of the Korean Society for Language and Information Conference (한국언어정보학회:학술대회논문집)
- 2007.11a
- Pages 375-384
- 2007
Automatic Acquisition of Lexical-Functional Grammar Resources from a Japanese Dependency Corpus
- Oya, Masanori (National Centre for Language Technology and School of Computing, Dublin City University)
- van Genabith, Josef (National Centre for Language Technology and School of Computing, Dublin City University)
- Published : 2007.11.01
Abstract
This paper describes a method for the automatic acquisition of wide-coverage, treebank-based deep linguistic resources for Japanese, as part of a project on treebank-based induction of multilingual resources in the framework of Lexical-Functional Grammar (LFG). We automatically annotate the Kyoto Text Corpus version 4.0 (KTC4) (Kurohashi and Nagao 1997) and the output of the Kurohashi-Nagao Parser (KNP) (Kurohashi and Nagao 1998), a dependency parser for Japanese, with LFG f-structure functional equations (i.e. labelled dependencies). The original KTC4 and KNP provide only unlabelled dependencies. Our method also includes zero-pronoun identification. The f-structure annotation algorithm with zero-pronoun identification is evaluated on KTC4 against a manually corrected Gold Standard of 500 sentences randomly chosen from KTC4, and achieves a pred-only dependency f-score of 94.72%. The parsing experiments on KNP output yield a pred-only dependency f-score of 82.08%.
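As an illustration of the kind of score reported above, the following minimal sketch computes precision, recall, and f-score over sets of labelled dependency triples. The abstract does not specify the evaluation procedure, so the triple layout `(relation, head, dependent)`, the function name `dependency_f_score`, and the toy romanized predicates are assumptions made for this example only; the f-score itself is the usual harmonic mean of precision and recall.

```python
from typing import Set, Tuple

# Hypothetical triple layout: (relation label, head predicate, dependent predicate).
Triple = Tuple[str, str, str]


def dependency_f_score(gold: Set[Triple], predicted: Set[Triple]) -> Tuple[float, float, float]:
    """Return (precision, recall, f-score) for predicted triples against gold triples."""
    matched = len(gold & predicted)  # triples that agree on label, head, and dependent
    precision = matched / len(predicted) if predicted else 0.0
    recall = matched / len(gold) if gold else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    f_score = 2 * precision * recall / (precision + recall)
    return precision, recall, f_score


# Toy data (invented for illustration, not taken from KTC4).
gold = {
    ("subj", "yomu", "taro"),
    ("obj", "yomu", "hon"),
    ("adjunct", "yomu", "kinou"),
}
predicted = {
    ("subj", "yomu", "taro"),
    ("obj", "yomu", "hon"),
    ("adjunct", "hon", "kinou"),  # wrong head, so this triple does not match
}

precision, recall, f_score = dependency_f_score(gold, predicted)
print(f"precision={precision:.4f} recall={recall:.4f} f-score={f_score:.4f}")
```

In this toy case two of the three predicted triples match the gold set, so precision, recall, and f-score are all two thirds. A triple counts as correct only when the label and both predicates match, which is why attaching a dependent to the wrong head is penalised in both precision and recall.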
Keywords
- Lexical-Functional Grammar
- Japanese
- automatic linguistic resource acquisition
- zero-pronoun identification