DOI QR코드

DOI QR Code

조응성 정보와 중심화 이론에 기반한 영형 주어의 선행사 식별

Antecedent Identification of Zero Subjects using Anaphoricity Information and Centering Theory

  • 김계성 (경북대학교 소프트웨어기술연구소) ;
  • 박성배 (경북대학교 IT대학 컴퓨터학부) ;
  • 이상조 (경북대학교 IT대학 컴퓨터학부)
  • 투고 : 2013.08.08
  • 심사 : 2013.10.24
  • 발행 : 2013.12.31

초록

본 논문은 지역적 응집성을 모델링하는 중심화 이론을 이용하여 한국어 영형대명사의 지시해결에 접근한다. 중심화 이론은 영어 대명사의 해결을 위해 널리 사용되고 있지만, 일본어, 한국어 등의 언어에서 나타나는 영형대명사 해결에 중심화의 프레임워크를 적용하는 데에는 많은 어려움이 따른다. Grosz et al.의 중심화 이론은 지시적 표현들의 비조응적 사용을 고려하지 않으므로, 문서에 나타나는 비조응적 기능의 영형 대명사가 중심화 이론을 이용한 영형대명사의 선행사 식별에 중요한 영향을 미친다. 본 논문은 이를 위해 먼저 절 간의 결속 관계를 이용하여 영형대명사, 특히 영형주어의 문장 내 조응성을 결정하고, 다음으로 중심화의 순위를 이용하여 그 영형의 선행사를 식별하는 방법을 제안한다. 실험을 통해 조응성 결정을 이용하는 제안한 방법이 이를 이용하지 않는 베이스라인 시스템보다 우수함을 알 수 있었다.

This paper approaches the problem of resolving Korean zero pronouns using Centering Theory modeling local coherence. Centering Theory has been widely used to resolve English pronouns. However, it is much difficult to apply the centering framework for zero pronoun resolution in languages such as Japanese and Korean. Since in particular the use of non-anaphoric zero pronouns without explicit antecedents is not considered in the Centering Theory of Grosz et al., the presence of non-anaphoric cases negatively affects the performance of the resolution system based on Centering Theory. To overcome this, this paper presents a method which determines the intra-sentential anaphoricity of zero pronouns in subject position by using relationships between clauses, and then identifies antecedents of zero subjects. In our experiments, the proposed method outperforms the baseline method relying solely on Centering Theory.

키워드

참고문헌

  1. S. Zhao and H. T. Ng, "Identification and Resolution of Chinese Zero Pronouns: A Machine Learning Approach", In Proceedings of the Joint Conference on EMNLP-CoNLL, pp.541-550, 2007.
  2. F. Kong and G. Zhou, "A tree kernel-based unified framework for Chinese zero anaphora resolution," In Proceedings of Empirical Methods in Natural Language Processing, pp. 882-891, October, 2010.
  3. R. Iida, K. Inui, and Y. Matsumoto, "Zero-Anaphora Resolution by Learning Rich Syntactic Pattern Features", ACM Transactions on Asian Language Information Processing, Vol.6, No.4, article 12, December, 2007.
  4. B. J. Grosz, S. Weinstein, and A. K. Joshi, "Centering: A Framework for Modeling the Local Coherence of Discourse", Computational Linguistics, Vol.21 No.2, pp.203-225, June, 1995.
  5. M. Poesio, R. Stevenson, B. D. Eugenio, and J. Hitzeman, "Centering: A parametric theory and its instantiations," Computational Linguistics, Vol.30, No.3, pp.309-363, September, 2004. https://doi.org/10.1162/0891201041850911
  6. Y. Cui, Q. Hu, H. Pan, and J. Hu, "Zero anaphora resolution in Chinese discourse," In Proceedings of Computational Linguistics and Intelligent Text Processing, pp.245-248, 2006.
  7. Y. Sakurai, "Centering in Japanese: How to rank forward-looking centers in a complex sentence," In Proceedings of the 22nd Northwest Linguistic Conference, pp. 243-256, 2006.
  8. Q. Hu, "A corpus-based study on zero anaphora resolution in Chinese discourse," Ph.D. dissertation, Department of Chinese, Translation and Linguistics, City University of Hong Kong, 2008.
  9. C.-L. Yeh and Y.-C. Chen, "Zero anaphora resolution in Chinese with partial parsing based on centering theory," In Proceedings of IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE'03), pp.683-688, 2003.
  10. R. Iida and M. Poesio, "A cross-lingual ILP solution to zero anaphora resolution," In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp.804-813, 2011.
  11. A. Rahman and V. Ng, "Syntactic Parsing for Ranking-Based Coreference Resolution," In Proceedings of IJCNLP, pp. 465-473, 2011.
  12. K.-S. Kim, S.-B. Park, S.-Y. Park and S.-J. Lee, "Anaphoricity determination of zero pronouns for intra-sentential zero anaphora resolution", Journal of KIISE : Software and applications, Vol.37, No.12, pp.928-935, Sep., 2010. (in Korean)
  13. M. Zhang, J. Zhang, and J. Su, "Exploring syntactic features for relation extraction using a convolution tree kernel," In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, pp.288-295, 2006.
  14. M. Collins and N. Duffy, "Convolution Kernels for Natural Language", In Proceedings of Neural Information Processing Systems, pp.625-632, 2001.
  15. T. Joachims, "Making large-Scale SVM Learning Practical", Advances in Kernel Methods - Support Vector Learning, B. Scholkopf and C. Burges and A. Smola (ed.), MIT-Press, 1999.