DOI QR코드

DOI QR Code

Coreference Resolution for Korean using Mention Pair with SVM

SVM 기반의 멘션 페어 모델을 이용한 한국어 상호참조해결

  • Received : 2014.09.15
  • Accepted : 2015.01.21
  • Published : 2015.04.15

Abstract

In this paper, we suggest a Coreference Resolution system for Korean using Mention Pair with SVM. The system introduced in this paper, also be able to extract Mention from document which is including automatically tagged name entity information, dependency trees and POS tags. We also built a corpus, including 214 documents with Coreference tags, referencing online news and Wikipedia for training the system and testing the system's performance. The corpus had 14 documents from online news, along with 200 question-and-answer documents from Wikipedia. When we tested the system by corpus, the performance of the system was extracted by MUC-F1 55.68%, B-cube-F1 57.19%, and CEAFE-F1 61.75%.

본 논문에서는 품사태그가 부착된 의존구문 트리와 개체명 정보가 자동 태깅된 말뭉치에서 멘션(Mention)을 추출하고, SVM을 기반으로 한 멘션 페어 모델(Mention Pair Model) 이용하는 한국어 상호참조해결 시스템을 제안한다. 시스템의 학습과 평가를 위해서 신문기사를 기반으로 하는 14개의 문서와, 위키피디아(Wikipedia)를 기반으로 하는 200개의 질의응답 문서를 분석하여 상호참조해결 정보가 담긴 말뭉치를 구축했다. 실험결과 본 논문에서 제안한 시스템의 성능은 MUC-F1 55.68%, B-cube-F1 57.19%, CEAFE-F1 61.75% 로 나타났다.

Keywords

Acknowledgement

Grant : 휴먼 지식증강서비스를 위한 지능진화형 WiseQA 플랫폼 기술 개발

Supported by : 한국산업기술평가관리원

References

  1. Sameer Pradhan, Lance Ramshaw, Mitchell Marcus, Martha Palmer, Ralph Weischedel, and Nianwen Xue, "Conll-2011 shared task: Modeling unrestricted coreference in ontonotes," Proc. of the Fifteenth Conference on Computational Natural Language Learning: Shared Task, Association for Computational Linguistics, pp. 1-27, 2011.
  2. Changki Lee, and Myung-Gil Jang, "Fast training of structured SVM using fixed-threshold sequential minimal optimization," ETRI journal 31.2 (2009): pp. 121-128, 2009. https://doi.org/10.4218/etrij.09.0108.0276
  3. Marc Vilain, John Burger, John Aberdeen, Dennis Connolly, and Lynette Hirschman, "A model-theoretic coreference scoring scheme," Proc. of the 6th conference on Message understanding, Association for Computational Linguistics, pp. 45-52, 1995.
  4. Amit Bagga, and Breck Baldwin, "Algorithms for scoring coreference chains," Proc. of the first international conference on language resources and evaluation workshop on linguistics coreference, Vol. 1, pp. 563-566, 1998.
  5. Luo Xiaoqiang, "On coreference resolution performance metrics," Proc. of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, Association for Computational Linguistics, pp. 25-32, 2005.
  6. Heeyoung Lee, Yves Peirsman, Angel Chang, Nathanael Chambers, Mihai Surdeanu, and Dan Jurafsky, "Stanford's multi-pass sieve coreference resolution system at the CoNLL-2011 shared task," Proc. of the Fifteenth Conference on Computational Natural Language Learning: Shared Task, Association for Computational Linguistics, pp. 28-34, 2011.
  7. Altaf Rahman, and Vincent Ng, "Supervised models for coreference resolution," Proc. of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2-Volume 2, Association for Computational Linguistics, pp. 968-977, 2009.
  8. Cheoneum Park, Kyoungho Choi, and Changki Lee, "Korean Coreference Resolution using the Multipass Sieve," Journal of KIISE, JOK, Vol. 41, No. 11, pp. 992-1005, 2014. https://doi.org/10.5626/JOK.2014.41.11.992

Cited by

  1. Korean Coreference Resolution with Guided Mention Pair Model using the Deep Learning vol.38, pp.6, 2016, https://doi.org/10.4218/etrij.16.0115.0896