DOI QR코드

DOI QR Code

A Semantic Representation Based-on Term Co-occurrence Network and Graph Kernel

  • Noh, Tae-Gil (School of Computer Science and Engineering, Kyungpook National University) ;
  • Park, Seong-Bae (School of Computer Science and Engineering, Kyungpook National University) ;
  • Lee, Sang-Jo (School of Computer Science and Engineering, Kyungpook National University)
  • Received : 2011.08.23
  • Accepted : 2011.11.04
  • Published : 2011.12.25

Abstract

This paper proposes a new semantic representation and its associated similarity measure. The representation expresses textual context observed in a context of a certain term as a network where nodes are terms and edges are the number of cooccurrences between connected terms. To compare terms represented in networks, a graph kernel is adopted as a similarity measure. The proposed representation has two notable merits compared with previous semantic representations. First, it can process polysemous words in a better way than a vector representation. A network of a polysemous term is regarded as a combination of sub-networks that represent senses and the appropriate sub-network is identified by context before compared by the kernel. Second, the representation permits not only words but also senses or contexts to be represented directly from corresponding set of terms. The validity of the representation and its similarity measure is evaluated with two tasks: synonym test and unsupervised word sense disambiguation. The method performed well and could compete with the state-of-the-art unsupervised methods.

Keywords

References

  1. T. Griffiths, M. Steyvers, and J. Tenenbaum, "Topics in semantic representation," Psychological Review, vol. 114, no. 2, pp. 211-244, 2007. https://doi.org/10.1037/0033-295X.114.2.211
  2. D. Blei, A. Ng, and M. Jordan, "Latent dirichlet allocation," The Journal of Machine Learning Research, vol. 3, pp. 993-1022, 2003.
  3. M. Rosen-Zvi, T. Griffiths, M. Steyvers, and P. Smyth, "The author-topic model for authors and documents," in Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, pp. 487-494, 2004.
  4. T. Landauer and S. Dumais, "A solution to plato's problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge," Psychological Review, vol. 104, no. 2, pp. 211-240, 1997. https://doi.org/10.1037/0033-295X.104.2.211
  5. G. Denhi'ere and B. Lemaire, "Effects of high-order co-occurrences on word semantic similarity," Current Psychology Letters: Behaviour, Brain & Cognition, vol. 1, no. 18, 2006.
  6. T. Hughes and D. Ramage, "Lexical semantic relatedness with random graph walks," in Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 581-589, 2007.
  7. E. Gabrilovich and S. Markovitch, "Computing semantic relatedness using wikipedia-based explicit semantic analysis," in Proceedings of the 20th International Joint Conference on Artificial Intelligence, pp. 6-12, 2007.
  8. J. Boyd-Graber, D. Blei, and X. Zhu, "A topic model for word sense disambiguation," in Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 1024-1033, 2007.
  9. J. Veronis, "Hyperlex: lexical cartography for information retrieval," Computer Speech & Language, vol. 18, no. 3, pp. 223-252, 2004. https://doi.org/10.1016/j.csl.2004.05.002
  10. I. Klapaftis and S. Manandhar, "Word sense induction using graphs of collocations," in Proceedings of the 18th European Conference on Artificial Intelligence, pp. 298-302, 2008.
  11. W. Duan, M. Song, and A. Yates, "Fast maxmargin clustering for unsupervised word sense disambiguation in biomedical texts," BMC Bioinformatics, vol. 10, no. 3, 2009.
  12. E. Agirre, E. Alfonseca, K. Hall, J. Kravalova, M. Pas¸ca, and A. Soroa, "A study on similarity and relatedness using distributional and wordnet-based approaches," in Proceedings of 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL '09, pp. 19-27, 2009.
  13. T.-G. Noh, S.-B. Park, H.-G. Yoon, S.-J. Lee, and S.-Y. Park, "An automatic translation of tags for multimedia contents using folksonomy networks," in Proceedings of the 32nd ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 492-499, 2009.
  14. D. Haussler, "Convolution kernels on discrete structures, UCSC-CRL-99-10," tech. rep., University of Santa Cruz, 1999.
  15. T. G¨artner, P. Flach, and S. Wrobel, "On graph kernels: Hardness results and efficient alternatives," in Proceedings of the 16th Computational Learning Theory and Kernel Machines, pp. 129-143, 2003.
  16. Y. Tsuruoka, "Bidirectional inference with the easiest-first strategy for tagging sequence data," in Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 467-474, 2005.
  17. K. Rothenhausler and H. Schutze, "Part of speech filtered word spaces," in Proceedings of the Workshop on Contextual Information in Semantic Space Models, pp. 25-33, 2007.
  18. J. Bigham, M. Littman, V. Shnayder, and P. Turney, "Combining independent modules to solve multiplechoice synonym and analogy problems," in Proceedings of the International Conference on Recent Advances in Natural Language Processing, pp. 482-489, 2003.
  19. R. Mihalcea, T. Chklovski, and A. Kilgarriff, "The Senseval-3 english lexical sample task," in Proceedings of the 3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, pp. 25-28, 2004.
  20. E. Agirre, D. Mart´ınez, O. Lacalle, and A. Soroa, "Two graph-based algorithms for state-of-the-art wsd," in Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 585-593, 2006.
  21. E. Agirre, O. de Lacalle, and D. Martnez, "Exploring feature spaces with SVD and unlabeled data for word sense disambiguation," in Proceedings of the Conference on Recent Advances on Natural Language Processing, 2005.
  22. C. Niu, W. Li, R. Srihari, and H. Li, "Word independent context pair classification model for word sense disambiguation," in Proceedings of the 9th Conference on Computational Natural Language Learning, pp. 33-39, 2005.
  23. J. Preiss, "Probabilistic WSD in Senseval-3," in Proceedings of the 3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, pp. 213-216, 2004.