DOI QR코드

DOI QR Code

A Crowdsourcing-based Emotional Words Tagging Game for Building a Polarity Lexicon in Korean

한국어 극성 사전 구축을 위한 크라우드소싱 기반 감성 단어 극성 태깅 게임

  • Kim, Jun-Gi (Graduate School of Culture, Information, and Public Policy (Game Producing Major), Hongik University) ;
  • Kang, Shin-Jin (Graduate School of Culture, Information, and Public Policy (Game Producing Major), Hongik University) ;
  • Bae, Byung-Chull (Graduate School of Culture, Information, and Public Policy (Game Producing Major), Hongik University)
  • 김준기 (홍익대학교 문화정보정책대학원 게임프로듀싱전공) ;
  • 강신진 (홍익대학교 문화정보정책대학원 게임프로듀싱전공) ;
  • 배병철 (홍익대학교 문화정보정책대학원 게임프로듀싱전공)
  • Received : 2017.03.21
  • Accepted : 2017.04.20
  • Published : 2017.04.20

Abstract

Sentiment analysis refers to a way of analyzing the writer's subjective opinions or feelings through text. For effective sentiment analysis, it is essential to build emotional word polarity lexicon. This paper introduces a crowdsourcing-based game that we have developed for efficiently building a polarity lexicon in Korean. First, we collected a corpus from the relating Internet communities using a crawler, and we classified them into words using the Twitter POS analyzer. These POS-tagged words are provided as a form of mobile platform based tagging game in which the players voluntarily tagged the polarities of the words, and then the result was collected into the database. So far we have tagged the polarities of about 1200 words. We expect that our research can contribute to the Korean sentiment analysis research especially in the game domain by collecting more emotional word data in the future.

감성 분석은 글을 통해 작성자의 주관적인 생각이나 느낌을 분석하는 방법으로 효과적인 감성 분석을 위해서는 감성 단어 극성 사전 구축이 필수적이다. 본 논문은 효율적인 한국어 극성 사전 구축을 위해 우리가 개발한 크라우드소싱 기반 게임을 소개한다. 먼저, 크롤러를 이용해 인터넷 커뮤니티에서 말뭉치들을 수집했고, Twitter 형태소를 이용해 수집한 말뭉치를 형태소별로 분류하고 단어화했다. 이 단어들은 모바일 플랫폼 기반 태깅 게임 형태로 제공되어 게임플레이를 통해 플레이어들이 자발적으로 단어들의 극성을 선택하고 결과가 데이터 베이스에 축적되도록 게임이 설계되었다. 현재까지 약 1200여개의 단어들의 극성을 태깅하였으며, 향후 좀 더 많은 감성 단어 데이터들을 축적함으로써 특히 게임 도메인에서 한국어 감성 분석 연구에 기여할 것으로 기대한다.

Keywords

References

  1. http://isis.kisa.or.kr/board/?pageId=060100&bbsId=7&itemId=818
  2. B. Pang and L. Lee, "Opinion mining and sentiment analysis", Foundations and Trends(R) in Information Retrieval, Vol 2, No. 1-2, pp. 1-135, 2008. https://doi.org/10.1561/1500000011
  3. H. Wang., D. Can, A. Kazemzadeh, F. Bar, and S. Narayanan, "A system for real-time twitter sentiment analysis of 2012 us presidential election cycle", Proceedings of the ACL System Demonstrations. Association for Computational Linguistics, pp. 115-120, 2012.
  4. C. Dellarocas, X. M. Zhang, and N. F. Awad, "Exploring the value of online product reviews in forecasting sales: The case of motion pictures", Journal of Interactive marketing, Vol. 21, No. 4, pp. 23-45, 2007. https://doi.org/10.1002/dir.20087
  5. J. Bollen, H. Mao, and X. Zeng, "Twitter mood predicts the stock market", Journal of computational science, Vol.2 No. 1, pp. 1-8, 2011. https://doi.org/10.1016/j.jocs.2010.12.007
  6. K. Ravi and V. Ravi, "A survey on opinion mining and sentiment analysis: tasks, approaches and applications" Knowledge-Based Systems, Vol. 89, pp. 14-46, 2015. https://doi.org/10.1016/j.knosys.2015.06.015
  7. L. v. Ahn, "Games with a purpose", IEEE Computer Magazine, 39 (6), pp. 92-94, 2006.
  8. P. D. Turney, "Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews", Proceedings of the 40th annual meeting on association for computational linguistics, pp. 417-424, 2002.
  9. B. Pang, L. Lee, and S. Vaithyanathan. "Thumbs up?: sentiment classification using machine learning techniques", Proceedings of the ACL-02 conference on Empirical methods in natural language processing, Vol. 10, pp. 79-86, 2002.
  10. S. Baccianella, A. Esuli, and F. Sebastiani, "SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining", LREC, Vol. 10, pp. 2200-2204, 2010.
  11. An, J., and H.-W. Kim, "Building a Korean Sentiment Lexicon Using Collective Intelligence", Journal of Intelligence and Information Systems, pp. 49-67, 2015.
  12. C.-C. M. Thisone, A. Ghasemi, and B. Faltings, "Sentiment analysis using a novel human computation game", Proceedings of the 3rd Workshop on the People's Web Meets NLP: Collaboratively Constructed Semantic Resources and their Applications to NLP, pp. 1-9, 2012.
  13. M. Furini and M. Montangero, "TSentiment: on gamifying Twitter sentiment analysis", Computers and Communication (ISCC), IEEE Symposium on, pp. 91-96, 2016.
  14. Yoonsung Hong, Haewoon Kwak, Youngmin Baek, Sue Moon, "Tower of Babel: A crowdsourcing game building sentiment lexicons for resource-scarce languages", Proceedings of the 22nd International Conference on WWW, pp. 549-556, 2013.
  15. D. C. Brabham, "Crowdsourcing as a Model for Problem Solving: An Introduction and Cases", Convergence, Vol. 14, No. 1, pp. 75-90, 2008. https://doi.org/10.1177/1354856507084420
  16. L. v. Ahn and L. Dabbish, "Labeling image s with a Computer Game", Proceedings of the SIGCHI conference on Human factors in computing system, pp. 319-326, 2004.
  17. L. v. Ahn and M. Kedia and M. Blum, "Verbosity: A Game for Collecting Common-Sense Facts", Proceedings of the SIGCHI conference on Human Factors in computing systems, pp. 75-78, 2006.
  18. C. J. Ho, T. H. Chang, J. C. Lee, J. Y. Hsu, and K. T. Chen, "KissKissBan: A Competitive Human Computation Game for Image Annotation", Proceedings of the acm sigkdd workshop on human computation, pp. 11-14, 2009.
  19. L. v. Ahn, R. Liu, and M. Blum, "Peekaboom: A Game for Locating Objects In Images", Proceedings of the SIGCHI conference on Human Factors in computing systems, pp. 55-64, 2006.
  20. J. S. Kim, M. J. Greene, A. Zlateski, K. Lee, M. Richardson, S. C. Turaga, M. Purcaro, M. Balkam, A. Robinson, B. F. Behabadi, M. Campos, W. Denk, H. S. Seung, and the EyeWirers, "Space-time wiring specificity supports direction selectivity in the retina", Nature, Vol. 509, No. 7500, pp. 331-336, 2014. https://doi.org/10.1038/nature13240
  21. F. Khatib, S. Cooper, M. D. Tyka, K. Xu, I. Makedon, Z. Popovic, D. Bakera, and Foldit Players, "Algorithm discovery by protein folding game players", Proceedings of the National Academy of Sciences, Vol. 108, No. 47, pp. 18949-18953, 2011. https://doi.org/10.1073/pnas.1115898108
  22. J. Lee, W. Kladwang, M. Lee, D. Cantu, M. Azizyan, H. Kim, A. Limpaecher, S. Yoon, A. Treuillea, R. Das, and EteRNA Participants, "RNA design rules from a massive open laboratory", Proceedings of the National Academy of Sciences, Vol. 111, No. 6, pp. 2122-2127, 2014. https://doi.org/10.1073/pnas.1313039111
  23. http://www.kocca.kr/cop/bbs/view/B0000147/1831102.do
  24. http://isis.kisa.or.kr/board/?pageId=060100&bbsId=7&itemId=815