DOI QR코드

DOI QR Code

Use of Word Clustering to Improve Emotion Recognition from Short Text

  • Yuan, Shuai (National Engineering Research Center for E-Learning, Central China Normal University) ;
  • Huang, Huan (School of Education, South Central University for Nationalities) ;
  • Wu, Linjing (School of Educational Information Technology, Central China Normal University)
  • Received : 2016.07.13
  • Accepted : 2016.11.24
  • Published : 2016.12.30

Abstract

Emotion recognition is an important component of affective computing, and is significant in the implementation of natural and friendly human-computer interaction. An effective approach to recognizing emotion from text is based on a machine learning technique, which deals with emotion recognition as a classification problem. However, in emotion recognition, the texts involved are usually very short, leaving a very large, sparse feature space, which decreases the performance of emotion classification. This paper proposes to resolve the problem of feature sparseness, and largely improve the emotion recognition performance from short texts by doing the following: representing short texts with word cluster features, offering a novel word clustering algorithm, and using a new feature weighting scheme. Emotion classification experiments were performed with different features and weighting schemes on a publicly available dataset. The experimental results suggest that the word cluster features and the proposed weighting scheme can partly resolve problems with feature sparseness and emotion recognition performance.

Keywords

References

  1. R. W. Picard, Affective Computing, 1th ed., Cambridge: MIT Press, 1997.
  2. R. A. Calvo and S. D'Mello, "Affect detection: an interdisciplinary review of models, methods, and their applications," IEEE Transactions on Affective Computing, vol. 1, no. 1, pp. 18-37, 2010. https://doi.org/10.1109/T-AFFC.2010.1
  3. C. O. Alm, D. Roth, and R. Sproat, "Emotions from text: machine learning for text-based emotion prediction," in Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, Vancouver, Canada, 2005, pp. 579-586.
  4. S. Aman and S. Szpakowicz, "Identifying expressions of emotion in text," in Proceedings of the 10th International Conference on Text, Speech and Dialogue, Plzen, Czech Republic, 2007, pp. 196-205.
  5. S. Aman and S. Szpakowicz, "Using Roget's thesaurus for fine-grained emotion recognition," in Proceedings of the 3rd International Joint Conference on Natural Language Processing, Hyderabad, India, 2008, pp. 312-318.
  6. T. Xu and Q. Peng, "Extended information inference model for unsupervised categorization of web short texts," Journal of Information Science, vol. 38, no. 6, pp. 512-531, 2012. https://doi.org/10.1177/0165551512448985
  7. L. Gaudette and N. Japkowicz, "Compact features for sentiment analysis," in Advances in Artificial Intelligence, Lecture Notes in Computer Science vol. 6657, Heidelberg: Springer, pp. 146-157, 2011.
  8. B. Pang and L. Lee, "Opinion mining and sentiment analysis," Foundations and Trends in Information Retrieval, vol. 2, no. 1-2, pp. 1-135, 2008. https://doi.org/10.1561/1500000011
  9. B. Liu, Sentiment Analysis and Opinion Mining, San Rafael, CA: Morgan & Claypool Publishers, 2012.
  10. R. Feldman, "Techniques and applications for sentiment analysis," Communications of the ACM, vol. 56, no. 4, pp. 82-89, 2013. https://doi.org/10.1145/2436256.2436274
  11. P. Subasic and A. Huettner, "Affect analysis of text using fuzzy semantic typing," IEEE Transactions on Fuzzy Systems, vol. 9, no. 4, pp. 483-496, 2001. https://doi.org/10.1109/91.940962
  12. A. Neviarouskaya, H. Prendinger, and M. Ishizuka, "Affect analysis model: novel rule-based approach to affect sensing from text," Natural Language Engineering, vol. 17, no. 1, pp. 95-135, 2011. https://doi.org/10.1017/S1351324910000239
  13. T. Joachims, "Text categorization with support vector machines: learning with many relevant features," in Proceedings of the 10th European Conference on Machine Learning, Chemnitz, Germany, 1998, pp. 137-142.
  14. C. Elliott, "The affective reasoner: a process model of emotions in multi-agent system," Ph.D. dissertation, Northwestern University, Chicago, IL, 1992.
  15. A. C. Boucouvalas and X. Zhe, "Text-to-emotion engine for real time internet communication," in Proceedings of the 3rd International Symposium on Communication Systems, Networks and Digital Signal Processing, Stafford, VA, 2002, pp. 164-168.
  16. H. Liu, H. Lieberman, and T. Selker, "A model of textual affect sensing using real-world knowledge," in Proceedings of the 8th International Conference on Intelligent User Interfaces, Miami, FL, 2003, pp. 125-132.
  17. C. O. Alm, "Affect in text and speech," Ph.D. dissertation, University of Illinois at Urbana-Champaign, IL, 2008.
  18. P. Katz, M. Singleton, and R. Wicentowski, "SWAT-MP: the SemEval-2007 systems for task 5 and task 14," in Proceedings of the 4th International Workshop on Semantic Evaluations, Prague, Czech Republic, 2007, pp. 308-313.
  19. C. Strapparava and R. Mihalcea, "Learning to identify emotions in text," in Proceedings of the 23rd Annual ACM Symposium on Applied Computing, Fortaleza, Brazil, 2008, pp. 1556-1560.
  20. J. J. Jiang and D. W. Conrath, "Semantic similarity based on corpus statistics and lexical taxonomy," in Proceedings of the International Conference on Research in Computational Linguistics, Taiwan, 1997, pp. 19-33.
  21. G. A. Miller, C. Leacock, R. Tengi, and R. T. Bunker, "A semantic concordance," in Proceedings of the ARPA Workshop on Human Language Technology, Stroudsburg, PA, 1993, pp. 303-308.
  22. C. Strapparava and R. Mihalcea, "SemEval-2007 task 14: affective text," in Proceedings of the 4th International Workshop on Semantic Evaluations, Prague, Czech Republic, 2007, pp. 70-74.
  23. S. Dumais, J. Platt, D. Heckerman, and M. Sahami, "Inductive learning algorithms and representations for text categorization," in Proceedings of the 7th International Conference on Information and Knowledge Management, Bethesda, MD, 1998, pp. 148-155.
  24. R. E. Fan, K. W. Chang, C. J. Hsieh, X. R. Wang, and C. J. Lin, "LIBLINEAR: a library for large linear classification," Journal of Machine Learning Research, vol. 9, pp. 1871-1874, 2008.
  25. P. Ekman, "Facial expression and emotion," American Psychologist, vol. 48, no. 4, pp. 384-392, 1993. https://doi.org/10.1037/0003-066X.48.4.384
  26. S. Wang, D. Li, L. Zhao, and J. Zhang, "Sampling cutting method for imbalanced text sentiment classification based on BRC," Knowledge-Based Systems, vol. 37, no. 2, pp. 451-461, 2013. https://doi.org/10.1016/j.knosys.2012.09.003

Cited by

  1. Data Mining Techniques and Applications vol.9, pp.1, 2018, https://doi.org/10.4018/IJSITA.2018010104