DOI QR코드

DOI QR Code

A Sentiment Classification Approach of Sentences Clustering in Webcast Barrages

  • Li, Jun (School of Information and Communication, Guilin University of Electronic Technology) ;
  • Huang, Guimin (Guangxi Key Lab of Trusted Software, School of Computer Science and Information Security, Guilin University of Electronic Technology) ;
  • Zhou, Ya (Guangxi Key Lab of Trusted Software, School of Computer Science and Information Security, Guilin University of Electronic Technology)
  • 투고 : 2019.03.08
  • 심사 : 2019.09.26
  • 발행 : 2020.06.30

초록

Conducting sentiment analysis and opinion mining are challenging tasks in natural language processing. Many of the sentiment analysis and opinion mining applications focus on product reviews, social media reviews, forums and microblogs whose reviews are topic-similar and opinion-rich. In this paper, we try to analyze the sentiments of sentences from online webcast reviews that scroll across the screen, which we call live barrages. Contrary to social media comments or product reviews, the topics in live barrages are more fragmented, and there are plenty of invalid comments that we must remove in the preprocessing phase. To extract evaluative sentiment sentences, we proposed a novel approach that clusters the barrages from the same commenter to solve the problem of scattering the information for each barrage. The method developed in this paper contains two subtasks: in the data preprocessing phase, we cluster the sentences from the same commenter and remove unavailable sentences; and we use a semi-supervised machine learning approach, the naïve Bayes algorithm, to analyze the sentiment of the barrage. According to our experimental results, this method shows that it performs well in analyzing the sentiment of online webcast barrages.

키워드

참고문헌

  1. B. Liu, Sentiment Analysis: Mining Sentiments, Opinions, And Emotions. Cambridge, UK: Cambridge University, 2015.
  2. K. Ravi and V. Ravi, "A survey on opinion mining and sentiment analysis: tasks, approaches and applications," Knowledge-Based Systems, vol. 89, pp. 14-46, 2015. https://doi.org/10.1016/j.knosys.2015.06.015
  3. E. Cambria, "Affective computing and sentiment analysis," IEEE Intelligent Systems, vol. 31, no. 2, pp. 102-107, 2016. https://doi.org/10.1109/MIS.2016.31
  4. E. Cambria and B. White, "Jumping NLP curves: a review of natural language processing research," IEEE Computational Intelligence Magazine, vol. 9, no. 2, pp. 48-57, 2014. https://doi.org/10.1109/MCI.2014.2307227
  5. B. Pang, L. Lee, and S. Vaithyanathan, "Thumbs up?: sentiment classification using machine learning techniques," in Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, Philadelphia, PA, 2002, pp. 79-86.
  6. Z. Zhai, B. Liu, L. Zhang, H. Xu, and P. Jia, "Identifying evaluative sentences in online discussions," in Proceedings of the 25th AAAI Conference on Artificial Intelligence, San Francisco, CA, 2011, pp. 933-938.
  7. B. Pang and L. Lee, "A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts," in Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, Barcelona, Spain, 2004, pp. 271-278.
  8. B. Pang and L. Lee, "Opinion mining and sentiment analysis," Foundations and Trends in Information Retrieval, vol. 2, no. 1-2, pp. 1-135, 2008. https://doi.org/10.1561/1500000011
  9. M. Hu and B. Liu, "Mining and summarizing customer reviews," in Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Seattle, WA, 2004, pp. 168-177.
  10. L. Jiang, M. Yu, M. Zhou, X. Liu, and T. Zhao, "Target-dependent twitter sentiment classification," in Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, OR, 2011, pp. 151-160.
  11. L. Barbosa and J. Feng, "Robust sentiment detection on twitter from biased and noisy data," in Proceedings of the 23rd International Conference on Computational Linguistics: Posters, Cordoba, Spain, 2020, pp. 36- 44.
  12. X. Hu, L. Tang, J. Tang, and H. Liu, "Exploiting social relations for sentiment analysis in microblogging," in Proceedings of the 6th ACM International Conference on Web Search and Data Mining, Rome, Italy, 2013, pp. 537-546.
  13. L. Gong, M. Al Boni, and H. Wang, "Modeling social norms evolution for personalized sentiment classification," in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Berlin, Germany, 2016, pp. 855-865.
  14. S. Dong, X. Zhang, and Y. Li, "Microblog sentiment analysis method based on spectral clustering," Journal of Information Processing Systems, vol. 14, no. 3, pp. 727-739, 2018. https://doi.org/10.3745/JIPS.04.0076
  15. A. Hassan, V. Qazvinian, and D. Radev, "What's with the attitude? Identifying sentences with attitude in online discussions," in Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, Cambridge, MA, 2010, pp. 1245-1255.
  16. M. Salehan and D. J. Kim, "Predicting the performance of online consumer reviews: a sentiment mining approach to big data analytics," Decision Support Systems, vol. 81, pp. 30-40, 2016. https://doi.org/10.1016/j.dss.2015.10.006
  17. R. Socher, J. Pennington, E. H. Huang, A. Y. Ng, and C. D. Manning, "Semi-supervised recursive autoencoders for predicting sentiment distributions," in Proceedings of the Conference on Empirical Methods in Natural Language Processing, Edinburgh, UK, 2011, pp. 151-161.
  18. A. Ramesh, S. H. Kumar, J. Foulds, and L. Getoor, "Weakly supervised models of aspect-sentiment for online course discussion forums," in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Beijing, China, 2015, pp. 74-83.
  19. M. Fernandez-Gavilanes, T. Alvarez-Lopez, J. Juncal-Martinez, E. Costa-Montenegro, and F. J. Gonzalez-Castano, "Unsupervised method for sentiment analysis in online texts," Expert Systems with Applications, vol. 58, pp. 57-75, 2016. https://doi.org/10.1016/j.eswa.2016.03.031
  20. J. Li, S. Fong, Y. Zhuang, and R. Khoury, "Hierarchical classification in text mining for sentiment analysis of online news," Soft Computing, vol. 20, no. 9, pp. 3411-3420, 2016. https://doi.org/10.1007/s00500-015-1812-4
  21. D. D. Wu, L. Zheng, and D. L. Olson, "A decision support approach for online stock forum sentiment analysis," IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 44, no. 8, pp. 1077-1087, 2014. https://doi.org/10.1109/tsmc.2013.2295353
  22. Y. Liu, J. W. Bi, and Z. P. Fan, "Ranking products through online reviews: a method based on sentiment analysis technique and intuitionistic fuzzy set theory," Information Fusion, vol. 36, pp. 149-161, 2017. https://doi.org/10.1016/j.inffus.2016.11.012
  23. D. T. Vo and Y. Zhang, "Don't count, predict! an automatic approach to learning sentiment lexicons for short text," in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Berlin, Germany, 2016, pp. 219-224.
  24. H. Saif, Y. He, and H. Alani, "Semantic sentiment analysis of twitter," in The Semantic Web - ISWC 2012. Heidelberg: Germany, 2012, pp. 508-524.
  25. K. Schouten and F. Frasincar, "Survey on aspect-level sentiment analysis," IEEE Transactions on Knowledge and Data Engineering, vol. 28, no. 3, pp. 813-830, 2015. https://doi.org/10.1109/TKDE.2015.2485209
  26. A. Hussain and E. Cambria, "Semi-supervised learning for big social data analysis," Neurocomputing, vol. 275, pp. 1662-1673, 2018. https://doi.org/10.1016/j.neucom.2017.10.010
  27. Y. Wu and M. Ester, "Flame: a probabilistic model combining aspect based opinion mining and collaborative filtering," in Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, Shanghai, China, 2015, pp. 199-208.
  28. Y. Wang, M. Huang, L. Zhao, and X. Zhu, "Attention-based LSTM for aspect-level sentiment classification," in Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, TX, 2016, pp. 606-615.
  29. R. K. Amplayo and S. W. Hwang, "Aspect sentiment model for micro reviews," in Proceedings of 2017 IEEE International Conference on Data Mining (ICDM), New Orleans, LA, 2017, pp. 727-732.
  30. J. Feng, S. Cai, and X. Ma, "Enhanced sentiment labeling and implicit aspect identification by integration of deep convolution neural network and sequential algorithm," Cluster Computing, vol. 22, no. 3, pp. 5839-5857, 2019. https://doi.org/10.1007/s10586-017-1626-5
  31. L. Peng and Y. Liu, "Feature selection and overlapping clustering-based multilabel classification model," Mathematical Problems in Engineering, vol. 2018, article no. 2814897, 2018.
  32. N. W. Xue, "Chinese word segmentation as character tagging," Computational Linguistics and Chinese Language Processing, vol. 8, no. 1, pp. 29-48, 2003.
  33. C. J. Hutto and E. Gilbert, "Vader: a parsimonious rule-based model for sentiment analysis of social media text," in Proceedings of the 8th International AAAI Conference on Weblogs and Social Media, Ann Arbor, MI, 2014, pp. 216-225.
  34. F. Pagani, M. De Astis, M. Graziano, A. Lanzi, and D. Balzarotti, "Measuring the role of greylisting and nolisting in fighting spam," in Proceedings of 2016 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Toulouse, France, 2016, pp. 562-571.
  35. W. Che, Y. Zhao, H. Guo, Z. Su, and T. Liu, "Sentence compression for aspect-based sentiment analysis," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23, no. 12, pp. 2111-2124, 2015. https://doi.org/10.1109/TASLP.2015.2443982