DOI QR코드

DOI QR Code

Burmese Sentiment Analysis Based on Transfer Learning

  • Mao, Cunli (College of Information Engineering and Automation & Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology) ;
  • Man, Zhibo (College of Information Engineering and Automation & Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology) ;
  • Yu, Zhengtao (College of Information Engineering and Automation & Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology) ;
  • Wu, Xia (College of Information Engineering and Automation & Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology) ;
  • Liang, Haoyuan (College of Information Engineering and Automation & Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology)
  • Received : 2020.11.19
  • Accepted : 2022.03.25
  • Published : 2022.08.31

Abstract

Using a rich resource language to classify sentiments in a language with few resources is a popular subject of research in natural language processing. Burmese is a low-resource language. In light of the scarcity of labeled training data for sentiment classification in Burmese, in this study, we propose a method of transfer learning for sentiment analysis of a language that uses the feature transfer technique on sentiments in English. This method generates a cross-language word-embedding representation of Burmese vocabulary to map Burmese text to the semantic space of English text. A model to classify sentiments in English is then pre-trained using a convolutional neural network and an attention mechanism, where the network shares the model for sentiment analysis of English. The parameters of the network layer are used to learn the cross-language features of the sentiments, which are then transferred to the model to classify sentiments in Burmese. Finally, the model was tuned using the labeled Burmese data. The results of the experiments show that the proposed method can significantly improve the classification of sentiments in Burmese compared to a model trained using only a Burmese corpus.

Keywords

Acknowledgement

This work was supported by the Key Program of the National Natural Science Foundation of China (No. U21B2027 and 61732005); the National Natural Science Foundation of China (No. 61866019, 62166023, and 61972186), Yunnan Provincial Major Science and Technology Special Plan Projects (No. 202103AA080015 and 202002AD080001), the Program for Applied & Basic Research of Yunnan Province in Key Areas (No. 2019FA023), and the Candidates of the Young and Middle Aged Academic and Technical Leaders of Yunnan Province (No. 2019HB006).

References

  1. J. W. Watkins, "Burmese," Journal of the International Phonetic Association, vol. 31, no. 2, pp. 291-295, 2001. https://doi.org/10.1017/s0025100301002122
  2. R. Mihalcea, C. Corley, and C. Strapparava, "Corpus-based and knowledge-based measures of text semantic similarity," in Proceedings of the 21st National Conference of Artificial Intelligence (AAAI), Boston, MA, 2006, pp. 775-780.
  3. B. Ohana and B. Tierney, "Sentiment classification of reviews using SentiWordNet," Proceedings of IT&T, vol. 11, no. 6, pp. 75-78, 2009.
  4. V. Hatzivassiloglou and K. McKeown, "Predicting the semantic orientation of adjectives. In 35th annual meeting of the association for computational linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, Madrid, Spain, 1997, pp. 174-181.
  5. R. Xia, C. Wang, X. Dai, and T. Li, "Co-training for semi-supervised sentiment classification based on dual-view bags-of-words representation," in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Beijing, China, 2015, pp. 1054-1063.
  6. Q. Chen, W. Li, Y. Lei, X. Liu, and Y. He, "Learning to adapt credible knowledge in cross-lingual sentiment analysis," in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Beijing, China, 2015, pp. 419-429.
  7. P. Ruangkanokmas, T. Achalakul, and K. Akkarajitsakul, "Deep belief networks with feature selection for sentiment classification," in Proceedings of 2016 7th International Conference on Intelligent Systems, Modelling and Simulation (ISMS), Bangkok, Thailand, 2016, pp. 9-14.
  8. X. Wan, "Co-training for cross-lingual sentiment classification," in Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, Singapore, 2009, pp. 235-243.
  9. Z. Lin, X. Jin, X. Xu, W. Wang, X. Cheng, and Y. Wang, "A cross-lingual joint aspect/sentiment model for sentiment analysis," in Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, Shanghai, China, 2014, pp. 1089-1098.
  10. X. Chen, Y. Sun, B. Athiwaratkun, C. Cardie, and K. Weinberger, "Adversarial deep averaging networks for cross-lingual sentiment classification," Transactions of the Association for Computational Linguistics, vol. 6, pp. 557-570, 2018. https://doi.org/10.1162/tacl_a_00039
  11. X. Zhou, X. Wan, and J. Xiao, "Cross-lingual sentiment classification with bilingual document representation learning," in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Berlin, Germany, 2016, pp. 1403-1412.
  12. H. Yan, J. Dai, X. Qiu, and Z. Zhang, "A unified generative framework for aspect-based sentiment analysis," in Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL/IJCNLP) (Volume 1: Long Papers), Virtual Event, 2021, pp. 2416-2429.
  13. L. Xu, Y. K. Chia, and L. Bing, "Learning span-level interactions for aspect sentiment triplet extraction," in Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL/IJCNLP) (Volume 1: Long Papers), Virtual Event, 2021, pp. 4755-5766.
  14. R. Li, H. Chen, F. Feng, Z. Ma, X. Wang, and E. Hovy, "Dual graph convolutional networks for aspect-based sentiment analysis," in Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Virtual Event, 2021, pp. 6319-6329.
  15. E. Kuriyozov, Y. Doval, and C. Gomez-Rodriguez, "Cross-lingual word embeddings for Turkic languages," in Proceedings of the 12th Language Resources and Evaluation Conference (LREC), Marseille, France, 2020, pp. 4054-4062.
  16. D. Wang, B. Jing, C. Lu, J. Wu, G. Liu, C. Du, and F. Zhuang, "Coarse alignment of topic and sentiment: a unified model for cross-lingual sentiment classification," IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 2, pp. 736-747, 2020.
  17. T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, "Distributed representations of words and phrases and their compositionality," Advances in Neural Information Processing Systems, vol. 26, pp. 3111-3119, 2013.
  18. S. Li, L. Huang, J. Wang, and G. Zhou, "Semi-stacking for semi-supervised sentiment classification," in Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), Beijing, China, 2015, pp. 27-31.
  19. D. Zeng, K. Liu, S. Lai, G. Zhou, and J. Zhao, "Relation classification via convolutional deep neural network," in Proceedings of the 25th International Conference on Computational Linguistics: Technical Papers (COLING), Dublin, Ireland, 2014, pp. 2335-2344.
  20. C. Zhang, H. Zhang, J. Qiao, D. Yuan, and M. Zhang, "Deep transfer learning for intelligent cellular traffic prediction based on cross-domain big data," IEEE Journal on Selected Areas in Communications, vol. 37, no. 6, pp. 1389-1401, 2019. https://doi.org/10.1109/jsac.2019.2904363
  21. A. Joulin, E. Grave, P. Bojanowski, and T. Mikolov, "Bag of tricks for efficient text classification," in Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, 2017, pp. 427-431.
  22. B. Rink and S. Harabagiu, "UTD: classifying semantic relations by combining lexical and semantic resources," in Proceedings of the 5th International Workshop on Semantic Evaluation, Uppsala, Sweden, 2010, pp. 256-259.