DOI QR코드

DOI QR Code

Development of a Korean chatbot system that enables emotional communication with users in real time

사용자와 실시간으로 감성적 소통이 가능한 한국어 챗봇 시스템 개발

  • Baek, Sungdae (School of Electronic and Electrical Engineering, Kyungpook National University) ;
  • Lee, Minho (Department of Artificial Intelligence, Kyungpook National University)
  • 백성대 (경북대학교 전자전기공학부) ;
  • 이민호 (경북대학교 인공지능학과)
  • Received : 2021.11.15
  • Accepted : 2021.11.22
  • Published : 2021.11.30

Abstract

In this study, the creation of emotional dialogue was investigated within the process of developing a robot's natural language understanding and emotional dialogue processing. Unlike an English-based dataset, which is the mainstay of natural language processing, the Korean-based dataset has several shortcomings. Therefore, in a situation where the Korean language base is insufficient, the Korean dataset should be dealt with in detail, and in particular, the unique characteristics of the language should be considered. Hence, the first step is to base this study on a specific Korean dataset consisting of conversations on emotional topics. Subsequently, a model was built that learns to extract the continuous dialogue features from a pre-trained language model to generate sentences while maintaining the context of the dialogue. To validate the model, a chatbot system was implemented and meaningful results were obtained by collecting the external subjects and conducting experiments. As a result, the proposed model was influenced by the dataset in which the conversation topic was consultation, to facilitate free and emotional communication with users as if they were consulting with a chatbot. The results were analyzed to identify and explain the advantages and disadvantages of the current model. Finally, as a necessary element to reach the aforementioned ultimate research goal, a discussion is presented on the areas for future studies.

Keywords

Acknowledgement

이 논문은 2018학년도 경북대학교 국립대학육성사업 지원비에 의하여 연구되었음

References

  1. L. C. De Silva, T. Miyasato, and R. Nakatsu, "Facial emotion recognition using multi-modal information", Proceedings of ICICS, 1997 Int. Conf. on Inf., Commun. Signal Process. Theme: Trends Inf. Syst. Eng. Wirel. Multimed. Commun. (Cat., IEEE), Vol. 1, pp. 397-401, 1997.
  2. S. Kim, P. G. Georgiou, S. Lee, and S. Narayanan, "Real-time emotion detection system using speech: Multi-modal fusion of different timescale features". 2007 IEEE 9th Workshop Multimed. Signal Process.,pp. 48-51, 2007.
  3. A. Dufaux, L. Besacier, M. Ansorge, and F. Pellandini, "Automatic sound detection and recognition for noisy environment", 2000 10th Eur. Signal Process. Conf., IEEE, pp. 1-4, 2000.
  4. D. S. Moirangthem and M. Lee, "Chat Discrimination for Intelligent Conversational Agents with a Hybrid CNN-LMTGRU Network", Proceedings of The 3rd Workshop Represent. Learn. NLP, pp. 30-40, Melbourne, Australia, 2018.
  5. D. Kang, D. S. Moirangthem, and M. Lee, "An intelligent chat bot agent system that can provide various services according to user's intention", Int. Conf. Electronics, Inf. Commun. (ICEIC) 2019, pp. 1010-1011, Auckland, New Zealand, 2019.
  6. T. Park, J. Bae, C. Woo, J. Seo, J. Park, D. Kang, and M. Lee, "Investigation of Embedding Methods for Korean Gag Chat-bot", The 3rd Int. Conf. Consum. Electronics (ICCE) Asia, pp. 42-44, Jeju, Korea, 2018.
  7. D. S. Moirangthem and M. Lee, "Improving Text Classification with Deep Recurrent Neural Networks", 18th China-Japan-Korea Joint Workshop Neurobiol. Neuroinformatics, pp. 7(1)-7(1), Jeju, Korea, 2018.
  8. G. C. Sergio and M. Lee, "Robust Sentiment Classification in Noisy User-generated Text with BERT-based model", Thirty-seventh Int. Conf. Mach. Learn.(ICML 2020), pp. 24(1)-24(1), Virtual Event, 2020.
  9. G. C. Sergio, and M. Lee, "Stacked DeBERT: All attention in incomplete data for text classification", Neural Netw., Vol. 136, pp. 87-96, 2021. https://doi.org/10.1016/j.neunet.2020.12.018
  10. G. C. Sergio and M. Lee, "Emotional video to audio transformation using deep recurrent neural networks and a neuro-fuzzy system", Math. Probl. Eng. 2020, pp. 8478527(1)-8478527(15), 2020.
  11. D. Kang and M. Lee, "Seq-DNC-seq: Context Aware Dialog Generation System Through External Memory", Int. Joint Conf. Neural Netw (IJCNN), pp. 1-8, Budapest, Hungary, 2019.
  12. M. A. Kiasari, D. S. Moirangthem, and M. Lee. "Joint moment-matching autoencoders", Neural Netw., Vol. 106, pp. 185-193, 2018. https://doi.org/10.1016/j.neunet.2018.07.001
  13. D. S. Moirangthem and M. Lee, "Abstractive summarization of long texts by representing multiple compositionalities with temporal hierarchical pointer generator network", Neural Netw., Vol. 124, pp. 1-11, 2020. https://doi.org/10.1016/j.neunet.2019.12.022
  14. D. S. Moirangthem and M. Lee, "Hierarchical and lateral multiple timescales gated recurrent units with pre-trained encoder for long text classification", Expert Syst. Appl., Vol. 165, pp. 113898(1)-113898(8), 2021.
  15. https://nodejs.org/ko/, (retrieved on Nov. 14, 2021).
  16. https://www.aihub.or.kr/aidata/7978 (retrieved on Nov. 14, 2021).
  17. R. Zandieand M. H. Mahoor. "Emptransfo: A multi-head transformer architecture for creating empathetic dialog systems", The Thirty-Third Int.Flairs Conf. 2020, pp. 276-281, Florida, USA, 2020.
  18. T. Wolf, V. Sanh, J. Chaumond, and C. Delangue, "Transfertransfo: A transfer learning approach for neural network based conversational agents", arXiv preprint, pp.1901.08149(1)-1901.08149(6), 2019.
  19. R. C. Streijl, S. Winkler, and D. S. Hands, "Mean opinion score (MOS) revisited: methods and applications, limitations and alternatives", Multimedia Systems, Vol. 22, No. 2, pp. 213-227, 2016. https://doi.org/10.1007/s00530-014-0446-1
  20. K. Papineni, S. Roukos, T. Ward, and W. J. Zhu, "Bleu: a method for automatic evaluation of machine translation", Proc. of 40th Annu. Meet. Association Comput. Linguist., pp. 311-318, 2002.
  21. D. Adiwardana, M. T. Luong, D. R. So, J. Hall, N. Fiedel, R. Thoppilan, Z. Yang, A. Kulshreshtha, G. Nemade, Y. Lu, and Q. V. Le, "Towards a human-like open-domain chatbot", arXiv preprint, pp. 2001.09977(1)-2001.09977(38), 2020.
  22. K. Clark, M. T. Luong, Q. V. Le, and C. D. Manning, "Electra: Pre-training text encoders as discriminators rather than generators", arXiv preprint, pp. 2003.10555(1)-2003.10555(18), 2020.
  23. E. Wallace, S. Feng, N. Kandpal, M. Gardner, and S. Singh, "Universal adversarial triggers for attacking and analyzing NLP", arXiv preprint, pp. 1908.07125(1)-1908.07125(15), 2019.