DOI QR코드

DOI QR Code

Methodology for Deriving Required Quality of Product Using Analysis of Customer Reviews

사용자 리뷰 분석을 통한 제품 요구품질 도출 방법론

  • Yerin Yu (Graduate School of Business IT, Kookmin University) ;
  • Jeongeun Byun (R&BD Analysis Research Team, Technology Commercialization Research Center, Korea Institute of Science and Technology Information ) ;
  • Kuk Jin Bae (R&BD Analysis Research Team, Technology Commercialization Research Center, Korea Institute of Science and Technology Information) ;
  • Sumin Seo (R&BD Analysis Research Team, Technology Commercialization Research Center, Korea Institute of Science and Technology Information) ;
  • Younha Kim (Graduate School of Business IT, Kookmin University) ;
  • Namgyu Kim (Graduate School of Business IT, Kookmin University)
  • Received : 2023.01.25
  • Accepted : 2023.02.20
  • Published : 2023.04.30

Abstract

Recently, as technology development has accelerated and product life cycles have been shortened, it is necessary to derive key product features from customers in the R&D planning and evaluation stage. More companies want differentiated competitiveness by providing consumer-tailored products based on big data and artificial intelligence technology. To achieve this, the need to correctly grasp the required quality, which is a requirement of consumers, is increasing. However, the existing methods are centered on suppliers or domain experts, so there is a gap from the actual perspective of consumers. In other words, product attributes were defined by suppliers or field experts, but this may not consider consumers' actual perspective. Accordingly, the demand for deriving the product's main attributes through reviews containing consumers' perspectives has recently increased. Therefore, we propose a review data analysis-based required quality methodology containing customer requirements. Specifically, a pre-training language model with a good understanding of Korean reviews was established, consumer intent was correctly identified, and key contents were extracted from the review through a combination of KeyBERT and topic modeling to derive the required quality for each product. RevBERT, a Korean review domain-specific pre-training language model, was established through further pre-training. By comparing the existing pre-training language model KcBERT, we confirmed that RevBERT had a deeper understanding of customer reviews. In addition, all processes other than that of selecting the required quality were linked to the automation process, resulting in the automation of deriving the required quality based on data.

Keywords

Acknowledgement

This work was supported by the Korea Institute of Science and Technology Information (K-23-L03-C03-S01).

References

  1. Beltagy, I., Lo, K., and Cohan, A., "SciBERT: A Pretrained Language Model for Scientific Text", arXiv preprint arXiv:1903.10676, 2019. 
  2. Blei, D. and Lafferty, J., "Correlated topic models", Advances in Neural Information Processing Systems, Vol. 18, 2006, pp. 1-47. 
  3. Blei, D. and Lafferty, J., "Dynamic topic models", Proceedings of the 23rd International Conference on Machine learning, June, 2006, pp. 113-120. 
  4. Blei, D., Ng, A. Y., and Jordan, M., "Latent dirichlet allocation", Journal of Machine Learning Research, January, 2003, pp. 993-1022. 
  5. Byun, S., Lee, D., and Kim, N., "Methodology for identifying issues of user reviews from the perspective of evaluation criteria: Focus on a hotel information site", Journal of Intelligence and Information Systems, Vol. 22, No. 3, 2016, pp. 23-43.  https://doi.org/10.13088/JIIS.2016.22.3.023
  6. Chalkidis, I., Fergadiotis, M., Malakasiotis, P., Aletras, N., and Androutsopoulos, I., "LEGAL-BERT: The Muppets Straight Out of Law School", arXiv preprint arXiv: 2010.02559, 2020. 
  7. Chang, A., Hua, B., and Yu, D., "Keyword Extraction and Technology Entity Extraction for Disruptive Technology Policy Texts", EEKE@ JCDL, October, 2021, pp. 36-40. 
  8. Chung, P., Ahn, H., and Kwahk, K., "Identification of core features and values of smartphone design using text mining and social network analysis", Korean Journal of Business Administration, Vol. 32, No. 1, January, 2019, pp. 27-47.  https://doi.org/10.18032/kaaba.2019.32.1.27
  9. Devlin, J., Chang, M., Lee, K., and Toutanova, K., "Bert: Pre-training of deep bidirectional transformers for language understanding", arXiv preprint arXiv:1810.04805, 2018. 
  10. Grootendorst, M., "KeyBERT", GitHub, https://github.com/MaartenGr/KeyBERT, 2020. 
  11. Hong, T., Hong, J., Kim, E., and Kim, M., "An Intelligent Recommendation System by Integrating the Attributes of Product and Customer in the Movie Reviews", Journal of Intelligence and Information Systems (JIIS), Vol. 28, No. 2, June, 2022, pp. 1-18. 
  12. Kim, K., Kim, Y., and Kim, J., "A study on customer satisfaction of mobile shopping apps using topic analysis of user reviews", Journal of Society for e-Business Studies, Vol. 23, No. 4, November, 2018, pp. 41-62.  https://doi.org/10.7838/JSEBS.2018.23.4.041
  13. Kim, M., Kim, N., and Jung, I., "A Methodology for Extracting Shopping-Related Keywords by Analyzing Internet Navigation Patterns", Journal of Intelligence and Information Systems, Vol. 20, No. 2, 2014, pp. 123-136.  https://doi.org/10.13088/JIIS.2014.20.2.123
  14. Korea Institute of Patent Information, "KorPatBERT", GitHub, https://github.com/kipi-ai/korpatbert, 2021. 
  15. Lee, J. and Moon, S., "A Study on Keyword Analysis Related to Travel Stress in the Post-Corona Era Using Big Data", The Society of Convergence Knowledge Transactions, Vol. 9, No. 4, December, 2021, 87-94.  https://doi.org/10.22716/SCKT.2021.9.4.046
  16. Lee, J., Yoon, W., Kim, S., Kim, D., Kim, S., So, C., and Kang, J., "BioBERT: a pre-trained biomedical language representation model for biomedical text mining", Bioinformatics, Vol. 36. No. 4, 2020, pp. 1234-1240.  https://doi.org/10.1093/bioinformatics/btz682
  17. Lee, S. and Choi, W., "Schedule related keyword extraction method using embedding layer and LSTM", Proceeding of the Conference of the Korea Information Science Society, Vol. 2020, No. 07, July, 2020, pp. 608-610. 
  18. Lim, M., Kim, P., and Shin, J., "Sentiment Analysis Model with Semantic Topic Classification of Reviews", Smart Media Journal, Vol. 9, No. 2, June, 2020, pp. 69-77.  https://doi.org/10.30693/SMJ.2020.9.2.69
  19. Lim, Y. and Kim, H., "A study on the sentiment analysis using big data of hotels online review", Korean Journal of Hospitality and Tourism, Vol. 28, No. 7, October, 2019, pp. 105-123.  https://doi.org/10.24992/KJHT.2019.10.28.07.105.
  20. Mihalcea, R. and Tarau, P., "Textrank: Bringing order into text", Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, July, 2004, pp. 404-411. 
  21. Newman, D., Lau, J., Grieser, K., and Baldwin, T., "Automatic evaluation of topic coherence", Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, June, 2010, pp. 100-108. 
  22. Papadimitriou, C., Tamaki, H., Raghavan, P., and Vempala, S., "Latent semantic indexing: A probabilistic analysis", Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, May, 1998, pp. 159-168. 
  23. Park, D. and Kim, H., "A Proposal of Join Vector for Semantic Factor Reflection in TF-IDF Based Keyword Extraction", The Journal of Korean Institute of Information Technology, Vol. 16, No. 2, February, 2018, pp. 1-16.  https://doi.org/10.14801/jkiit.2018.16.2.1
  24. Rosen-Zvi, M., Griffiths, T., Steyvers, M., and Smyth, P., "The author-topic model for authors and documents", arXiv preprint arXiv:1207.4169, 2012. 
  25. SCATTER LAB, "Dialog-BERT", Naver Deview, https://deview.kr/2019/schedule, 2019. 
  26. Seo, S. and Kim, N., "Improving Classification Accuracy Using Further Pretraining Scheme with Selective Masking", KIISE Transactions on Computing Practices (KTCP), Vol. 27, No. 9, September, 2021, pp. 428-439.  https://doi.org/10.5626/KTCP.2021.27.9.428
  27. Wang, X. and McCallum, A., "Topics over time: A non-markov continuous-time model of topical trends", Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August, 2006, pp. 424-433. 
  28. Wang, Y. and Zhang, J., "Keyword extraction from online product reviews based on bi-directional LSTM recurrent neural network", 2017 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), December, 2017, pp. 2241-2245. 
  29. Wu, X., Du, Z., and Guo, Y., "A visual attention-based keyword extraction for document classification", Multimedia Tools and Applications, Vol. 77, No. 19, 2018, pp. 25355-25367.  https://doi.org/10.1007/s11042-018-5788-9
  30. Yang, Y., Uy, M., and Huang, A., "Finbert: A pretrained language model for financial communications", arXiv preprint arXiv:2006.08097, 2020. 
  31. Yoo, H. and Kim, H., "Cluster-based keyword Ranking Technique", Proceedings of the Korea Information Processing Society Conference, Vol. 23, No. 2, January 01, 2016, pp. 529-532. 
  32. Yoo, Y., Lim, D., and Heo, T., "Solar cell patent classification method based on keyword extraction and deep neural network", arXiv preprint arXiv:2109.08796, 2021. 
  33. Zhang, Q., Wang, Y., Gong, Y., and Huang, X. J., "Keyphrase extraction using deep recurrent neural networks on twitter", Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, November, 2016, pp. 836-845.