DOI QR코드

DOI QR Code

Comparative Study of User Reactions in OTT Service Platforms Using Text Mining

텍스트 마이닝을 활용한 OTT 서비스 플랫폼별 사용자 반응 비교 연구

  • Soonchan Kwon (Graduate School of Information, Yonsei University) ;
  • Jieun Kim (Graduate School of Information, Yonsei University) ;
  • Beakcheol Jang (Graduate School of Information, Yonsei University)
  • 권순찬 ;
  • 김지은 ;
  • 장백철
  • Received : 2024.01.16
  • Accepted : 2024.05.29
  • Published : 2024.06.30

Abstract

This study employs text mining techniques to compare user responses across various Over-The-Top (OTT) service platforms. The primary objective of the research is to understand user satisfaction with OTT service platforms and contribute to the formulation of more effective review strategies. The key questions addressed in this study involve identifying prominent topics and keywords in user reviews of different OTT services and comprehending platform-specific user reactions. TF-IDF is utilized to extract significant words from positive and negative reviews, while BERTopic, an advanced topic modeling technique, is employed for a more nuanced and comprehensive analysis of intricate user reviews. The results from TF-IDF analysis reveal that positive app reviews exhibit a high frequency of content-related words, whereas negative reviews display a high frequency of words associated with potential issues during app usage. Through the utilization of BERTopic, we were able to extract keywords related to content diversity, app performance components, payment, and compatibility, by associating them with content attributes. This enabled us to verify that the distinguishing attributes of the platforms vary among themselves. The findings of this study offer significant insights into user behavior and preferences, which OTT service providers can leverage to improve user experience and satisfaction. We also anticipate that researchers exploring deep learning models will find our study results valuable for conducting analyses on user review text data.

본 연구는 텍스트 마이닝 기법을 활용하여 다양한 OTT(Over-The-Top) 서비스 플랫폼에 대한 사용자 반응을 비교한다. 연구의 주요 목표는 OTT 서비스 플랫폼의 사용자 만족도를 파악하여 보다 효과적인 리뷰 전략을 수립하는 데 기여하는 것이다. 본 연구에서 다루는 주요 질문에는 다양한 OTT 서비스에 대한 사용자 리뷰에서 두드러진 토픽과 키워드를 식별하고 플랫폼별 사용자 반응을 이해하는 것이 포함된다. 이를 위해 긍정, 부정 리뷰에서 중요 단어를 추출하기 위해 Tf-idf를, 복잡한 사용자 리뷰를 보다 정교하고 포괄적으로 분석하기 위해 고급 토픽 모델링 기법인 BERTopic을 사용한다. Tf-idf 분석한 결과, 앱에 대한 긍정 리뷰는 콘텐츠와 관련된 단어들의 수치가 높았으며 부정 리뷰에서는 앱 사용 과정에서 발생할 수 있는 문제점에 관한 단어 수치가 높게 기록되었다. BERTopic을 활용한 토픽 모델링에서는 콘텐츠의 속성과 연관 지어 콘텐츠의 다양성, 앱 성능 요소, 결제, 호환성에 관한 키워드를 도출하였으며, 플랫폼 별로 두각을 보이는 속성이 다르다는 점도 확인하였다. 본 연구 결과는 사용자 행동과 선호도에 대한 중요한 인사이트를 제공하며, 이를 통해 OTT 서비스 제공업체는 사용자 경험과 만족도를 개선하는 데 활용할 수 있다. 또한, 연구자들은 사용자 리뷰 텍스트 분석에서 딥러닝 모델을 활용한 연구의 아이디어를 얻을 수 있을 것이라 기대한다.

Keywords

Acknowledgement

This work was supported by the National Research Foundation of Korea (NRF) funded by Korean Government under Grant RS-2023-00273751

References

  1. JH Yoo, and JY Park. "Factors Influencing the Continued Usage Intention of Global OTT Service Users: A Case Study of Netflix," Korean Journal of Broadcasting & Telecommunications Research, No. 102, pp.46-79, 2018. https://doi.org/10.22876/kjbtr.2018.102.002
  2. JY Lee, and BS Jeon, "A Study on the Determinants of OTT Service Satisfaction and Continued Usage Intention," Korean Journal of Broadcasting and Telecommunication Studies, Vol.34, No.4, pp.116-144, 2020. https://doi.org/10.22876/kab.2020.34.4.004
  3. BJ Min, JK Go, and JY Song, "Netflix's Competitive Strategy: Strategic Combination of Network Effects, Content Resale, and Original Content," Journal of Strategic Management, Vol.23, No.2, pp.25-45, 2020. https://doi.org/10.17786/jsm.2020.23.2.002
  4. HS Cho, SA Kang, and MH Ryu, "An Analysis of OTT Service Review Using Text Mining: Focusing on the Competitive Advantage of Local Service," Journal of the Korea Institute of Information and Communication Engineering, Vol.46, No.4, pp.722-733, 2021. https://doi.org/10.7840/kics.2021.46.4.722
  5. SW Kim and DW Kim, "Rethinking OTT regulation based on the global OTT market trends and regulation cases," Journal of Korean Society for Internet Information, Vol.20, No.6, 2019. https://doi.org/10.7472/jksii.2019.20.6.143
  6. YJ Kim, "A Study on the Impact of OTT Service Proliferation on Content Production, Distribution, and Consumption," Broadcast Culture Research, Vol.27, No.1, pp.75-102, 2015. https://doi.org/10.22854/sbc.2015.27.1.75
  7. KH Hwang, and KA Kim, "Examining Factors Affecting the Binge-Watching Behaviors of OTT Services," Journal of the Korea Convergence Society, Vol.11, No.3, pp.181-186, 2020. https://doi.org/10.15207/JKCS.2020.11.3.181
  8. DK Kim, SH Choi, and SJ Kim, "An Analysis of the Users' Behavior Patterns in the Domestic OTT Services," The Journal of Internet Electronic Commerce Research, Vol.17, No.4, pp.69-82, 2017. http://www.koreaec.org/bbs/board.php?bo_table=kieca_ board17&wr_id=910
  9. JH Yoo, and JY Park, "A Study on the Factors Influencing the Continued Usage Intention of Global OTT Service Users: A Case Study of Netflix," Korean Journal of Broadcasting & Telecommunications Research, pp.46-79, 2018. https://doi.org/10.22876/kjbtr.2018.102.002
  10. YK Chung, and Wei Zhang, "Effects of Service Characteristics of a Subscription-based OTT on User Satisfaction and Continuance Intention: Evaluation by Netflix Users," The Journal of the Korea Contents Association, Vol.20, No12, pp.123-135, 2020. https://doi.org/10.5392/JKCA.2020.20.12.123
  11. HG Lee, CK Yeo, and SH Kang, "Identifying the Mechanism of Formation of Continuous Usage Intention in Domestic OTT Services," Journal of Korea Service Management Society, Vol.22, No.4, pp.145-169, 2021. https://doi.org/10.15706/jksms.2021.22.4.007
  12. S Han, and SH Kwon, "A Study on the Impact of Personalized Recommendation Services and Expectation Alignment on Frequency Analysis of Continued Usage Intention: Focused on YouTube and Netflix Thumbnails," Korean Journal of Communication & Information, Vol.111, pp.151-180, 2022. https://doi.org/10.46407/kjci.2022.02.111.151
  13. SJ An, J Seo, and JI Choi, "A Study on the Factors Affecting the Continuous Intention to Use Digital Content Over-the-Top Service," Journal of Korean Society for Quality Management, Vol.50, No.1, pp.105-124, 2022. https://doi.org/10.7469/JKSQM.2022.50.1.105
  14. MJ Ko, and SW Lee, "A Comparative Analysis of OTT Service Reviews Before and After the Onset of the Pandemic Using Text Mining Technique: Focusing on the Emotion-Focused Coping and Nostalgia," Journal of the Korea Contents Association, Vol.21, No.11, pp.375-388, 2021. https://doi.org/10.5392/JKCA.2021.21.11.375
  15. SS Choi, and SJ Yeon, "Exploring Satisfaction Factors of OTT(Over The Top) Services: Comparative Analysis of Netflix and Wavve's App Reviews in Korea Market," The Korea Contents Society, pp.373-374, 2021.
  16. SJ Lee. "A Study on Determinants Affecting User's Satisfaction and Dissatisfaction of Korean OTT Service Using Online Review Analysis: Based on Lexical Analysis and LDA Topic Modeling Method," Korean Journal of Communication Studies, Vol.30, No.2 pp.41-74, 2022. https://doi.org/10.23875/kca.30.2.2
  17. SY Yu, Mi Jin Noh, and YS Kim, "A study of changes in user experience and service evaluation - Topic modeling of Netflix app reviews," Smart Media Journal, Vol.12, No.6, pp.27-34, 2023. https://doi.org/10.30693/SMJ.2023.12.6.27
  18. Hyun Kwak and Ho Geun Lee, "Investigation of Factors Affecting the Effects of Online Consumer Reviews," Informatization Policy, Vol.20, No.3, pp.3-17, 2013. https://koreascience.kr/article/JAKO201320762921014.page 1014.page
  19. Hassani, Hossein, et al. "Text mining in big data analytics," Big Data and Cognitive Computing, Vol.4, No.1, pp.1, 2020. https://doi.org/10.3390/bdcc4010001
  20. Hotho, Andreas, Andreas Nurnberger, and Gerhard Paass, "A brief survey of text mining," Journal for Language Technology and Computational Linguistics, Vol.20., No.1 pp.19-62, 2005. https://doi.org/10.21248/jlcl.20.2005.68
  21. ZS Zeng, and HS Lee, "Analyse of the Box Office Causes of Korean Dramas in China Using Big Data : Focusing on the Case of ," Video Technology Research, pp.19-41, 2022. https://doi.org/10.3390/bdcc4010001
  22. Heimerl, Florian, et al. "Word cloud explorer: Text analytics based on word clouds," 2014 47th Hawaii international conference on system sciences, IEEE, 2014. https://doi.org/10.1109/hicss.2014.231
  23. Qaiser, Shahzad, and Ramsha Ali, "Text mining: use of TF-IDF to examine the relevance of words to documents," International Journal of Computer Applications, Vol.181, No.1, pp.25-29, 2018. https://doi.org/10.5120/ijca2018917395
  24. JY Sung, and CY Park, "The Study on Election Campaign Agenda by Using Big Data," Korean Journal of Communication Studies, Vol.27, No.3 pp.75-104, 2019. https://doi.org/10.23875/kca.27.3.3
  25. SJ Lee, and HJ Kim, "Keyword Extraction from News Corpus using Modified TF-IDF," The Journal of Society for e-Business Studies, Vol.14, No.4 pp.59-73, 2009. https://koreascience.kr/article/JAKO200910348031067.page 10348031067.page
  26. Alghamdi, Rubayyi, and Khalid Alfalqi, "A survey of topic modeling in text mining," Int. J. Adv. Comput. Sci. Appl.(IJACSA), Vol.6, No.1, 2015. https://doi.org/10.14569/IJACSA.2015.060121
  27. Blei, David M., Andrew Y. Ng, and Michael I. Jordan, "Latent dirichlet allocation," Journal of machine Learning research, pp.993-1022, 2003. https://doi.org/10.7551/mitpress/1120.003.0082
  28. Grootendorst, Maarten, "BERTopic: Neural topic modeling with a class-based TF-IDF procedure," arXiv preprint arXiv:2203.05794, 2022. https://doi.org/10.48550/arXiv.2203.05794
  29. Devlin, Jacob, et al. "Bert: Pre-training of deep bidirectional transformers for language understanding," arXiv preprint arXiv:1810.04805, 2018. https://doi.org/10.48550/arXiv.1810.04805
  30. Abuzayed, Abeer, and Hend Al-Khalifa, "BERT for Arabic topic modeling: An experimental study on BERTopic technique," Procedia computer science, Vol.189, pp.191-194, 2021. https://doi.org/10.1016/j.procs.2021.05.096
  31. Ko, Young Soo, et al. "Topic modeling insomnia social media corpus using BERTopic and building automatic deep learning classification model," Korea Society for Information Management, Vol.39, No.2, pp.111-129, 2022. https://doi.org/10.3743/KOSIM.2022.39.2.111
  32. Woo-Ryeong, Y. A. N. G., and Y. A. N. G. Hoe-Chang, "Topic Modeling Analysis of Social Media Marketing using BERTopic and LDA," The Journal of Industrial Distribution & Business (JIDB), Vol.13, No.9, pp.37-50, 2022. https://doi.org/10.13106/jidb.2022.vol13.no9.37
  33. WR Yang, and HC Yang, "Research Trend Analysis on Customer Satisfaction in Service Field Using BERTopic and LDA," The Journals of Economics, Marketing & Management, Vol.10, No.6, pp.27-37, 2022. https://doi.org/10.20482/jemm.2022.10.6.27
  34. Herzberg, Frederick, "The motivation-hygiene concept and problems of manpower," Personnel administration, 1964. https://psycnet.apa.org/record/1964-09377-001
  35. Matzler, Kurt, and Elmar Sauerwein, "The factor structure of customer satisfaction: An empirical test of the importance grid and the penalty reward contrast analysis," International journal of service industry management, Vol.13, No.4, pp.314-332, 2022. https://doi.org/10.1108/09564230210445078
  36. DeLone, William H., and Ephraim R. McLean, "The DeLone and McLean model of information systems success: a ten-year update," Journal of management information systems, Vol.19, No.4, pp.9-30, 2003. https://doi.org/10.1080/07421222.2003.11045748
  37. Oliver, Richard L., "A cognitive model of the antecedents and consequences of satisfaction decisions," Journal of marketing research, Vol.17, No.4, pp.460-469, 1980. https://doi.org/10.2307/3150499
  38. Bhattacherjee, Anol, "Understanding information systems continuance: An expectation-confirmation model," MIS quarterly, pp.351-370, 2001. https://doi.org/10.2307/3250921
  39. Kwak Eun-a, & Choi Jin-ho, "An Analysis of User's Perception regarding Service Attributes and Competitive Relationship among OTT Services in the Korean Market," Broadcasting & Communacation, Vol.20, No.2, pp.121-169, 2019. https://doi.org/10.22876/bnc.2019.20.2.004
  40. Reimers, Nils, and Iryna Gurevych, "Sentence-bert: Sentence embeddings using siamese bert-networks," arXiv preprint arXiv:1908.10084, 2019. https://doi.org/10.18653/v1/d19-1410
  41. McInnes, Leland, John Healy, and James Melville, "Umap: Uniform manifold approximation and projection for dimension reduction," arXiv preprint arXiv:1802.03426, 2018. https://doi.org/10.21105/joss.00861
  42. McInnes, Leland, John Healy, and Steve Astels, "hdbscan: Hierarchical density based clustering," J. Open Source Software, Vol.2, No.11, 205, 2017. https://doi.org/10.21105/joss.00205