DOI QR코드

DOI QR Code

Analysis of News Agenda Using Text mining and Semantic Network Analysis: Focused on COVID-19 Emotions

텍스트 마이닝과 의미 네트워크 분석을 활용한 뉴스 의제 분석: 코로나 19 관련 감정을 중심으로

  • Received : 2020.11.24
  • Accepted : 2021.03.05
  • Published : 2021.03.31

Abstract

The global spread of COVID-19 around the world has not only affected many parts of our daily life but also has a huge impact on many areas, including the economy and society. As the number of confirmed cases and deaths increases, medical staff and the public are said to be experiencing psychological problems such as anxiety, depression, and stress. The collective tragedy that accompanies the epidemic raises fear and anxiety, which is known to cause enormous disruptions to the behavior and psychological well-being of many. Long-term negative emotions can reduce people's immunity and destroy their physical balance, so it is essential to understand the psychological state of COVID-19. This study suggests a method of monitoring medial news reflecting current days which requires striving not only for physical but also for psychological quarantine in the prolonged COVID-19 situation. Moreover, it is presented how an easier method of analyzing social media networks applies to those cases. The aim of this study is to assist health policymakers in fast and complex decision-making processes. News plays a major role in setting the policy agenda. Among various major media, news headlines are considered important in the field of communication science as a summary of the core content that the media wants to convey to the audiences who read it. News data used in this study was easily collected using "Bigkinds" that is created by integrating big data technology. With the collected news data, keywords were classified through text mining, and the relationship between words was visualized through semantic network analysis between keywords. Using the KrKwic program, a Korean semantic network analysis tool, text mining was performed and the frequency of words was calculated to easily identify keywords. The frequency of words appearing in keywords of articles related to COVID-19 emotions was checked and visualized in word cloud 'China', 'anxiety', 'situation', 'mind', 'social', and 'health' appeared high in relation to the emotions of COVID-19. In addition, UCINET, a specialized social network analysis program, was used to analyze connection centrality and cluster analysis, and a method of visualizing a graph using Net Draw was performed. As a result of analyzing the connection centrality between each data, it was found that the most central keywords in the keyword-centric network were 'psychology', 'COVID-19', 'blue', and 'anxiety'. The network of frequency of co-occurrence among the keywords appearing in the headlines of the news was visualized as a graph. The thickness of the line on the graph is proportional to the frequency of co-occurrence, and if the frequency of two words appearing at the same time is high, it is indicated by a thick line. It can be seen that the 'COVID-blue' pair is displayed in the boldest, and the 'COVID-emotion' and 'COVID-anxiety' pairs are displayed with a relatively thick line. 'Blue' related to COVID-19 is a word that means depression, and it was confirmed that COVID-19 and depression are keywords that should be of interest now. The research methodology used in this study has the convenience of being able to quickly measure social phenomena and changes while reducing costs. In this study, by analyzing news headlines, we were able to identify people's feelings and perceptions on issues related to COVID-19 depression, and identify the main agendas to be analyzed by deriving important keywords. By presenting and visualizing the subject and important keywords related to the COVID-19 emotion at a time, medical policy managers will be able to be provided a variety of perspectives when identifying and researching the regarding phenomenon. It is expected that it can help to use it as basic data for support, treatment and service development for psychological quarantine issues related to COVID-19.

전 세계적으로 퍼진 코로나 19 상황은 우리의 일상생활의 많은 부분에 영향을 끼쳤을 뿐만 아니라, 경제·사회 등 많은 부분에 걸쳐 막대한 영향력을 미치고 있다. 확진자와 사망자 수가 증가함에 따라 의료진과 대중은 불안, 우울, 스트레스 등 심리적인 문제를 겪고 있다고 한다. 장기적인 부정적인 감정은 사람들의 면역력을 감소시키고 신체적인 균형을 파괴할 수도 있으므로 코로나 19로 인한 심리적인 상태를 이해하는 것이 필수적인 상황이다. 본 연구에서는 코로나 19 감정과 관련된 뉴스 데이터를 수집하여, 텍스트 마이닝을 통해 키워드를 분류하고, 키워드 사이의 의미 네트워크 분석을 통해 단어들의 관계를 시각화하였다. 코로나 감정과 관련된 기사의 키워드에 나타난 단어들의 빈도수를 확인하고 이를 워드 클라우드로 분석하였다. 키워드 빈도 분석 결과 코로나 19 감정과 관련하여 '중국', '불안', '상황', '마음', '사회', '건강'과 같은 단어의 빈도가 높게 나타난 것을 확인할 수 있었다. 각 데이터 간 연결 중심성을 분석한 결과 키워드 중심성 네트워크에서 가장 중심적인 핵심어는 '심리'와 '코로나 19', '블루', '불안'이라는 단어가 높은 연결 중심성을 가지는 것을 확인할 수 있었다. 기사의 헤드라인에 나타난 주요 핵심어 사이의 동시 출현 빈도 네트워크를 그래프로 시각화한 결과, '코로나-블루' 쌍이 가장 굵게 표시되었고, '코로나-감정', '코로나-불안' 쌍이 비교적 굵은 선으로 표시된 것을 알 수 있었다. 코로나와 관련된 '블루'는 우울증을 의미하는 단어로, 코로나와 우울증은 이제 관심을 가져야 할 키워드임을 확인할 수 있었다. 본 연구에서는 장기화한 코로나 19 상황에서 신체적인 방역뿐만 아니라 심리적인 방역에도 힘써야 할 이 시기에 보건 정책담당자가 빠르고 복잡한 의사결정 과정에 도움이 되고자 미디어 뉴스를 모니터링 함으로써, 더욱더 쉬운 소셜 미디어 네트워크 분석 방법을 제시하고자 한다.

Keywords

References

  1. Arafa, A., Z. Mohammed, O. Mahmoud, M. Elshazley, and A. Ewis, "Depressed, anxious, and stressed: What have healthcare workers on the frontlines in Egypt and Saudi Arabia experienced during the COVID-19 pandemic?," Journal of Affective Disorders, Vol.278, (2021), 365-371. https://doi.org/10.1016/j.jad.2020.09.080
  2. Baht, S. S. and S. Milne, "Network effects on cooperation in destination website development," Tourism Management, Vol.29, No.6(2008), 1131-1140. https://doi.org/10.1016/j.tourman.2008.02.010
  3. Blei, D. M., A. Y. Ng, and M. I. Jordan, "Latent Dirichlet allocation," Journal of Machine Learning Research, Vol.3, (2003), 993-1022.
  4. Cha, Y. J., J. H. Lee, J. E. Choi, and H. W. Kim, "A Topic Modeling Approach to Marketing Strategies for Smartphone Companies," Knowledge Management Research, Vol.16, No.4(2015), 69-87. https://doi.org/10.15813/kmr.2015.16.4.005
  5. Chung, P. L., H. C. Ahn, and K. Y. Kwahk, "Identification of Core Features and Values of Smartphone Design using Text Mining and Social Network Analysis," Korean Journal of Business Administration, Vol.32, No.1(2019), 27-47.
  6. Diesner, J. and K. M. Carley, "Revealing Social Structure from Texts: Meta-Matrix Text Analysis as a Novel Method for Network Text Analysis," Causal Mapping for Research in Information Technology, (2005), 81-108.
  7. Gim, E. J. and J. H. Koo, "Analysis of Social Network Change Characteristics of Participants in Urban Regeneration Project Using Net Miner: Focused on the Urban Regeneration Leading Area in Suncheon-City," Journal of Information Technology Services, Vol 19. No.1(2020), 1-16. https://doi.org/10.9716/KITS.2020.19.1.001
  8. Guidry, J. P. D., M. Messner, S. L. Meganck, P. B. Perrin, A. Lovari, and K. E. Carlyle, "#Ebola: Tweeting and Pinning an Epidemic," Atlantic Journal of Communication, (2020), 1-16.
  9. Hwang, S. I. and M. K. Kim, "An Analysis of Artificial Intelligence(A.I.)_related Studies' Trends in Korea Focused on Topic Modeling and Semantic Network Analysis," Journal of Digital Contents Society, Vol.20, No.9(2019), 1847-1855. https://doi.org/10.9728/dcs.2019.20.9.1847
  10. Hwang, S. I. and Y. W. Park, "An Analysis of Arts Management-Related Studies' Trend in Korea using Topic Modeling and Semantic Network Analysis," Korean Association of Arts Management, Vol.50, No.1(2019), 5-31.
  11. J. A. Terrizzi Jr., N. J. Shook, and M. A. McDaniel, "The behavioral immune system and social conservatism: A meta-analysis," Evol. Hum. Behav, Vol.34, (2013), 99-108. https://doi.org/10.1016/j.evolhumbehav.2012.10.003
  12. Kim, N. G., D. H. Lee, H. C. Choi, and W. X. S. Wong, "Investigations on Techniques and Applications of Text Analytics," The Journal of Korean Institute of Communications and Information Sciences, Vol.42, No.2(2017), 471-492. https://doi.org/10.7840/kics.2017.42.2.471
  13. Kwahk, K. Y., Social Network Analysis, 2nd edition, CheongRam Publishers, 2019.
  14. Lee, J. M., M. N. Liu, and G. G. Lim, "A study on the revitalization of tourism industry through Big Data analysis," Journal of Intelligence and Information Systems, Vol.24, No.2(2018), 149-169. https://doi.org/10.13088/JIIS.2018.24.2.149
  15. Lee, S. M., S. E. Ryu, and S. J. Ahn, "Mass Media and Social Media Agenda Analysis Using Text Mining : focused on 5-day Rotation Mask Distribution System," The Journal of the Korea Contents Association, Vol.20, No.6(2020), 460-469. https://doi.org/10.5392/JKCA.2020.20.06.460
  16. Lee, S. H. and H. Y. Lee, "A Data Mining and Social Network Analysis to Understand Multi-Destination Tour Behavior of Inbound Free Independent Tourists in Seoul," Korean Journal of Business Administration Academic Conference, (2017), 321-334.
  17. Li, S., Y. Wang, J. Xue, N. Zhao, and T. Zhu, "The impact of COVID-19 epidemic declaration on psychological consequences: A study on active weibo users," International Journal of Environmental Research and Public Health, Vol.17, No.6(2020), 2032. https://doi.org/10.3390/ijerph17062032
  18. Liu, M. N. and G. G. Lim, "Word-of-Mouth Effect for Online Sales of K-Beauty Products: Centered on China SINA Weibo and Meipai," Journal of Intelligence and Information Systems Vol.25, No.1(2019), 197-218. https://doi.org/10.13088/JIIS.2019.25.1.197
  19. Liu, S., L. Yang, C. Zhang, Y. Xiang, Z. Liu, S. Hu, and B. Zhang, "Online mental health services in China during the COVID-19 outbreak," The Lancet Psychiatry, Vol.7, No.4(2020), e17-e18. https://doi.org/10.1016/s2215-0366(20)30077-8
  20. Norris, F. H., M. J. Friedman, and P. J. Watson, "60,000 Disaster Victims Speak: Part II. Summary and Implications of the Disaster Mental Health Research," Psychiatry, Vol.65, No.3(2002), 240-260. https://doi.org/10.1521/psyc.65.3.240.20169
  21. Park, C. S. and C. W. Chung, "Text Network Analysis: Detecting Shared Meaning through Socio-cognitive Networks of Policy Stakeholders," Institute of Governmental Studies, Vol.19, No.2(2013), 73-108.
  22. Park. B. E. and G. G. Lim, "A Study on the Impact Factors of Contents Diffusion in Youtube using Integrated Content Network Analysis," Journal of Intelligence and Information Systems Vol.21, No.3(2015), 19-36. https://doi.org/10.13088/jiis.2015.21.3.19
  23. Park, H. W. and L. Leydesdorff, "Understanding the KrKwic: A computer program for the analysis of Korean text," Journal of The Korean Data Analysis Society(JKDAS),Vol.6, No.5(2004), 1377-1387.
  24. Sivasankaran, B. and J. Aleksandar, "Mass hysteria revisited," Current Opinion in Psychiatry, Vol.19, No.2(2006), 171-174. https://doi.org/10.1097/01.yco.0000214343.59872.7a
  25. Sohn, D. W., Social Network Analysis, KyungMoon Publishers, 2002.
  26. Solvic, P., "Perception of risk," Science 1987, Vol.236, No.4799(1987), 280-285. https://doi.org/10.1126/science.3563507
  27. Shin, M. S., M. G. Park, and S. H. Bae, "Nano Technology Trend Analysis Using Google Trend and Data Mining Method for Nano-Informatics," Journal of Society of Korea Industrial and Systems Engineering, Vol. 40, No. 4(2017), 237-245. https://doi.org/10.11627/jkise.2017.40.4.237
  28. Wang, Y., Y. Di, J. Ye, and W. Wei, 'Study on the public psychological states and its related factors during the outbreak of coronavirus disease 2019 (COVID-19) in some regions of China," Psychology, Health & Medicine, Vol.26, No.1(2021), 13-22. https://doi.org/10.1080/13548506.2020.1746817
  29. Xiang, Y., Y. Yang, W. Li, L. Zhang, Q. Zhang, T. Cheung, and C. H. Ng, "Timely mental health care for the 2019 novel coronavirus outbreak is urgently needed," The Lancet Psychiatry, Vol.7, No.3(2020), 228-229. https://doi.org/10.1016/s2215-0366(20)30046-8
  30. Yoon, J. E. and C. J. Suh, "Research Trend Analysis by using Text-Mining Techniques on the Convergence Studies of AI and Healthcare Technologies," Journal of Information Technology Services, Vol 18. No.2(2019), 123-141. https://doi.org/10.9716/KITS.2019.18.2.123
  31. Yu, D. S. and G. G. Lim, "A Study on the eWOM and Selecting Movie According to Online Media and Replies," Journal of Information Technology Services, Vol.14, No.2(2015), 177-193. https://doi.org/10.9716/KITS.2015.14.2.177