DOI QR코드

DOI QR Code

Topic Modeling of Korean Newspaper Articles on Aging via Latent Dirichlet Allocation

  • Received : 2021.05.29
  • Accepted : 2022.01.14
  • Published : 2022.02.28

Abstract

The purpose of this study is to explore the structure of social discourse on aging in Korea by analyzing newspaper articles on aging. The analysis is composed of three steps: first, data collection and preprocessing; second, identifying the latent topics; and third, observing yearly dynamics of topics. In total, 1,472 newspaper articles that included the word "aging" within the title were collected from 10 major newspapers between 2006 and 2019. The underlying topic structure was analyzed using Latent Dirichlet Allocation (LDA), a topic modeling method widely adopted by text mining academics and researchers. Seven latent topics were generated from the LDA model, defined as social issues, death, private insurance, economic growth, national debt, labor market innovation, and income security. The topic loadings demonstrated a clear increase in public interest on topics such as national debt and labor market innovation in recent years. This study concludes that media discourse on aging has shifted towards more productivity and efficiency related issues, requiring older people to be productive citizens. Such subjectivation connotes a decreased role of the government and society by shifting the responsibility to individuals not being able to adapt successfully as productive citizens within the labor market.

Keywords

Acknowledgement

Funding for this paper was provided by Namseoul University year 2020.

References

  1. Agrawal, A., Fu, W., & Menzies, T. (2018). What is wrong with topic modeling? And how to fix it using search-based software engineering. Information and Software Technology, 98, 74-88. https://doi.org/10.1016/j.infsof.2018.02.005
  2. Angermuller, J. (2015). Discourse studies. In J. Wright, (Ed.), Encyclopedia of the social and behavioral sciences (2nd ed., pp.510-515). Elsevier.
  3. Baek, Y. M. (2019). Text-mining using R. Hanul Academy.
  4. Big KINDS. (2017). Korea Press Foundation. www.bigkinds.or.kr
  5. Blei, D. M., Ng, A., & Jordan, M. (2003). Latent Dirichlet Allocation. Journal of Machine Learning Research, 3, 993-1022.
  6. Blei, D. M., & Lafferty, J. D. (2006). Dynamic topic models. In A. Pohoreckhy, L. Bottou, & M. L. Littman (Eds.), Proceedings of the International Conference on Machine Learning. 113-120. http://doi.org/10.1145.1143844.1143859 https://doi.org/10.1145.1143844.1143859
  7. Diamond, I., & Quinby, L. (1988). Feminism and Foucault: Reflections on resistance. Northeastern University Press.
  8. DiMaggio, P., Nag, M., & Blei, D. (2013). Exploiting affinities between topic modeling and the sociological perspective on culture: Application to newspaper coverage of U.S. government arts funding. Poetics, 41(6), 570-606. https://doi.org/10.1016/j.poetic.2013.08.004
  9. Han, G., & Yoon, S.(2007). Daejoong maechaeaeseo shinnonyeon damlon boonseok [Critical review of discourse on aging in Korean newspapers]. Journal of the Korean Gerontological Society, 27(2), 299-322. UCI: G704-000573.2007.27.2.002
  10. Jang, M. J. (2013). Media damoonhwa damlon boonseok [Multicultural discourse analysis in media]. Multicultural Education Studies, 6(3), 157-179. http://dx.doi.org/10.14328/MES.2013.09.30.161
  11. Kim, E. J. (2017). Mediaga pyobanghaneun goryeonghwa sahoieui balamjikhan noinsang [How media makes the elderly into welcoming citizens in the aged society]. Korean Journal of Journalism and Communications, 61(3), 157-188. http://doi.org/10.20879/kjjcs.2017.61.3.005
  12. Lee, B. H., & Kim, G. Y. (2019). Social big dataro bon noinguanlyun damloneui teukjinggua byunhwa [The themes and trends of discourse on the elderly in Korea identified by analyzing social media big data]. Social Welfare Policy, 46(3), 179-201. https://doi.org/10.15855/swp.2019.46.3.179
  13. Liu, L., Tang, L., Dong, W., Yao, S., & Zhou, W. (2016). An overview of topic modeling and its current applications in bioinformatics. SpringerPlus, 5(1608). http://doi.org/10.1186/s40064-016-3252-8
  14. McFarland, D. A., Ramage, D., Chuang, J., Heer, J., Manning, C. D., & Jurafsky, D. (2013). Differentiating language usage through topic models. Poetics, 41(6), 607-625. https://doi.org/10.1016/j.poetic.2013.06.004
  15. Park, H. J., Kim, H., & Hong, Y. J. (2017). Topic modellingeul hwalyonghan haksaeng inkwonjoryeeui sahoejeok issue boonseok [A topic modeling analysis on the major social issues of the Students' Human Rights Ordinance in Korea]. Asian Journal of Education, 18(4), 683-711. https://doi.org/10.15753/aje.2017.12.18.4.683
  16. Roh, B. R. & Yang, K. E. (2019). Hangook sahoe jeochoolsan noneui gujowa ke byonhwane gwanhan text mining boonseok [Text mining analysis of South Korea's birth-rate decline issue in newspaper articles: Transition patterns over 18 years]. Korean Journal of Social Welfare, 71(4), 154-176. https://doi.org/10.20970/kasw.2019.71.4.006
  17. Silge, J., & Robinson, D. (2017). Text Mining with R: A Tidy Approach. O'Reilly.
  18. Sutherland, I. & Kiatkawsin, K. (2020). Determinants of guest experience in Airbnb: A topic modeling approach using LDA. Sustainability, 12(8), 3402. https://doi.org/10.3390/su12083402
  19. Statistics Korea. (n.d.) http://kosis.kr