DOI QR코드

DOI QR Code

Analyzing the Issue Life Cycle by Mapping Inter-Period Issues

기간별 이슈 매핑을 통한 이슈 생명주기 분석 방법론

  • 임명수 (국민대학교 비즈니스IT전문대학원) ;
  • 김남규 (국민대학교 경영대학 경영정보학부)
  • Received : 2014.11.12
  • Accepted : 2014.12.07
  • Published : 2014.12.30

Abstract

Recently, the number of social media users has increased rapidly because of the prevalence of smart devices. As a result, the amount of real-time data has been increasing exponentially, which, in turn, is generating more interest in using such data to create added value. For instance, several attempts are being made to analyze the relevant search keywords that are frequently used on new portal sites and the words that are regularly mentioned on various social media in order to identify social issues. The technique of "topic analysis" is employed in order to identify topics and themes from a large amount of text documents. As one of the most prevalent applications of topic analysis, the technique of issue tracking investigates changes in the social issues that are identified through topic analysis. Currently, traditional issue tracking is conducted by identifying the main topics of documents that cover an entire period at the same time and analyzing the occurrence of each topic by the period of occurrence. However, this traditional issue tracking approach has two limitations. First, when a new period is included, topic analysis must be repeated for all the documents of the entire period, rather than being conducted only on the new documents of the added period. This creates practical limitations in the form of significant time and cost burdens. Therefore, this traditional approach is difficult to apply in most applications that need to perform an analysis on the additional period. Second, the issue is not only generated and terminated constantly, but also one issue can sometimes be distributed into several issues or multiple issues can be integrated into one single issue. In other words, each issue is characterized by a life cycle that consists of the stages of creation, transition (merging and segmentation), and termination. The existing issue tracking methods do not address the connection and effect relationship between these issues. The purpose of this study is to overcome the two limitations of the existing issue tracking method, one being the limitation regarding the analysis method and the other being the limitation involving the lack of consideration of the changeability of the issues. Let us assume that we perform multiple topic analysis for each multiple period. Then it is essential to map issues of different periods in order to trace trend of issues. However, it is not easy to discover connection between issues of different periods because the issues derived for each period mutually contain heterogeneity. In this study, to overcome these limitations without having to analyze the entire period's documents simultaneously, the analysis can be performed independently for each period. In addition, we performed issue mapping to link the identified issues of each period. An integrated approach on each details period was presented, and the issue flow of the entire integrated period was depicted in this study. Thus, as the entire process of the issue life cycle, including the stages of creation, transition (merging and segmentation), and extinction, is identified and examined systematically, the changeability of the issues was analyzed in this study. The proposed methodology is highly efficient in terms of time and cost, as it sufficiently considered the changeability of the issues. Further, the results of this study can be used to adapt the methodology to a practical situation. By applying the proposed methodology to actual Internet news, the potential practical applications of the proposed methodology are analyzed. Consequently, the proposed methodology was able to extend the period of the analysis and it could follow the course of progress of each issue's life cycle. Further, this methodology can facilitate a clearer understanding of complex social phenomena using topic analysis.

최근 스마트 기기를 통해 소셜미디어에 참여하는 사용자가 급격히 증가하고 있다. 이에 따라 빅데이터 분석에 대한 관심이 높아지고 있으며 최근 포털 사이트에서 검색어로 자주 입력되거나 다양한 소셜미디어에서 자주 언급되는 단어에 대한 분석을 통해 사회적 이슈를 파악하기 위한 시도가 이루어 지고 있다. 이처럼 다량의 텍스트를 통해 도출된 사회적 이슈의 기간별 추이를 비교하는 분석을 이슈 트래킹이라 한다. 하지만 기존의 이슈 트래킹은 두 가지 한계를 가지고 있다. 첫째, 전통적 방식의 이슈 트래킹은 전체 기간의 문서에 대해 일괄 토픽 분석을 실시하고 각 토픽의 기간별 분포를 파악하는 방식으로 이루어지므로, 새로운 기간의 문서가 추가되었을 때 추가된 문서에 대해서만 분석을 추가 실시하는 것이 아니라 전체 기간의 문서에 대한 분석을 다시 실시해야 한다는 실용성 측면의 한계를 갖고 있다. 둘째, 이슈는 끊임 없이 생성되고 소멸될 뿐 아니라, 때로는 하나의 이슈가 둘 이상의 이슈로 분화하고 둘 이상의 이슈가 하나로 통합되기도 한다. 즉, 이슈는 생성, 변화(병합, 분화), 그리고 소멸의 생명주기를 갖게 되는데, 전통적 이슈 트래킹은 이러한 이슈의 가변성을 다루지 않았다는 한계를 갖는다. 본 연구에서는 이러한 한계를 극복하기 위해 대상 기간 전체의 문서를 한꺼번에 분석하는 방식이 아닌 세부 기간별 문서에 대해 독립적인 분석을 수행하고 이를 통합할 수 있는 방안을 제시하였으며, 이를 통해 새로운 이슈가 생성되고 변화하며 소멸되는 전체 과정을 규명하였다. 또한 실제 인터넷 뉴스에 대해 제안 방법론을 적용함으로써, 제안 방법론의 실무 적용 가능성을 분석하였다.

Keywords

References

  1. Albright, R., Taming Text with the SVD, SAS Institute Inc., Cary, NC, 2004.
  2. Bae, J.-h., J.-e. Son, and M. Song, "Analysis of Twitter for 2012 South Korea Presidential Election by Text Mining Techniques," Journal of Intelligence and Information Systems, Vol.19, No.3(2013), 141-156. https://doi.org/10.13088/jiis.2013.19.3.141
  3. Bae, J.-h., N.-g. Han, and M. Song, "Twitter Issue Tracking System by Topic Modeling Techniques," Journal of Intelligence and Information Systems, Vol.20, No.2(2014), 109-122. https://doi.org/10.13088/jiis.2014.20.2.109
  4. Ding, W. and C. Chen, "Dynamic topic detection and tracking: A comparison of HDP, C-word, and cocitation methods," Journal of the Association for Information Science and Technology, Vol.65, No.10(2014), 2084-2097. https://doi.org/10.1002/asi.23134
  5. Fan, W., L. Wallace, S. Rich, and Z. Zhang, "Tapping the Power of Text Mining," Communications of the ACM, Vol.49, No.9 2006), 76-82.
  6. Gartner, 2012 Hype Cycle for Emerging Technologies, Gartner Inc., Stamford, 2012.
  7. Han, J. and M. Kamber, Data Mining: Concepts and Techniques, 3nd, Morgan Kaufmann Publishers, San Francisco, 2011.
  8. Hong, J. S., H. S. Choi, H. J. Han, J. S. Kim, E. J. Yu, S. R. Lim, and N. Kim, "A Data Analysis-based Hybrid Methodology for Selecting Pending National Issue Keywords," Entrue Journal of Information Technology, Vol.13, No.1(2014), 97-111.
  9. Hong, J.-S., N. Kim, and S. Lee, "A Methodology for Automatic Multi - Categorization of Single - Categorized Documents,"Journal of Intelligence and Information Systems, Vol.20, No.3(2014), 77-92. https://doi.org/10.13088/jiis.2014.20.3.077
  10. Jin, H., R. Schwarts, S. Sista, and F. Walls, "Topic Tracking for Radio, TV Broadcast and Newswire," Proceedings of DARPA Broadcast News Workshop, (1999).
  11. Kim, J., N. Kim, and Y. Cho, "User-Perspective Issue Clustering Using Multi-Layered Twoode Network Analysis,"Journal of Intelligence and Information Systems, Vol.20, No.2(2014), 93-107.
  12. Ma, J., Y. Wang, H. Zhu, and Y. Shen, "Research on Method of Adaptive Topic Tracking Based on Evolution of Public Opinion Ontology," ACEEE International Journal on Information Technology, Vol.4, No.1(2014), 1-10.
  13. Metzler, D., Y. Bernstein, W. B. Croft, A. Moffat, and J. Zobel, "Similarity Measures for Tracking Information Flow," Proceedings of the 14th ACM international conference on Information and knowledge management, (2005), 517-524.
  14. Mooney, R. J. and R. Bunescu, "Mining Knowledge from Text using Information Extraction," ACM SIGKDD Explorations Newsletter, Vol.7, No.1(2005), 3-10.
  15. Park, J.-H. and M. Song, "A Study on the Research Trends in Library & Information Science in Korea using Topic Modeling," Journal of the Korean Society for Information Management, Vol.30, No.1(2013), 7-32. https://doi.org/10.3743/KOSIM.2013.30.1.007
  16. Rijsbergen, C. J. V., Information Retrieval, 2nd edition, Butterworths, London, 1979.
  17. Salton, G., A. Wong, and C. S. Yang, "A Vector Space Model for Automatic Indexing," Communications of the ACM, Vol.18, No.11 1975), 613-620. https://doi.org/10.1145/361219.361220
  18. Sebastiani, F., Classification of Text, Automatic, The Encyclopedia of Language and Linguistics 14, 2nd edition, Elsevier Science Pub, 2006.
  19. Song, S. M., J. S. Yu, and E. M. Kim, "Offering system for major article Using Text Mining and Data Mining," Proceedings of the 32th annual conference on Korea Information Processing Society, (2009), 733-734.
  20. Weiss, S. M., N. Indurkhya, and T. Zhang, Fundamentals of Predictive Text Mining, Springer, 2010.
  21. Witten, I. H., Text Mining: Practical Handbook of Internet Computing, CRC Press, Florida, 2005.
  22. Yu, E.-J., J.-C. Kim, C.-Y. Lee, and N.-G. Kim, "Using Ontologies for Semantic Text Mining," The Journal of Information System, Vol.21, No.3(2012), 137-161. https://doi.org/10.5859/KAIS.2012.21.3.137

Cited by

  1. A Methodology for Analyzing Public Opinion about Science and Technology Issues Using Text Analysis vol.14, pp.3, 2015, https://doi.org/10.9716/KITS.2015.14.3.033
  2. Investigating Dynamic Mutation Process of Issues Using Unstructured Text Analysis vol.22, pp.1, 2016, https://doi.org/10.13088/jiis.2016.22.1.01
  3. A Method for Evaluating News Value based on Supply and Demand of Information Using Text Analysis vol.22, pp.4, 2016, https://doi.org/10.13088/jiis.2016.22.4.045
  4. 동적 토픽분석을 활용한 스마트그리드 연구동향 분석 vol.66, pp.4, 2014, https://doi.org/10.5370/kiee.2017.66.4.613
  5. 텍스트 마이닝을 이용한 메이커 운동의 트렌드 분석 vol.18, pp.12, 2014, https://doi.org/10.5392/jkca.2018.18.12.468
  6. 머신러닝 및 딥러닝 연구동향 분석: 토픽모델링을 중심으로 vol.15, pp.2, 2014, https://doi.org/10.17662/ksdim.2019.15.2.019
  7. 인과관계문형 기반 사회이슈 발생원인 도출 방법 연구 vol.19, pp.3, 2014, https://doi.org/10.14400/jdc.2021.19.3.167