• Title/Summary/Keyword: LDA토픽모델링

Search Result 229, Processing Time 0.025 seconds

Analysis of Social Issues of the Newspaper Articles on Gyeongju Earthquakes (신문기사에 나타난 경주지진 사건의 사회적 이슈분석)

  • Lee, Soo-Sang
    • Journal of Korean Library and Information Science Society
    • /
    • v.48 no.2
    • /
    • pp.53-72
    • /
    • 2017
  • The purpose of this study is to analyze types and features social issues about the Gyeongju earthquakes 2016, South Korea. The specific purpose is to identify types of topics related to Gyeongju Earthquakes, changes of topics over time, and the differences of topics depending on the each type of newspapers. According to the result of topic modeling, 55 topics were extracted. The result of this study is following these. First, the main topics have been changed with the course of time. In September, various topics were emerged, specifically urgent issues was found during two weeks after the first earthquake. After October, topics about social problems derived from the earthquakes received much attention at that time. Topics related to safety problems about nuclear plant have steadily found in all period. Second, topics varied depending whether the newspaper is national or regional. Also, differences of topics were found when dividing the newspapers by their characteristics considered conservative or liberal.

A System for Keyword Extraction and Keyword-based Sentiment Analysis for Topic Analysis in Discussion (토론 대화에서의 토픽 분석을 위한 키워드 추출 및 키워드 기반 감성분석 시스템)

  • Yong-Bin Jeong;Yu-Jin Oh;Jae-Wan Park;Sae-Mi Jang;Young-Gyun Hahm
    • Annual Conference on Human and Language Technology
    • /
    • 2022.10a
    • /
    • pp.164-169
    • /
    • 2022
  • 토픽 모델링은 비즈니스 분석이나 기술 동향 파악 등 다방면에서 많이 사용되고 있는 기술이다. 하지만 대표적인 방법인 LDA와 같은 비지도학습의 경우, 그 알고리즘 구조상 문서의 수가 많을 때 토픽 모델링이 가능하다. 본 논문에서는 문서의 수가 적은 경우도, 키워드 및 키프레이즈를 이용한 군집화를 통해 토픽 모델링을 하고 감성분석을 통해 토픽에 대한 분석도 제시하였다. 이에 필요한 데이터 제작 및 키워드 추출, 키워드 기반 감성분석, 키워드 임베딩 및 군집화를 구현하였고, 결과를 정성적으로 보았을 때 유의미한 분석이 되는 것을 확인하였다.

  • PDF

An analysis of the change in media's reports and attitudes about face masks during the COVID-19 pandemic in South Korea: a study using Big Data latent dirichlet allocation (LDA) topic modelling (빅데이터 LDA 토픽 모델링을 활용한 국내 코로나19 대유행 기간 마스크 관련 언론 보도 및 태도 변화 분석)

  • Suh, Ye-Ryoung;Koh, Keumseok Peter;Lee, Jaewoo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.5
    • /
    • pp.731-740
    • /
    • 2021
  • This study applied LDA topic modeling analysis to collect and analyze news media big data related to face masks in the three waves of the COVID-19 pandemic in Korea. The results empirically show that media reports focused on mask production and distribution policies in the first wave and the mandatory mask wearing in the second wave. In contrast, more reports on trivial, gossipy events consist of the media coverage in the second and third waves. The findings imply that Korea's governmental interventions to address the shortage of face masks and to regulate mask wearing were successful relatively in a short time. In contrast, the study also reports that there may be relative less number of science-based news reports like the ones on the effectiveness of face masks or different levels of filter types. This study exemplifies how a big data analysis can be applied to evaluate and enhance public health communication.

Trend Analysis of Dance Performance Research Using Keywords and Topic Modeling of LDA Techniques (LDA 토픽 모델링 기법을 활용한 무용공연의 연구 동향 분석)

  • SI YU
    • Journal of Industrial Convergence
    • /
    • v.22 no.3
    • /
    • pp.13-25
    • /
    • 2024
  • This study explores research topics related to dance performances published in Korea based on big data and examines research trends that change according to the trend of the times. The results derived from topic modeling analysis are as follows. (1) Six major topics were derived: a study on marketing strategies and development plans for dance performances, (2) a study on the re-watching factors of dance performance space and performance satisfaction, (3) a study on the popularity and contribution of dance performances in the stage environment, (4) a study on the current status of dance performances and the convergence of dance group operations, (5) a study on the definition of dance performances using various social media, and (6) a study on the direction and development of technology-applied dance performance contents. Accordingly, research trends and topics related to dance, including dance performances, social changes, key keywords of researchers' change interests were extracted, and keywords were compared and analyzed to present academic changes and countermeasures. Accordingly, the need for research to apply new technologies was emphasized as it diversified and fused.

An Analysis of the Support Policy for Small Businesses in the Post-Covid-19 Era Using the LDA Topic Model (LDA 토픽 모델을 활용한 포스트 Covid-19 시대의 소상공인 지원정책 분석)

  • Kyung-Do Suh;Jung-il Choi;Pan-Am Choi;Jaerim Jung
    • Journal of Industrial Convergence
    • /
    • v.22 no.6
    • /
    • pp.51-59
    • /
    • 2024
  • The purpose of the paper is to suggest government policies that are practically helpful to small business owners in pandemic situations such as COVID-19. To this end, keyword frequency analysis and word cloud analysis of text mining analysis were performed by crawling news articles centered on the keywords "COVID-19 Support for Small Businesses", "The Impact of Small Businesses by Response System to COVID-19 Infectious Diseases", and "COVID-19 Small Business Economic Policy", and major issues were identified through LDA topic modeling analysis. As a result of conducting LDA topic modeling, the support policy for small business owners formed a topic label with government cash and financial support, and the impact of small business owners according to the COVID-19 infectious disease response system formed a topic label with a government-led quarantine system and an individual-led quarantine system, and the COVID-19 economic policy formed a topic label with a policy for small business owners to acquire economic crisis and self-sustainability. Focusing on the organized topic label, it was intended to provide basic data for small business owners to understand the damage reduction policy for small business owners and the policy for enhancing market competitiveness in the future pandemic situation.

A Study on Analysis of Topic Modeling using Customer Reviews based on Sharing Economy: Focusing on Sharing Parking (공유경제 기반의 고객리뷰를 이용한 토픽모델링 분석: 공유주차를 중심으로)

  • Lee, Taewon
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.25 no.3
    • /
    • pp.39-51
    • /
    • 2020
  • This study will examine the social issues and consumer awareness of sharing parking through the method text mining. In this experiment, the topic by keyword was extracted and analyzed using TFIDF (Term frequency inverse document frequency) and LDA (Latent dirichlet allocation) technique. As a result of categorization by topic, citizens' complaints such as local government agreements, parking space negotiations, parking culture improvement, citizen participation, etc., played an important role in implementing shared parking services. The contribution of this study highly differentiated from previous studies that conducted exploratory studies using corporate and regional cases, and can be said to have a high academic contribution. In addition, based on the results obtained by utilizing the LDA analysis in this study, there is a practical contribution that it can be applied or utilized in establishing a sharing economy policy for revitalizing the local economy.

Analysis of trends in information security using LDA topic modeling

  • Se Young Yuk;Hyun-Jong Cha;Ah Reum Kang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.7
    • /
    • pp.99-107
    • /
    • 2024
  • In an environment where computer-related technologies are rapidly changing, cyber threats continue to emerge as they are advanced and diversified along with new technologies. Therefore, in this study, we would like to collect security-related news articles, conduct LDA topic modeling, and examine trends. To that end, news articles from January 2020 to August 2023 were collected and major topics were derived through LDA analysis. After that, the flow by topic was grasped and the main origin was analyzed. The analysis results show that ransomware attacks in 2021 and hacking of virtual asset exchanges in 2023 are major issues in the recent security sector. This allows you to check trends in security issues and see what research should be focused on in the future. It is also expected to be able to recognize the latest threats and support appropriate response strategies, contributing to the development of effective security measures.

An analysis of domestic research trends of mathematics curriculum research through topic modeling: Focused on domestic journals published from 1997 to 2019 (토픽모델링을 활용한 국내 수학과 교육과정 연구 동향 분석 : 1997년부터 2019년까지 게재된 국내 수학교육 학술지 논문을 중심으로)

  • Son, Taekwon;Lee, Kwangho
    • The Mathematical Education
    • /
    • v.59 no.3
    • /
    • pp.201-216
    • /
    • 2020
  • This study analyzed 493 domestic mathematics curriculum articles published in KCI's listings from 1997 to 2019 using LDA topic modeling. As a result, domestic mathematics curriculum research could be categorized into eight topics such as 'context in a curriculum', 'analysis a curriculum by the mathematical concept', 'form, system, meaning, and character of a curriculum', 'instruction and application of a curriculum', 'implementation and evaluation of a curriculum', 'tasks in a curriculum', 'analysis of a curriculum based on ability', 'compare and analysis curriculum and textbook'. The topic 'implementation and evaluation of a curriculum' was identified with the lowest proportion. Also, we performed the simple regression analysis with the weight of topics in the application period of the curriculum, and 'analysis of a curriculum based on ability' appeared as a 'hot topic'. Furthermore, topics appeared differently depending on the application period of the curriculum. Some of the appeared topics showed a tendency to match the emphasis of the highlight in a mathematics curriculum. Based on the results, future studies should develop frameworks for mathematics curriculum studies and extend the field of mathematical curriculum studies to make progress. Furthermore, future studies are needed to examine the enactment, feedback, and competency evaluation in the mathematical curriculum.

An Analysis of Civil Complaints about Traffic Policing Using the LDA Model (토픽모델링을 활용한 교통경찰 민원 분석)

  • Lee, Sangyub
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.20 no.4
    • /
    • pp.57-70
    • /
    • 2021
  • This study aims to investigate the security demand about the traffic policing by analyzing civil complaints. Latent Dirichlet Allocation(LDA) was applied to extract key topics for 2,062 civil complaints data related to traffic policing from e-People. And additional analysis was made of reports of violations, which accounted for a high proportion. In this process, the consistency and convergence of keywords and representative documents were considered together. As a result of the analysis, complaints related to traffic police could be classified into 41 topics, including traffic safety facilities, passing through intersections(signals), provisional impoundment of vehicle plate, and personal mobility. It is necessary to strengthen crackdowns on violations at intersections and violations of motorcycles and take preemptive measures for the installation and operation of unmanned traffic control equipments, crosswalks, and traffic lights. In addition, it is necessary to publicize the recently amended laws a implemented policies, e-fine, procedure after crackdown.

Comparison of policy perceptions between national R&D projects and standing committees using topic modeling analysis : focusing on the ICT field (토픽모델링 분석을 활용한 국가연구개발사업과제와 국회 상임위원회 사이의 정책 인식 비교 : ICT 분야를 중심으로)

  • Song, Byoungki;Kim, Sangung
    • Journal of Industrial Convergence
    • /
    • v.20 no.7
    • /
    • pp.1-11
    • /
    • 2022
  • In this paper, numerical values are derived using topic modeling among data-based evaluation methodologies discussed by various research institutes. In addition, we will focus on the ICT field to see if there is a difference in policy perception between the national R&D project and standing committee. First, we create model for classifying ICT documents by learning R&D project data using HAN model. And we perform LDA topic modeling analysis on ICT documents classified by applying the model, compare the distribution with the topics derived from the R&D project data and proceedings of standing committees. Specifically, a total of 26 topics were derived. Also, R&D project data had professionally topics, and the standing committee-discuss relatively social and popular issues. As the difference in perception can be numerically confirmed, it can be used as a basic study on indicators that can be used for future policy or project evaluation.