• Title/Summary/Keyword: 텍스트마이닝분석

Search Result 992, Processing Time 0.032 seconds

An Analysis of Effects of Emergency Fine Dust Reduction Measures and National Petition Using Regression Analysis and Text Mining (회귀분석과 텍스트마이닝을 활용한 미세먼지 비상저감조치의 실효성 및 국민청원 분석)

  • Kim, Annie;Jeong, So-Hee;Choi, Hyun-Bin;Kim, Hyon Hee
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.7 no.11
    • /
    • pp.427-434
    • /
    • 2018
  • Recently, the Seoul government implemented 'Free Public Transportation' policy and 'Citizen Participatory Alternative-Day-No-Driving' system as 'Emergency Fine Dust Reduction Measures'. In this paper, after identifying the effectiveness of the two traffic policies, suggestions for direction of future fine dust policy were made. The effect of traffic on the fine dust was analyzed by regression analysis and the responses to the two traffic policies and petitions were analyzed using text mining. Our experimental results show that the responses to the policy were mostly negative, and the influence of the domestic factors was considerable unlike expectation of citizens. Moreover, the result made us possible to know people's specific needs on fine dust reduction policy. Finally, based on the result, the suggestions for fine dust reduction policy direction were provided.

Analyzing Game Streaming Application Reviews Using Text Mining Approach: Research to Strengthen Digital Competitiveness (텍스트마이닝 기법을 활용한 게임 스트리밍 애플리케이션 리뷰 분석: 디지털 경쟁력 강화를 위한 연구)

  • Jin, Wenhui;Lee, Jungwoo
    • Journal of Digital Convergence
    • /
    • v.20 no.4
    • /
    • pp.279-290
    • /
    • 2022
  • As the growth of the live streaming service market is accelerating due to COVID-19, the number of downloads and reviews of live streaming mobile applications is also rapidly skyrocketing. This study is to research game streaming applications using Twitch reviews as database. A total of 8 topics are extracted through LDA topic modeling and 7 out of them are detected to be inconvenience factors. Then, to pinpoint the main inconvenience factors, co-occurrence analysis is used in order to find out main factors. Finally, based on previous studies, several solutions are provided, which can solve the inconvenience factors(advertisement, UI design, technology problems) as well as strengthening digital competitiveness. This study will serve as an opportunity to improve digital competitiveness not only for Twitch but also for other game live streaming service companies in the future.

Topic Analysis of the "Right to be Forgotten" Using Text Mining (텍스트마이닝을 활용한 "잊힐 권리"의 토픽 분석)

  • Lee, So-Hyun;Koo, Bon-Jin
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.2
    • /
    • pp.275-298
    • /
    • 2022
  • This study examined the issues and characteristics that appeared in news and journal articles related to the 'right to be forgotten' using text mining analysis. Data for analysis were collected from 2010 to 2020 with the keyword 'right to be forgotten'. Keyword analysis and topic modeling analysis were performed on the collected data. As a result, in the last 10 years the issues about 'right to be forgotten' are not much different in news and journal articles and the approaches also are similar. However, it confirmed common issues and the partial difference between news and journal articles through comparison. Therefore in Archives and Records Management Studies, it is necessary to discuss derived in this study. In particular common issues are considered first but if there are differences in issues, it is needed to discuss them in various ways. This study is meaningful to understand the meaning and to draw issues that may arise in the future of the 'right to be forgotten'. The results of this study will contribute to be variously discussed on the 'right to be forgotten' in Archives and Records Management Studies.

How National Water Management Plans lead Hydrological Survey Projects? (텍스트 마이닝을 이용한 국가 물관리 정책 변화 시점별 수문조사사업의 방향 분석)

  • Chan Woo Kim;Min Kuk Kim;Jung Hwan Koh;Seung Won Han;In Jae Choi;Dong Ho Hyun;Seok Geun Park
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.429-429
    • /
    • 2023
  • 우리나라의 물 관련 정책 방향이 환경 중심의 수자원 관리에서 친수공간 및 정보의 확보와 같은 안전한 물관리로 확대되면서 정책추진에 기초가 될 수 있는 신뢰도 높은 수문자료의 생산이 보다 중요시되고 있다. 국가 수문조사사업은 이러한 정책기조에 맞춰 제도적인 뒷받침과 함께 조사의 범위와 기술, 품질관리 등의 영역을 넓히며 그 기능을 활발히 하고 있으나, 물관리 정책의 경향에 따른 수문조사사업의 방향성과 특징을 구조적으로 살펴본 연구는 부족한 것으로 파악된다. 따라서 본 연구는 친수·친환경적 물관리가 강조된 시기('97~현재)를 중점으로 하여 물관리 정책과 관련 계획의 변화가 수문조사사업에 어떠한 영향을 주는지 고찰하였다. 이를 위해 물관리 여건의 변화에 따라 달라진 관련 정책별 주제어의 분포와 수문조사사업과 연관된 주요어의 출현빈도 및 경향을 살펴보고, 주요 연관어와 연계한 사업의 방향과 구조를 분석하였다. 분석자료로는 물관리 관련 법령 등의 제도와 언론기사자료, 정책별 추진방향을 활용하였다. 정책의 추진방향은 1) 수자원의 종합적 개발에서 친환경적 측면과 지속가능성이 강조된 수자원장기종합계획(3-1차~4-3차)과 2) 사람과 자연이 함께 고려된 맑고 안전한 물, 통합물관리 등의 전략이 수록된 국가물관리기본계획(1차), 3) 정책의 기조에 따라 수립 및 보완된 수문조사 기본계획(1~2차)을 바탕으로 하였다. R프로그램을 통한 텍스트 마이닝을 활용하여 각 자료에서의 주제어 분포와 출현빈도를 분석하고, 정책별 추진방향과 수문조사사업의 연계성을 나타내었다. 연구의 함의를 담은 결과로서 물관리 여건이 변화된 시점별 주요연관어를 중심으로 한 정책동향과 수문조사사업의 특징 및 방향을 요약·비교하여 제시하였으며, 이는 물관리 분야에서의 국정운영 목표와 연계하여 국가 수문조사사업의 사업성을 고찰하는 연구의 기반이 될 수 있으리라 생각된다.

  • PDF

An Analysis of Newspaper Articles on Fine Particle Matter Using Text Mining Techniques (텍스트마이닝을 이용한 미세먼지 관련 신문기사 분석)

  • Yang, Ji-Yeon
    • Journal of Digital Convergence
    • /
    • v.20 no.1
    • /
    • pp.1-13
    • /
    • 2022
  • This study aims to examine the trend and characteristics of newspaper articles concerned with fine particle matter. Newspaper articles since 1995 collected from Bigkinds were analyzed using text mining techniques, sentiment analysis and regression analysis. Air pollution measurement and domestic pollutants appeared frequently previously, but "China" became the keyword in the 2010s along with political action, the effects on the health, AD/PR, and domestic pollutants. Korea JoongAng Daily, Hankyoreh and Kyunghyang Shinmun have had more focused on political regulations whereas most regional daily newspapers on emission sources and reduction measures at the regional level. The results of this study are expected to be used as a reference for understanding the trend of newspaper articles. Future work includes further analysis and discussion of fine particle pollution condition and news reports in the post-COVID era.

Feature-selection algorithm based on genetic algorithms using unstructured data for attack mail identification (공격 메일 식별을 위한 비정형 데이터를 사용한 유전자 알고리즘 기반의 특징선택 알고리즘)

  • Hong, Sung-Sam;Kim, Dong-Wook;Han, Myung-Mook
    • Journal of Internet Computing and Services
    • /
    • v.20 no.1
    • /
    • pp.1-10
    • /
    • 2019
  • Since big-data text mining extracts many features and data, clustering and classification can result in high computational complexity and low reliability of the analysis results. In particular, a term document matrix obtained through text mining represents term-document features, but produces a sparse matrix. We designed an advanced genetic algorithm (GA) to extract features in text mining for detection model. Term frequency inverse document frequency (TF-IDF) is used to reflect the document-term relationships in feature extraction. Through a repetitive process, a predetermined number of features are selected. And, we used the sparsity score to improve the performance of detection model. If a spam mail data set has the high sparsity, detection model have low performance and is difficult to search the optimization detection model. In addition, we find a low sparsity model that have also high TF-IDF score by using s(F) where the numerator in fitness function. We also verified its performance by applying the proposed algorithm to text classification. As a result, we have found that our algorithm shows higher performance (speed and accuracy) in attack mail classification.

Trend Analysis of Fraudulent Claims by Long Term Care Institutions for the Elderly using Text Mining and BIGKinds (텍스트 마이닝과 빅카인즈를 활용한 노인장기요양기관 부당청구 동향 분석)

  • Youn, Ki-Hyok
    • Journal of Internet of Things and Convergence
    • /
    • v.8 no.2
    • /
    • pp.13-24
    • /
    • 2022
  • In order to explore the context of fraudulent claims and the measures for preventing them targeting the long-term care institutions for the elderly, which is increasing every year in Korea, this study conducted the text mining analysis using the media report articles. The media report articles were collected from the news big data analysis system called 'BIG KINDS' for about 15 years from July 2008 when the Long-Term Care Insurance for the Elderly took effect, to February 28th 2022. During this period of time, total 2,627 articles were collected under keywords like 'elderly care+fraudulent claims' and 'long-term care+fraudulent claims', and among them, total 946 articles were selected after excluding overlapped articles. In the results of the text mining analysis in this study, first, the top 10 keywords mentioned in the highest frequency in every section(July 1st 2008-February 28th 2022) were shown in the order of long-term care institution for the elderly, fraudulent claims, National Health Insurance Service, Long-Term Care Insurance for the Elderly, long-term care benefits(expenses), elderly care facilities, The Ministry of Health & Welfare, the elderly, report, and reward(payment). Second, in the results of the N-gram analysis, they were shown in the order of long-term care benefits(expenses) and fraudulent claims, fraudulent claims and long-care institution for the elderly, falsehood and fraudulent claims, report and reward(payment), and long-term care institution for the elderly and report. Third, the analysis of TF-IDF was similar to the results of the frequency analysis while the rankings of report, reward(payment), and increase moved up. Based on such results of the analysis above, this study presented the future direction for the prevention of fraudulent claims of long-term care institutions for the elderly.

A Study on the Current Situation and Trend Analysis of The Elderly Healthcare Applications Using Big Data Analysis (텍스트마이닝을 활용한 노인 헬스케어 앱 사용 추이 및 동향 분석)

  • Byun, Hyun;Jeon, Sang-Wan;YI, Eun-Surk
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.5
    • /
    • pp.313-325
    • /
    • 2022
  • The purpose of this study is to examine the changes in the elderly healthcare app market through text mining analysis and to present basic data for activating elderly healthcare apps. Data collection was conducted on Naver, Daum, blog web, and cafe. As for the research method, text mining, TF-IDF(Term frequency-inverse document frequency), emotional analysis, and semantic network analysis were conducted using Textom and Ucinet6, which are big data analysis programs. As a result of this study, a total of six categories were finally derived: resolving the healthcare app information gap, convergence healthcare technology, diffusion media, elderly healthcare app industry, social background, and content. In conclusion, in order for elderly healthcare apps to be accepted and utilized by the elderly, they must have a good diffusion infrastructure, and the effectiveness of healthcare apps must be maximized through the active introduction of convergence technology and content development that can be easily used by the elderly.

A Study on Research Topics for Thyroid Cancer in Korea (국내 갑상선암 연구 주제 동향 분석)

  • Yang, Ji-Yeon;Shin, Seung-Hyeok;Heo, Seong-Min;Lee, Tae-Gyeong
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2019.01a
    • /
    • pp.409-410
    • /
    • 2019
  • 본 논문에서는 국내 갑상선암의 연구 동향을 파악하기 위해 텍스트 중심의 접근법을 제안한다. 국내 갑상선암은 2000년대에 들어서며 발생이 급증하여 과잉진단의 논란을 불러일으켰으나, 다양한 분야의 자정 노력으로 수술 환자수가 크게 줄었다. 본 연구에서는 텍스트 마이닝 기술을 사용하여 디비피아에 등록되어 있는 갑상선암 관련 논문의 키워드와 초록을 수집하여 분석하였다. 1980년대는 대부분의 사례보고가 있었고 1990년대에 들어서면서 검진을 통한 조기 진단의 내용이 자주 나타났다. 2000년대에는 여러 장비들을 활용한 검사방법과 미세한 암의 발견에 대한 논의가 증가하였음을 확인 할 수 있었다. 2010년대에 들어서는 환자의 삶의 질에 대한 연구가 많이 이루어졌다. 지난 수십 년 동안 갑상선 암 연구 주제에 대해 뚜렷한 변화가 나타났으며, 향후 연구의 기초자료로 활용될 수 있으리라 기대된다.

  • PDF

A study on the method of deriving the cause of social issues based on causal sentences (인과관계문형 기반 사회이슈 발생원인 도출 방법 연구)

  • Lee, Namyeon;Lee, Jae Hyung
    • Journal of Digital Convergence
    • /
    • v.19 no.3
    • /
    • pp.167-176
    • /
    • 2021
  • With development of big data analysis technology, many studies to find social issues using texts mining techniques have been conducted. In order to derive social issues, previous studies performed in a way that collects a large amount of text data from news or SNS, and then analyzes issues based on text mining techniques such as topic modeling and terms network analysis. Social issues are the results of various social phenomena and factors. However, since previous studies focused on deriving social issues that are results of various causes, there are limitations to revealing the cause of the issues. In order to effectively respond to social issues, it is necessary not only to derive social issues, but also to be able to identify the causes of social issues. In this study, in order to overcome these limitations, we proposed a method of deriving the factors that cause social issues from texts related to social issues based on the theory of part of Korean linguistics. To do this, we collected news data related to social issues for three years from 2017 to 2019 and proposed a methodology to find causes based causal sentences based on text mining techniques.