• Title/Summary/Keyword: news big data

Search Result 290, Processing Time 0.025 seconds

Big Data News Analysis in Healthcare Using Topic Modeling and Time Series Regression Analysis (토픽모델링과 시계열 회귀분석을 활용한 헬스케어 분야의 뉴스 빅데이터 분석 연구)

  • Eun-Jung Kim;Suk-Gwon Chang;Sang-Yong Tom Lee
    • Information Systems Review
    • /
    • v.25 no.3
    • /
    • pp.163-177
    • /
    • 2023
  • This research aims to identify key initiatives and a policy approach to support the industrialization of the sector. The research collected a total of 91,873 news data points relating to healthcare between 2013 to 2022. A total of 20 topics were derived through topic modeling analysis, and as a result of time series regression analysis, 4 hot topics (Healthcare, Biopharmaceuticals, Corporate outlook·Sales, Government·Policy), 3 cold topics (Smart devices, Stocks·Investment, Urban development·Construction) derived a significant topic. The research findings will serve as an important data source for government institutions that are engaged in the formulation and implementation of Korea's policies.

Sentiment Analysis of Elderly and Job in the Demographic Cliff (인구절벽사회에서 노인과 일자리 감성분석)

  • Kim, Yang-Woo
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.11
    • /
    • pp.110-118
    • /
    • 2020
  • Social media data serves as a proxy indicator to understand the problems and the future of public opinion in Korean society. This research used 109,015 news data from 2016 to 2018 to analyze the sensitivity of the elderly and employment in Korean society, and explored the possibility of expanding the labor force in Korean society, which is facing a cliff between the elderly and the population. Topic keywords for employment of the elderly include "elderly*employment", "elderly*employment", and "elderly*wage". As a result of the analysis, positive sensitivity prevails for most of the period, and it is possible to expand the working-age population. Positive feelings about expanding employment opportunities for the elderly and negative feelings about low wages have brought to light the reality of the elderly who are still poor despite their work. In this study, social big data was used to analyze the perceptions and sensibilities of Korean society related to the elderly and employment through hierarchical crowd analysis and related text mining analysis.

Machine Learning based Firm Value Prediction Model: using Online Firm Reviews (머신러닝 기반의 기업가치 예측 모형: 온라인 기업리뷰를 활용하여)

  • Lee, Hanjun;Shin, Dongwon;Kim, Hee-Eun
    • Journal of Internet Computing and Services
    • /
    • v.22 no.5
    • /
    • pp.79-86
    • /
    • 2021
  • As the usefulness of big data analysis has been drawing attention, many studies in the business research area begin to use big data to predict firm performance. Previous studies mainly rely on data outside of the firm through news articles and social media platforms. The voices within the firm in the form of employee satisfaction or evaluation of the strength and weakness of the firm can potentially affect firm value. However, there is insufficient evidence that online employee reviews are valid to predict firm value because the data is relatively difficult to obtain. To fill this gap, from 2014 to 2019, we employed 97,216 reviews collected by JobPlanet, an online firm review website in Korea, and developed a machine learning-based predictive model. Among the proposed models, the LSTM-based model showed the highest accuracy at 73.2%, and the MAE showed the lowest error at 0.359. We expect that this study can be a useful case in the field of firm value prediction on domestic companies.

Big Data Analysis on Daegu-Gyeongbuk Administrative Integration (대구·경북 행정통합에 대한 빅데이터 분석)

  • Song, Hwa Young;Park, Han Woo
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.5
    • /
    • pp.139-148
    • /
    • 2021
  • The study examines public attitude and reaction regarding administrative integration in Daegu and Gyeongbuk area. Specifically, it employs social big data including textual comments on online news articles and YouTube video clips. The collected data are analyzed in order to compare two periods, that is, before and after the inauguration of the Public Opinion Committee for One Daegu-Gyeongbuk. As a result, we have found that people's favorable response to administrative integration has gradually increased since the launch of the Committee. However, it still lacks specific administrative procedures and discussion topics among the frequently used words in the collected data. Thus, the Committee needs to provide a variety of information and materials related to administrative integration.

Analyzing the Relevancy of Policy by Abnormal Pattern Analysis : Focused on the Case of S-City's e-Card for Child Meal Support (이상 패턴 분석을 통한 정책의 적합성 분석 연구 : S 시의 아동 급식 전자 카드 사례를 중심으로)

  • Jeon, Jongshik;Kwon, Ohbyung
    • Journal of Information Technology Services
    • /
    • v.17 no.1
    • /
    • pp.135-153
    • /
    • 2018
  • E-Card Service for Child Nutrition Program is one of the main public policy services nowadays. In case of inconvenience during the use of the e-cards, it is recommended to cooperate with related organizations in order to promptly handle and provide guidance, and thoroughly manage child feeding service such as hygiene, nutrition and kindness etc. To do so, it is very important to provide food service that meets local actual conditions and children's needs in a cost effective manner for the underage who are worried about the poorly-fed by understanding the pattern of child feeding e-card service. Hence. this paper aims to investigate how child feeding e-card service efficiently provides meals according to the local situation and children's needs through big data analysis and to propose a method of identifying welfare conditions according to the purpose of service with actual application examples. The results suggest that, first of all, this study is able to judge appropriateness of public institution's policy in a timely and repetitive manner through non-standard data analysis such as Naver News and transaction data. Secondly, this paper proposes a multi-layered analysis framework, which performs online open data analysis to detect policy issues, visualizes retrieval and preprocessing of real data, and performs abnormal pattern recognition. These will be worthy of reference to other similar projects.

Multivariate Analysis of Factors for Search on Suicide Using Social Big Data (소셜 빅 데이터를 활용한 자살검색 요인 다변량 분석)

  • Song, Tae Min;Song, Juyoung;An, Ji-Young;Jin, Dallae
    • Korean Journal of Health Education and Promotion
    • /
    • v.30 no.3
    • /
    • pp.59-73
    • /
    • 2013
  • Objectives: The study is aimed at examining the individual reasons and regional/environmental factors of online search on suicide using social big data to predict practical behaviors related to suicide and to develop an online suicide prevention system on the governmental level. Methods: The study was conducted using suicide-related social big data collected from online news sites, blogs, caf$\acute{e}$s, social network services and message boards between January 1 and December 31, 2011 (321,506 buzzes from users assumed as adults and 67,742 buzzes from those assumed as teenagers). Technical analysis and development of the suicide search prediction model were done using SPSS 20.0, and the structural model, nd multi-group analysis was made using AMOS 20.0. Also, HLM 7.0 was applied for the multilevel model analysis of the determinants of search on suicide by teenagers. Results: A summary of the results of multivariate analysis is as follows. First, search on suicide by adults appeared to increase on days when there were higher number of suicide incidents, higher number of search on drinking, higher divorce rate, lower birth rate and higher average humidity. Second, search on suicide by teenagers rose on days when there were higher number of teenage suicide incidents, higher number of search on stress or drinking and less fine dust particles. Third, the comparison of the results of the structural equation model analysis of search on suicide by adults and teenagers showed that teenagers were more likely to proceed from search on stress to search on sports, drinking and suicide, while adults significantly tended to move from search on drinking to search on suicide. Fourth, the result of the multilevel model analysis of determinants of search on suicide by teenagers showed that monthly teenagers suicide rate and average humidity had positive effect on the amount of search on suicide. Conclusions: The study shows that both adults and teenagers are influenced by various reasons to experience stress and search on suicide on the Internet. Therefore, we need to develop diverse school-level programs that can help relieve teenagers of stress and workplace-level programs to get rid of the work-related stress of adults.

Social Factors Affecting Internet Searches on Cyber Bullying in Korea and America Using Social Big Data and Google Search Trends (소셜 빅데이터와 Google 검색트렌드를 활용한 한국과 미국의 사이버불링 검색에 영향을 미치는 요인 분석)

  • Song, Tae-Min;Song, Juyoung;Cheon, Mi-Kyung
    • The Journal of Bigdata
    • /
    • v.1 no.1
    • /
    • pp.67-75
    • /
    • 2016
  • The study analyzed big data extracted from Google and social media to identify factors related to searches on cyber bullying in Korea and America. Korea's cyber bullying analysis was conducted social big data collected from online news sites, blogs, $caf{\acute{e}}s$, social network services and message for between January 1, 2011 and March 31, 2013. Google search trends for the search words of stress, exercise, drinking, and cyber bullying were obtained for January 1, 2004 and December 22, 2013. The main results of this study were as follows: first, the significant factors stress were cyber bullying that Korea more than America. Secondly, a positive relationship was found between stress and drinking, exercise and cyber bullying both Korea and America. Thirdly, significant differences were found all path both Korea and America. The study shows that both adults and teenagers are influenced in Korea. We need to develop online application that if cyber bullying behavior was predicted can intervene in real time because these actual cyber bullying-related exposure to psychological and behavioral characteristic.

  • PDF

A study on the method of deriving the cause of social issues based on causal sentences (인과관계문형 기반 사회이슈 발생원인 도출 방법 연구)

  • Lee, Namyeon;Lee, Jae Hyung
    • Journal of Digital Convergence
    • /
    • v.19 no.3
    • /
    • pp.167-176
    • /
    • 2021
  • With development of big data analysis technology, many studies to find social issues using texts mining techniques have been conducted. In order to derive social issues, previous studies performed in a way that collects a large amount of text data from news or SNS, and then analyzes issues based on text mining techniques such as topic modeling and terms network analysis. Social issues are the results of various social phenomena and factors. However, since previous studies focused on deriving social issues that are results of various causes, there are limitations to revealing the cause of the issues. In order to effectively respond to social issues, it is necessary not only to derive social issues, but also to be able to identify the causes of social issues. In this study, in order to overcome these limitations, we proposed a method of deriving the factors that cause social issues from texts related to social issues based on the theory of part of Korean linguistics. To do this, we collected news data related to social issues for three years from 2017 to 2019 and proposed a methodology to find causes based causal sentences based on text mining techniques.

The Perception Analysis of Autonomous Vehicles using Network Graph (네트워크 그래프를 활용한 자율주행차에 대한 인식 분석)

  • Hyo-gyeong Park;Yeon-hwi You;Sung-jung Yong;Seo-young Lee;Il-young Moon
    • Journal of Practical Engineering Education
    • /
    • v.15 no.1
    • /
    • pp.97-105
    • /
    • 2023
  • Recently, with the development of artificial intelligence technology, many technologies for user convenience are being developed. Among them, interest in autonomous vehicles is increasing day by day. Currently, many automobile companies are aiming to commercialize autonomous vehicles. In order to lay the foundation for the government's new and reasonable policy establishment to support commercialization, we tried to analyze changes and perceptions of public opinion through news article data. Therefore, in this paper, 35,891 news article data mentioning terms similar to 'autonomous vehicles' over the past three years were collected and network analyzed. As a result of the analysis, major keywords such as 'autonomous driving', 'AI', 'future', 'Hyundai Motor', 'autonomous driving vehicle', 'automobile', 'industrial', and 'electric vehicle' were derived. In addition, the autonomous vehicle industry is developing into a faster and more diverse platform and service industry by converging with various industries such as semiconductor companies and big tech companies as well as automobile companies and is paying attention to the convergence of industries. To continuously confirm changes and perceptions in public opinion, it is necessary to analyze perceptions through continuous analysis of SNS data or technology trends.

A Study on the Relationship between Class Similarity and the Performance of Hierarchical Classification Method in a Text Document Classification Problem (텍스트 문서 분류에서 범주간 유사도와 계층적 분류 방법의 성과 관계 연구)

  • Jang, Soojung;Min, Daiki
    • The Journal of Society for e-Business Studies
    • /
    • v.25 no.3
    • /
    • pp.77-93
    • /
    • 2020
  • The literature has reported that hierarchical classification methods generally outperform the flat classification methods for a multi-class document classification problem. Unlike the literature that has constructed a class hierarchy, this paper evaluates the performance of hierarchical and flat classification methods under a situation where the class hierarchy is predefined. We conducted numerical evaluations for two data sets; research papers on climate change adaptation technologies in water sector and 20NewsGroup open data set. The evaluation results show that the hierarchical classification method outperforms the flat classification methods under a certain condition, which differs from the literature. The performance of hierarchical classification method over flat classification method depends on class similarities at levels in the class structure. More importantly, the hierarchical classification method works better when the upper level similarity is less that the lower level similarity.