• Title/Summary/Keyword: news data

Search Result 885, Processing Time 0.027 seconds

A Study on the Issue Lifecycle through the Analysis of News Texts - A Case of Samsung Galaxy Note 7 - (신문기사 분석을 통한 이슈 라이프사이클에 관한 연구 - 삼성 갤럭시노트7 사례 -)

  • Heo, Pil Hee;Kim, Yang Sok;Lee, Choong Kwon
    • Smart Media Journal
    • /
    • v.7 no.4
    • /
    • pp.99-105
    • /
    • 2018
  • It is often the case that products or services on the market are causing problems, which hurt the business and image of the company. Responding appropriately to the problem and minimizing the damage is very important to business organizations. This study collected and analyzed the news articles related to the recall of the Galaxy Note 7, which was developed and launched by Samsung Electronics, one of the smartphone market leaders. Based on the issue lifecycle, the characteristics of the news were expressed by stages and the contents of the news were analyzed and visualized using association rules. The results of this study are expected to help business organizations to understand the changes and trends of issues and search for counter measures.

Utilizing Natural Language Processing to Compare Perceptions of Metaverse between News Articles and Academic Research (자연어 처리를 활용한 메타버스 보도, 연구 간 인식 차이 비교)

  • Lee, Gyuho;Lee, Joonhwan
    • Journal of Korea Multimedia Society
    • /
    • v.25 no.10
    • /
    • pp.1483-1498
    • /
    • 2022
  • While public interests in the metaverse are growing recently in the Korean media and research, its understanding has not been fully established yet. In this study, we aimed to probe whether the rapid growth in media attention about the metaverse has increased its usage as a buzzword accompanied by an absence of scientific context. We analyzed publications and online news containing "metaverse" from 2020 to 2022. The data analysis methods are 1) time series frequency, 2) keyword network, 3) natural language model. The findings indicate the perception gap about metaverse between research and news articles broadened as its popularity has grown. Research about metaverse gradually expanded its connections with related topics-virtual and augmented realities-focusing on social changes in a remote environment. However, media reporting frequently used "metaverse" as a buzzword rather than explaining its scientific background, stimulating the proliferation of related topics and the dispersion of news content. This study further discusses the need for a media strategy to improve public conception of the long-term development of the metaverse.

Stock Price Prediction by Utilizing Category Neutral Terms: Text Mining Approach (카테고리 중립 단어 활용을 통한 주가 예측 방안: 텍스트 마이닝 활용)

  • Lee, Minsik;Lee, Hong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.123-138
    • /
    • 2017
  • Since the stock market is driven by the expectation of traders, studies have been conducted to predict stock price movements through analysis of various sources of text data. In order to predict stock price movements, research has been conducted not only on the relationship between text data and fluctuations in stock prices, but also on the trading stocks based on news articles and social media responses. Studies that predict the movements of stock prices have also applied classification algorithms with constructing term-document matrix in the same way as other text mining approaches. Because the document contains a lot of words, it is better to select words that contribute more for building a term-document matrix. Based on the frequency of words, words that show too little frequency or importance are removed. It also selects words according to their contribution by measuring the degree to which a word contributes to correctly classifying a document. The basic idea of constructing a term-document matrix was to collect all the documents to be analyzed and to select and use the words that have an influence on the classification. In this study, we analyze the documents for each individual item and select the words that are irrelevant for all categories as neutral words. We extract the words around the selected neutral word and use it to generate the term-document matrix. The neutral word itself starts with the idea that the stock movement is less related to the existence of the neutral words, and that the surrounding words of the neutral word are more likely to affect the stock price movements. And apply it to the algorithm that classifies the stock price fluctuations with the generated term-document matrix. In this study, we firstly removed stop words and selected neutral words for each stock. And we used a method to exclude words that are included in news articles for other stocks among the selected words. Through the online news portal, we collected four months of news articles on the top 10 market cap stocks. We split the news articles into 3 month news data as training data and apply the remaining one month news articles to the model to predict the stock price movements of the next day. We used SVM, Boosting and Random Forest for building models and predicting the movements of stock prices. The stock market opened for four months (2016/02/01 ~ 2016/05/31) for a total of 80 days, using the initial 60 days as a training set and the remaining 20 days as a test set. The proposed word - based algorithm in this study showed better classification performance than the word selection method based on sparsity. This study predicted stock price volatility by collecting and analyzing news articles of the top 10 stocks in market cap. We used the term - document matrix based classification model to estimate the stock price fluctuations and compared the performance of the existing sparse - based word extraction method and the suggested method of removing words from the term - document matrix. The suggested method differs from the word extraction method in that it uses not only the news articles for the corresponding stock but also other news items to determine the words to extract. In other words, it removed not only the words that appeared in all the increase and decrease but also the words that appeared common in the news for other stocks. When the prediction accuracy was compared, the suggested method showed higher accuracy. The limitation of this study is that the stock price prediction was set up to classify the rise and fall, and the experiment was conducted only for the top ten stocks. The 10 stocks used in the experiment do not represent the entire stock market. In addition, it is difficult to show the investment performance because stock price fluctuation and profit rate may be different. Therefore, it is necessary to study the research using more stocks and the yield prediction through trading simulation.

The Interaction Effects between News Frames and Community Structure on Vote Choice (지역공동체 구조와 뉴스프레임이 투표행위에 미치는 영향)

  • Park, Cheong-Yi
    • Korean journal of communication and information
    • /
    • v.17
    • /
    • pp.37-60
    • /
    • 2001
  • This study attempted to demonstrate the interaction effects between attitudinal frames of nine daily newspapers and community structure in the 1994s Michigan gubernatorial election. It was theoretically guided by framing research and the self-presentation theory of social-cognition perspective and empirically tested with archival data. For the purpose of this study, content analysis of nine statewide daily newspapers was employed in order to provide data on news framing. Data on voting rates for John Engler, winner of the 1994 Michigan Gubernatorial election, in each county of Michigan were used for vote choice while Michigan census data were used for constructing an Index of community structural differentiation. The results indicated that majority compliance frames were slightly more related with vote choice in homogeneous com-unities rather than were majority compliance frames in heterogeneous communities while social identification frames tended to have an influence on vote choice in heterogeneous communities more than did social identification frames in homogeneous communities.

  • PDF

An analysis of the change in media's reports and attitudes about face masks during the COVID-19 pandemic in South Korea: a study using Big Data latent dirichlet allocation (LDA) topic modelling (빅데이터 LDA 토픽 모델링을 활용한 국내 코로나19 대유행 기간 마스크 관련 언론 보도 및 태도 변화 분석)

  • Suh, Ye-Ryoung;Koh, Keumseok Peter;Lee, Jaewoo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.5
    • /
    • pp.731-740
    • /
    • 2021
  • This study applied LDA topic modeling analysis to collect and analyze news media big data related to face masks in the three waves of the COVID-19 pandemic in Korea. The results empirically show that media reports focused on mask production and distribution policies in the first wave and the mandatory mask wearing in the second wave. In contrast, more reports on trivial, gossipy events consist of the media coverage in the second and third waves. The findings imply that Korea's governmental interventions to address the shortage of face masks and to regulate mask wearing were successful relatively in a short time. In contrast, the study also reports that there may be relative less number of science-based news reports like the ones on the effectiveness of face masks or different levels of filter types. This study exemplifies how a big data analysis can be applied to evaluate and enhance public health communication.

Trend Analysis of Pet Plants Before and After COVID-19 Outbreak Using Topic Modeling: Focusing on Big Data of News Articles from 2018 to 2021

  • Park, Yumin;Shin, Yong-Wook
    • Journal of People, Plants, and Environment
    • /
    • v.24 no.6
    • /
    • pp.563-572
    • /
    • 2021
  • Background and objective: The ongoing COVID-19 pandemic restricted daily life, forcing people to spend time indoors. With the growing interest in mental health issues and residential environments, 'pet plants' have been receiving attention during the unprecedented social distancing measures. This study aims to analyze the change in trends of pet plants before and during the COVID-19 pandemic and provide basic data for studies related to pet plants and directions of future development. Methods: A total of 2,016 news articles using the keyword 'pet plants' were collected on Naver News from January 1, 2018 to August 15, 2019 (609 articles) and January 1, 2020 to August 15, 2021 (1,407 articles). The texts were tokenized into words using KoNLPy package, ultimately coming up with 63,597 words. The analyses included frequency of keywords and topic modeling based on Latent Dirichlet Allocation (LDA) to identify the inherent meanings of related words and each topic. Results: Topic modeling generated three topics in each period (before and during the COVID-19), and the results showed that pet plants in daily life have become the object of 'emotional support' and 'healing' during social distancing. In particular, pet plants, which had been distributed as a solution to prevent solitary deaths and depression among seniors living alone, are now expanded to help resolve the social isolation of the general public suffering from COVID-19. The new term 'plant butler' became a new trend, and there was a change in the trend in which people shared their hobbies and information about pet plants and communicated with others in online. Conclusion: Based on these findings, the trend data of pet plants before and after the outbreak of COVID-19 can provide the basis for activating research on pet plants and setting the direction for development of related industries considering the continuous popularity and trend of indoor gardening and green hobby.

A study on Deep Learning-based Stock Price Prediction using News Sentiment Analysis

  • Kang, Doo-Won;Yoo, So-Yeop;Lee, Ha-Young;Jeong, Ok-Ran
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.8
    • /
    • pp.31-39
    • /
    • 2022
  • Stock prices are influenced by a number of external factors, such as laws and trends, as well as number-based internal factors such as trading volume and closing prices. Since many factors affect stock prices, it is very difficult to accurately predict stock prices using only fragmentary stock data. In particular, since the value of a company is greatly affected by the perception of people who actually trade stocks, emotional information about a specific company is considered an important factor. In this paper, we propose a deep learning-based stock price prediction model using sentiment analysis with news data considering temporal characteristics. Stock and news data, two heterogeneous data with different characteristics, are integrated according to time scale and used as input to the model, and the effect of time scale and sentiment index on stock price prediction is finally compared and analyzed. Also, we verify that the accuracy of the proposed model is improved through comparative experiments with existing models.

News big-data Analysis on 'Education for Sustainable Development': Focusing on 2000 ~ 2021 ('지속가능발전교육' 관련 언론사 뉴스 빅데이터 분석: 2000 ~ 2021년을 중심으로)

  • Kim, Sung-ae
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.629-632
    • /
    • 2022
  • Education for sustainable development is an education that helps learners of all ages acquire the knowledge, skills, and attitudes necessary to solve interconnected international challenges such as climate change and environmental problems.It is an integral component of the Sustainable Development Goals (SDGs) #4 and contributes to the 17 SDGs. In order to find out the trend of ESD, 2718 news data from January 1, 2000 to December 31, 2021 were collected through 26 media outlets.As key keywords, international organizations leading sustainable development education such as the UN and UNESCO, local governments including Dobong-gu, and major issues such as climate change and ecological change could be identified. This can be used as basic data for various studies as it can explore trends for ESD.

  • PDF

Crisis Prediction of Regional Industry Ecosystem based on Text Sentiment Analysis Using News Data - Focused on the Automobile Industry in Gwangju - (뉴스 데이터를 활용한 텍스트 감성분석에 따른 지역 산업생태계 위기 예측 - 광주 지역 자동차 산업을 중심으로 -)

  • Kim, Hyun-Ji;Kim, Sung-Jin;Kim, Han-Gook
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.8
    • /
    • pp.1-9
    • /
    • 2020
  • As the aging problem of the regional industry ecosystem has gradually become serious, research to measure and regenerate the regional industry ecosystem decline has been actively conducted. However, little research has been done on regional industry ecosystem crises. Crisis emerges radically over a short period of time, and it is often impossible to respond by post-response, so you must respond before the crisis occurs. In other words, it is more necessary and required when looking at the crisis early and taking a proactive response from a long-term perspective. Therefore, it is necessary to develop a predictive model that can proactively recognize and respond to the crisis in the regional industry ecosystem. Therefore, this study checked the possibility of predicting the risk of regional industry and market according to the emotional score of the news by using large-scale news data. News sentiment analysis was performed using the Google sentiment analysis API, and this was organized by month to check the correlation between actual events.

An Analysis of the Fake News Assessment Criteria on Fact-check Coverage (팩트체크 보도의 가짜뉴스 판단 기준 검토)

  • Baek, Kanghui
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.2
    • /
    • pp.172-181
    • /
    • 2020
  • This study examines the fact-check coverage provided by the SNU fact-check center site(factcheck.snu.ac.kr). A total of 50 articles that were cross-checked by multiple news media organizations were analyzed. The study's variables were topics, types, characteristics, consistency of the news media organizations' judgement, and fact-check sources. This study found that fact-checking coverage was generally focused on presidential or general election candidates or politicians, as well as political topics. The types of fact-checking coverage primarily included factual information, as well as some opinions or interpretations. Fact-check coverage was mainly focused on the facts of the statements themselves, causal relationships, or the timing or target of the comparison criteria. On average, the fact-checking coverage most frequently assigned the judgment 'mostly false, and primarily used interviews of individuals or data from organizations involved in the issue, government data, and experts' statements as the bases for its fact-checking judgements.