• Title/Summary/Keyword: 뉴스 데이터 분석

Search Result 389, Processing Time 0.032 seconds

A Study on the Perception and Experience of Daejeon Public Library Users Using Text Mining: Focusing on SNS and Online News Articles (텍스트마이닝을 활용한 대전시 공공도서관 이용자의 인식과 경험 연구 - SNS와 온라인 뉴스 기사를 중심으로 -)

  • Jiwon Choi;Seung-Jin Kwak
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.58 no.2
    • /
    • pp.363-384
    • /
    • 2024
  • This study was conducted to examine the user's experiences with the public library in Daejeon using big data analysis, focusing on the text mining technique. To know this, first, the overall evaluation and perception of users about the public library in Daejeon were explored by collecting data on social media. Second, through analysis using online news articles, the pending issues that are being discussed socially were identified. As a result of the analysis, the proportion of users with children was first high. Next, it was found that topics through LDA analysis appeared in four categories: 'cultural event/program', 'data use', 'physical environment and facilities', and 'library service'. Finally, it was confirmed that keywords for the additional construction of libraries and complex cultural spaces and the establishment of a library cooperation system appeared at the core in the news article data. Based on this, it was proposed to build a library in consideration of regional balance and to create a social parenting community network through business agreements with childcare and childcare institutions. This will contribute to identifying the policy and social trends of public libraries in Daejeon and implementing data-based public library operations that reflect local community demands.

A Study on the Effect of the Document Summarization Technique on the Fake News Detection Model (문서 요약 기법이 가짜 뉴스 탐지 모형에 미치는 영향에 관한 연구)

  • Shim, Jae-Seung;Won, Ha-Ram;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.201-220
    • /
    • 2019
  • Fake news has emerged as a significant issue over the last few years, igniting discussions and research on how to solve this problem. In particular, studies on automated fact-checking and fake news detection using artificial intelligence and text analysis techniques have drawn attention. Fake news detection research entails a form of document classification; thus, document classification techniques have been widely used in this type of research. However, document summarization techniques have been inconspicuous in this field. At the same time, automatic news summarization services have become popular, and a recent study found that the use of news summarized through abstractive summarization has strengthened the predictive performance of fake news detection models. Therefore, the need to study the integration of document summarization technology in the domestic news data environment has become evident. In order to examine the effect of extractive summarization on the fake news detection model, we first summarized news articles through extractive summarization. Second, we created a summarized news-based detection model. Finally, we compared our model with the full-text-based detection model. The study found that BPN(Back Propagation Neural Network) and SVM(Support Vector Machine) did not exhibit a large difference in performance; however, for DT(Decision Tree), the full-text-based model demonstrated a somewhat better performance. In the case of LR(Logistic Regression), our model exhibited the superior performance. Nonetheless, the results did not show a statistically significant difference between our model and the full-text-based model. Therefore, when the summary is applied, at least the core information of the fake news is preserved, and the LR-based model can confirm the possibility of performance improvement. This study features an experimental application of extractive summarization in fake news detection research by employing various machine-learning algorithms. The study's limitations are, essentially, the relatively small amount of data and the lack of comparison between various summarization technologies. Therefore, an in-depth analysis that applies various analytical techniques to a larger data volume would be helpful in the future.

Pizza Sales Prediction by Using Big Data Analysis. (빅데이터 분석을 통한 피자 판매량 예측)

  • Lee, Daebum;Kim, Kyoungsup;Lee, Youngsoo;Kim, Hanahan;Byun, Dongsam;Park, Sungchul;Jeon, Hwaseong;Kim, Juntae
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2014.11a
    • /
    • pp.890-893
    • /
    • 2014
  • IT산업의 새로운 패러다임으로 빅데이터 분석이 주요한 기술로 부각되고 있다. 본 논문에서는 빅데이터를 수집, 분석하여 이를 통해 피자 판매량을 예측하는 모델을 제안한다. 판매량 예측을 위하여 과거 판매 데이터와 함께 공휴일, 날씨, 뉴스기사, 경제지표, 트렌드, 스포츠 이벤트 등의 데이터를 수집하여 이용하였으며, 판매량 예측 방법으로는 회기분석과 인공신경망 학습 등을 사용하여 빅데이터를 사용하지 않은 경우와 정확도를 비교하였다. 실험 결과 빅데이터를 이용함으로써 예측 오차율이 5%이상 향상됨을 확인하였다.

VTS에 관한 언론 빅데이터 분석 및 대응방안 모색

  • Kim, Seong-Uk;Kim, Hyo-Jin;Jeong, Jae-Yong;Jeong, Cho-Yeong
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2018.05a
    • /
    • pp.265-267
    • /
    • 2018
  • 최근 해양사고와 관련하여 VTS가 언론에 노출되는 횟수가 많아지고 있다. VTS는 해양사고 예방업무를 주로 하고 있기 때문에, 만약 관제구역 내 사고가 발생하게 되면 부정적 언론보도의 표적이 될 수 있으므로, 이에 대한 언론 대응이 중요하다. VTS 업무에 대하여 시민들이 올바르게 이해하고, 적절한 언론 대응을 위해서는 현재 VTS 관련 언론에 대한 현황 분석이 중요하다. 이에 본 연구에서는 뉴스 빅데이터 분석시스템인 빅카인즈를 활용하여 VTS 관련 언론에 대해 양적 분석을 실시하였다.

  • PDF

A Study on AI Evolution Trend based on Topic Frame Modeling (인공지능발달 토픽 프레임 연구 -계열화(seriation)와 통합화(skeumorph)의 사회구성주의 중심으로-)

  • Kweon, Sang-Hee;Cha, Hyeon-Ju
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.7
    • /
    • pp.66-85
    • /
    • 2020
  • The purpose of this study is to explain and predict trends the AI development process based on AI technology patents (total) and AI reporting frames in major newspapers. To that end, a summary of South Korean and U.S. technology patents filed over the past nine years and the AI (Artificial Intelligence) news text of major domestic newspapers were analyzed. In this study, Topic Modeling and Time Series Return Analysis using Big Data were used, and additional network agenda correlation and regression analysis techniques were used. First, the results of this study were confirmed in the order of artificial intelligence and algorithm 5G (hot AI technology) in the AI technical patent summary, and in the news report, AI industrial application and data analysis market application were confirmed in the order, indicating the trend of reporting on AI's social culture. Second, as a result of the time series regression analysis, the social and cultural use of AI and the start of industrial application were derived from the rising trend topics. The downward trend was centered on system and hardware technology. Third, QAP analysis using correlation and regression relationship showed a high correlation between AI technology patents and news reporting frames. Through this, AI technology patents and news reporting frames have tended to be socially constructed by the determinants of media discourse in AI development.

Keyword Analysis of COVID-19 in News Big Data : Focused on 4 Major Daily Newspapers

  • Kwon, Seong-Wook
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.12
    • /
    • pp.101-107
    • /
    • 2020
  • This paper aims to compare and analyze the major keywords according to the political orientation of progressive and conservative newspapers by utilizing the big data of the four major domestic daily newspapers related to COVID-19, which has entered a long-term war. To this end, 93,917 news reports from Jan. 20 to Sept. 15, 2020 were divided into four stages and the major keywords of the four newspapers were implemented and analyzed in WordCloud. According to the analysis, the conservative newspaper focused on the government's response, criticism, and China's responsibility by mentioning the keywords "government," "president," "state of affairs" and "mask" more than the progressive newspaper, while the progressive newspaper uses keywords that emphasize the seriousness of the disease and the occurrence of a dangerous situation. The Chosun Ilbo found that the use of various keywords during the massive outbreak of collective infections (2.18-5.15), and that the JoongAng Ilbo used keywords criticizing government policies in relation to reports of infectious diseases such as COVID-19, but also used keywords that emphasize the seriousness of diseases used by progressive newspapers and the occurrence of dangerous situations.

Stock-Index Invest Model Using News Big Data Opinion Mining (뉴스와 주가 : 빅데이터 감성분석을 통한 지능형 투자의사결정모형)

  • Kim, Yoo-Sin;Kim, Nam-Gyu;Jeong, Seung-Ryul
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.2
    • /
    • pp.143-156
    • /
    • 2012
  • People easily believe that news and stock index are closely related. They think that securing news before anyone else can help them forecast the stock prices and enjoy great profit, or perhaps capture the investment opportunity. However, it is no easy feat to determine to what extent the two are related, come up with the investment decision based on news, or find out such investment information is valid. If the significance of news and its impact on the stock market are analyzed, it will be possible to extract the information that can assist the investment decisions. The reality however is that the world is inundated with a massive wave of news in real time. And news is not patterned text. This study suggests the stock-index invest model based on "News Big Data" opinion mining that systematically collects, categorizes and analyzes the news and creates investment information. To verify the validity of the model, the relationship between the result of news opinion mining and stock-index was empirically analyzed by using statistics. Steps in the mining that converts news into information for investment decision making, are as follows. First, it is indexing information of news after getting a supply of news from news provider that collects news on real-time basis. Not only contents of news but also various information such as media, time, and news type and so on are collected and classified, and then are reworked as variable from which investment decision making can be inferred. Next step is to derive word that can judge polarity by separating text of news contents into morpheme, and to tag positive/negative polarity of each word by comparing this with sentimental dictionary. Third, positive/negative polarity of news is judged by using indexed classification information and scoring rule, and then final investment decision making information is derived according to daily scoring criteria. For this study, KOSPI index and its fluctuation range has been collected for 63 days that stock market was open during 3 months from July 2011 to September in Korea Exchange, and news data was collected by parsing 766 articles of economic news media M company on web page among article carried on stock information>news>main news of portal site Naver.com. In change of the price index of stocks during 3 months, it rose on 33 days and fell on 30 days, and news contents included 197 news articles before opening of stock market, 385 news articles during the session, 184 news articles after closing of market. Results of mining of collected news contents and of comparison with stock price showed that positive/negative opinion of news contents had significant relation with stock price, and change of the price index of stocks could be better explained in case of applying news opinion by deriving in positive/negative ratio instead of judging between simplified positive and negative opinion. And in order to check whether news had an effect on fluctuation of stock price, or at least went ahead of fluctuation of stock price, in the results that change of stock price was compared only with news happening before opening of stock market, it was verified to be statistically significant as well. In addition, because news contained various type and information such as social, economic, and overseas news, and corporate earnings, the present condition of type of industry, market outlook, the present condition of market and so on, it was expected that influence on stock market or significance of the relation would be different according to the type of news, and therefore each type of news was compared with fluctuation of stock price, and the results showed that market condition, outlook, and overseas news was the most useful to explain fluctuation of news. On the contrary, news about individual company was not statistically significant, but opinion mining value showed tendency opposite to stock price, and the reason can be thought to be the appearance of promotional and planned news for preventing stock price from falling. Finally, multiple regression analysis and logistic regression analysis was carried out in order to derive function of investment decision making on the basis of relation between positive/negative opinion of news and stock price, and the results showed that regression equation using variable of market conditions, outlook, and overseas news before opening of stock market was statistically significant, and classification accuracy of logistic regression accuracy results was shown to be 70.0% in rise of stock price, 78.8% in fall of stock price, and 74.6% on average. This study first analyzed relation between news and stock price through analyzing and quantifying sensitivity of atypical news contents by using opinion mining among big data analysis techniques, and furthermore, proposed and verified smart investment decision making model that could systematically carry out opinion mining and derive and support investment information. This shows that news can be used as variable to predict the price index of stocks for investment, and it is expected the model can be used as real investment support system if it is implemented as system and verified in the future.

A Study of the Relationships among Types of Web Portal Usage, Credibility of News on Web Portals, and Credibility of News Media Outlets (포털 사이트 뉴스 이용행태, 포털 사이트 뉴스 신뢰도, 언론사 뉴스 신뢰도 간 관계에 대한 연구)

  • Baek, Kanghui
    • The Journal of the Korea Contents Association
    • /
    • v.19 no.10
    • /
    • pp.306-314
    • /
    • 2019
  • Based on the secondary data of the 2018 news media audience conducted by the Korean Press Foundation, this study examines how the credibility of news on web portals and from news media outlets affects the types of web portal usage. The web portal usage was categorized into the two types: paying attention to news edited by the web portals and paying attention to news media outlets. These were named "portal site centered" and "news media outlet centered," respectively. This study found that the credibility of news from the web portals showed positive relationships with both portal site centered and news media outlet centered. The credibility of news from news media outlets had statistically significant relationships with the two types of web portal usage, but it differed depending on the audiences' preferences toward news media outlets. In other words, the credibility of news from news media outlets that are usually preferred or frequently used showed a positive relationship with the portal site centered, but has a negative relationship with the news media outlet centered. On the other hand, the credibility of news from news media outlets that are not preferred or not frequently used had a negative relationship with the portal site centered, but had a positive relationship with the news media outlet centered.

A study on Deep Learning-based Stock Price Prediction using News Sentiment Analysis

  • Kang, Doo-Won;Yoo, So-Yeop;Lee, Ha-Young;Jeong, Ok-Ran
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.8
    • /
    • pp.31-39
    • /
    • 2022
  • Stock prices are influenced by a number of external factors, such as laws and trends, as well as number-based internal factors such as trading volume and closing prices. Since many factors affect stock prices, it is very difficult to accurately predict stock prices using only fragmentary stock data. In particular, since the value of a company is greatly affected by the perception of people who actually trade stocks, emotional information about a specific company is considered an important factor. In this paper, we propose a deep learning-based stock price prediction model using sentiment analysis with news data considering temporal characteristics. Stock and news data, two heterogeneous data with different characteristics, are integrated according to time scale and used as input to the model, and the effect of time scale and sentiment index on stock price prediction is finally compared and analyzed. Also, we verify that the accuracy of the proposed model is improved through comparative experiments with existing models.

An analysis of public perception on Artificial Intelligence(AI) education using Big Data: Based on News articles and Twitter (빅데이터 분석을 통해 본 AI교육에 대한 사회적 인식: 뉴스기사와 트위터를 중심으로)

  • Lee, Sang-Soog;Yoo, Inhyeok;Kim, Jinhee
    • Journal of Digital Convergence
    • /
    • v.18 no.6
    • /
    • pp.9-16
    • /
    • 2020
  • The purpose of this study is to understand the public needs for AI education actively promoted and supported by the current government. In doing so, 11 metropolitan news articles and Twitter posts regarding AI education that have been posted from January 1, 2018 to December 31, 2019 were collected. Then, word frequency analysis using TF(Term Frequency) method and LDA(Latent Dirichlet Allocation) method of topic modeling analysis were conducted. The topics of the news articles turn out to be a macroscopic policy support such as 'training female manpower in the AI field' and 'curriculum reform of university and K-12', whereas the topics of twitter delineate more detailed social perception on future society, such as future competencies and pedagogical methods, including 'coexistence with intelligent robots', 'coding education', and 'humane education competence development'. The findings are expected to be used to suggest the implications for the composition and management of AI curriculum as well as the basic framework of human resources development in the future industry.