• Title/Summary/Keyword: news data

Search Result 885, Processing Time 0.026 seconds

COVID-19-related Korean Fake News Detection Using Occurrence Frequencies of Parts of Speech (품사별 출현 빈도를 활용한 코로나19 관련 한국어 가짜뉴스 탐지)

  • Jihyeok Kim;Hyunchul Ahn
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.2
    • /
    • pp.267-283
    • /
    • 2023
  • The COVID-19 pandemic, which began in December 2019 and continues to this day, has left the public needing information to help them cope with the pandemic. However, COVID-19-related fake news on social media seriously threatens the public's health. In particular, if fake news related to COVID-19 is massively spread with similar content, the time required for verification to determine whether it is genuine or fake will be prolonged, posing a severe threat to our society. In response, academics have been actively researching intelligent models that can quickly detect COVID-19-related fake news. Still, the data used in most of the existing studies are in English, and studies on Korean fake news detection are scarce. In this study, we collect data on COVID-19-related fake news written in Korean that is spread on social media and propose an intelligent fake news detection model using it. The proposed model utilizes the frequency information of parts of speech, one of the linguistic characteristics, to improve the prediction performance of the fake news detection model based on Doc2Vec, a document embedding technique mainly used in prior studies. The empirical analysis shows that the proposed model can more accurately identify Korean COVID-19-related fake news by increasing the recall and F1 score compared to the comparison model.

Customized Information Analysis System Using National Defense News Data (국방 기사 데이터를 이용한 맞춤형 정보 분석 시스템)

  • Choi, Jung-Whoan;Lim, Chea-O
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.12
    • /
    • pp.457-465
    • /
    • 2010
  • Customized information analysis system is a software system that can help to extract useful information from non-structured natural language data, process the information to customized form, and provide future forecast and reasoning information. To implement the information analysis system, we need natural language processing technology to analyze natural language, information extraction technology to detect necessary entity and its relationship from text, and data mining technology to discover new and unknown information from extracting data. This paper suggest virtual customized information analysis system processing national defense news data and introduce base technologies for information analysis.

Strategies for the Development of Watermelon Industry Using Unstructured Big Data Analysis

  • LEE, Seung-In;SON, Chansoo;SHIM, Joonyong;LEE, Hyerim;LEE, Hye-Jin;CHO, Yongbeen
    • The Journal of Industrial Distribution & Business
    • /
    • v.12 no.1
    • /
    • pp.47-62
    • /
    • 2021
  • Purpose: Our purpose in this study was to examine the strategies for the development of watermelon industry using unstructured big data analysis. That is, this study was to look the change of issues and consumer's perception about watermelon using big data and social network analysis and to investigate ways to strengthen the competitiveness of watermelon industry based on that. Methodology: For this purpose, the data was collected from Naver (blog, news) and Daum (blog, news) by TEXTOM 4.5 and the analysis period was set from 2015 to 2016 and from 2017-2018 and from 2019-2020 in order to understand change of issues and consumer's perception about watermelon or watermelon industry. For the data analysis, TEXTOM 4.5 was used to conduct key word frequency analysis, word cloud analysis and extraction of metrics data. UCINET 6.0 and NetDraw function of UCINET 6.0 were utilized to find the connection structure of words and to visualize the network relations, and to make a cluster of words. Results: The keywords related to the watermelon extracted such as 'the stalk end of a watermelon', 'E-mart', 'Haman', 'Gochang', and 'Lotte Mart' (news: 015-2016), 'apple watermelon', 'Haman', 'E-mart', 'Gochang', and' Mudeungsan watermelon' (news: 2017-2018), 'E-mart', 'apple watermelon', 'household', 'chobok', and 'donation' (news: 2019-2020), 'watermelon salad', 'taste', 'the heat', 'baby', and 'effect' (blog: 2015-2016), 'taste', 'watermelon juice', 'method', 'watermelon salad', and 'baby' (blog: 2017-2018), 'taste', 'effect', 'watermelon juice', 'method', and 'apple watermelon' (blog: 2019-2020) and the results from frequency and TF-IDF analysis presented. And in CONCOR analysis, appeared as four types, respectively. Conclusions: Based on the results, the authors discussed the strategies and policies for boosting the watermelon industry and limitations of this study and future research directions. The results of this study will help prioritize strategies and policies for boosting the consumption of the watermelon and contribute to improving the competitiveness of watermelon industry in Korea. Also, it is expected that this study will be used as a very important basis for agricultural big data studies to be conducted in the future and this study will offer watermelon producers and policy-makers practical points helpful in crafting tailor-made marketing strategies.

An Empirical Study on the Impact of Blogs and Online News on the Success of Film : Focusing on Before and After Film Release (블로그와 온라인 뉴스가 영화흥행에 미치는 영향에 대한 실증연구 : 영화 개봉 전·후의 구전효과를 중심으로)

  • Lim, Hyunjeong;Yang, Hee-Dong;Baek, Hyunmi
    • Journal of Information Technology Applications and Management
    • /
    • v.21 no.4
    • /
    • pp.157-171
    • /
    • 2014
  • As electronic word of mouth plays an important role in purchase behavior among consumers, the number of studies on the impact of electronic word of mouth is rapidly increasing. Nevertheless, it is difficult to discover comparative studies on the mass media which had a great impact on consumer's purchase behavior before the impact of electronic word of mouth becomes greater versus the social media where electronic words of mouth are created and distributed. It is considered that it seems to be necessary to find an appropriate mutual supplement point between the media designed for a successful marketing by comparing and analyzing the existing mass media versus the social media, major media for electronic word of mouth. Therefore, this study aims to compare and analyze the impact of comments on movie revenue in the representative forms of mass media such as online news and social media blogs. In particular, this study also considers an appropriate media for promoting movies by period by comparing and analyzing the two media before and after film release. For analysis, this study collects the information on the number of comments on online news and blogs in 70 Korean movies released in 2011 and 2012 from five weeks before film release to eight weeks after film release on a daily basis via Naver. This study also collects the information on the movie revenue using the statistical data of movie industry from Korean Film Commission. As a result of empirical data analysis, it is found that the two media showed no difference in movie revenue before film release, but after film release, the impact of blogs was more significant than that of online news.

Semantic Network Analysis of Online News and Social Media Text Related to Comprehensive Nursing Care Service (간호간병통합서비스 관련 온라인 기사 및 소셜미디어 빅데이터의 의미연결망 분석)

  • Kim, Minji;Choi, Mona;Youm, Yoosik
    • Journal of Korean Academy of Nursing
    • /
    • v.47 no.6
    • /
    • pp.806-816
    • /
    • 2017
  • Purpose: As comprehensive nursing care service has gradually expanded, it has become necessary to explore the various opinions about it. The purpose of this study is to explore the large amount of text data regarding comprehensive nursing care service extracted from online news and social media by applying a semantic network analysis. Methods: The web pages of the Korean Nurses Association (KNA) News, major daily newspapers, and Twitter were crawled by searching the keyword 'comprehensive nursing care service' using Python. A morphological analysis was performed using KoNLPy. Nodes on a 'comprehensive nursing care service' cluster were selected, and frequency, edge weight, and degree centrality were calculated and visualized with Gephi for the semantic network. Results: A total of 536 news pages and 464 tweets were analyzed. In the KNA News and major daily newspapers, 'nursing workforce' and 'nursing service' were highly rated in frequency, edge weight, and degree centrality. On Twitter, the most frequent nodes were 'National Health Insurance Service' and 'comprehensive nursing care service hospital.' The nodes with the highest edge weight were 'national health insurance,' 'wards without caregiver presence,' and 'caregiving costs.' 'National Health Insurance Service' was highest in degree centrality. Conclusion: This study provides an example of how to use atypical big data for a nursing issue through semantic network analysis to explore diverse perspectives surrounding the nursing community through various media sources. Applying semantic network analysis to online big data to gather information regarding various nursing issues would help to explore opinions for formulating and implementing nursing policies.

An Evaluation of Determinants to Viewer Acceptance of Artificial Intelligence-based News Anchor (인공지능(AI) 기술 기반의 뉴스 앵커에 대한 수용 의도의 선행요인 연구)

  • Shin, Ha-Yan;Kweon, Sang-Hee
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.4
    • /
    • pp.205-219
    • /
    • 2021
  • The present study identified determinants to user acceptance of artificial intelligence(AI)-based news anchor. Our conceptual model included three constructs of ability, benevolence, and integrity to determine whether these three constructs are predictive of trust perceived from AI news anchor. This work further examined the influences of social presence, anthropomorphism, perceived usefulness, understanding as well as trust as immediate determinants to user acceptance. The conceptual model was validated on survey data collected from 513 respondents. A series of scale refinement process was conducted by the examination of data normality, common method bias, structure of latent variables as well as internal consistency. In addition, a confirmatory factor analysis was performed to assess the extent to which the sample data collected from survey study measures the constructs adequately. The results from the analysis of structural equation model indicated that, (1) two constructs of ability and integrity were found to be significantly predictive of perceived trust, and (2) anthropomorphism, perceived usefulness, and trust emerged as significant and positive predictors of user acceptance of AI-based news anchor.

Material as a Key Element of Fashion Trend in 2010~2019 - Text Mining Analysis - (패션 트렌트(2010~2019)의 주요 요소로서 소재 - 텍스트마이닝을 통한 분석 -)

  • Jang, Namkyung;Kim, Min-Jeong
    • Fashion & Textile Research Journal
    • /
    • v.22 no.5
    • /
    • pp.551-560
    • /
    • 2020
  • Due to the nature of fashion design that responds quickly and sensitively to changes, accurate forecasting for upcoming fashion trends is an important factor in the performance of fashion product planning. This study analyzed the major phenomena of fashion trends by introducing text mining and a big data analysis method. The research questions were as follows. What is the key term of the 2010SS~2019FW fashion trend? What are the terms that are highly relevant to the key trend term by year? Which terms relevant to the key trend term has shown high frequency in news articles during the same period? Data were collected through the 2010SS~2019FW Pre-Trend data from the leading trend information company in Korea and 45,038 articles searched by "fashion+material" from the News Big Data System. Frequency, correlation coefficient, coefficient of variation and mapping were performed using R-3.5.1. Results showed that the fashion trend information were reflected in the consumer market. The term with the highest frequency in 2010SS~2019FW fashion trend information was material. In trend information, the terms most relevant to material were comfort, compact, look, casual, blend, functional, cotton, processing, metal and functional by year. In the news article, functional, comfort, sports, leather, casual, eco-friendly, classic, padding, culture, and high-quality showed the high frequency. Functional was the only fashion material term derived every year for 10 years. This study helps expand the scope and methods of fashion design research as well as improves the information analysis and forecasting capabilities of the fashion industry.

Trend Analysis of Complex Disasters in South Korea Using News Data (뉴스데이터를 활용한 국내 복합재난 발생 동향분석)

  • Eun Hye Shin;Do Woo Kim;Seong Rok Chang
    • Journal of the Korean Society of Safety
    • /
    • v.38 no.6
    • /
    • pp.50-59
    • /
    • 2023
  • As the diversity of disasters continues to increase, the concept of "complex disasters" has gained prominence in various policies and studies related to disaster management. However, there has been a certain limitation in the availability of the systematic statistics or data in advancing policies and research initiatives related to complex disasters. This study aims to analyze the macro-level characteristics of the complex disasters that have occurred domestically utilizing a 30-year span of a news data. Initially, we categorize the complex disasters into the three types: "Natural disaster-Natural disaster", "Natural disaster-Social disaster", and "Social disaster-Social disaster". As a result, the "natural diaster-social disaster" type is the most prevalent. It is noted that "natual disaster-natural disaster" type has increased significantly in recent 10 years (2011-2020). In terms of specific disaster types, "Storm and Flood", "Collapse", "Traffic Accident", "National Infrastructure Paralysis", and "Fire⋅Explosion" occur the most in conjunction with other disasters in a complex manner. It has been observed that the types of disasters co-ocuuring with others have become more diverse over time. Parcicularly, in recent 10 years (2011-2020), in addition to the aforementioned five types, "Heat Wave", "Heavy Snowfall⋅Cold Wave", "Earthquake", "Chemical Accident", "Infectious Disease", "Forest Fire", "Air Pollution", "Drought", and "Landslide" have been notable for their frequent co-occurrence with other disasters. These findings through the statistical analysis of the complex disasters using long-term news data are expected to serve as crucial data for future policy development and research on complex disaster management.

Access Frequency Based Selective Buffer Cache Management Strategy For Multimedia News Data (접근 요청 빈도에 기반한 멀티미디어 뉴스 데이터의 선별적 버퍼 캐쉬 관리 전략)

  • Park, Yong-Un;Seo, Won-Il;Jeong, Gi-Dong
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.9
    • /
    • pp.2524-2532
    • /
    • 1999
  • In this paper, we present a new buffer pool management scheme designed for video type news objects to build a cost-effective News On Demand storage server for serving users requests beyond the limitation of disk bandwidth. In a News On Demand Server where many of users request for video type news objects have to be serviced keeping their playback deadline, the maximum numbers of concurrent users are limited by the maximum disk bandwidth the server provides. With our proposed buffer cache management scheme, a requested data is checked to see whether or not it is worthy of caching by checking its average arrival interval and current disk traffic density. Subsequently, only granted news objects are permitted to get into the buffer pool, where buffer allocation is made not on the block basis but on the object basis. We evaluated the performance of our proposed caching algorithm through simulation. As a result of the simulation, we show that by using this caching scheme to support users requests for real time news data, compared with serving those requests only by disks, 30% of extra requests are served without additional cost increase.

  • PDF

An Analysis of Trends on the Safety Area Utilizing Big Data : Focused on Fake News (빅데이터를 활용한 안전분야 트렌드 분석 : 가짜뉴스(fake news)를 중심으로)

  • Joo, Seong Bhin
    • Convergence Security Journal
    • /
    • v.17 no.5
    • /
    • pp.111-119
    • /
    • 2017
  • As of March 2017, fake news is largely focused on political issues. Outside the country, main issues of the fake news have been a hot topic in the US presidential election in 2016 and emerged as a new political and social problem in countries like Germany and France. In Korea, issues of the fake news are also linked with political issues such as presidential impeachment and prosecution, impeachment quota, early election, etc. This phenomenon has recently led to the production and spread of fake news related to safety and security issues as well as political issues in connection with various methods of generating articles and sharing information. As a result, there is a high possibility that the information will be transformed into information that can cause considerable confusion to the public. Therefore, the recognition of such problems means that it is important at this point to consider the related situation analysis and effective countermeasures. To do this, we tried to make accurate and meaningful analysis for the diagnosis, analysis, forecasting and management of issues utilizing Big Data. As a result, it is found that the fake news is continuously generated in relation to the safety and security issue as well as the political issue in the South Korea, and differs from the general form occurring outside the country.