• Title/Summary/Keyword: 뉴스댓글

Search Result 63, Processing Time 0.021 seconds

Prediction of Onion Purchase Using Structured and Unstructured Big Data (정형 및 비정형 빅데이터를 이용한 양파 소비 예측)

  • Rah, HyungChul;Oh, Eunhwa;Yoo, Do-il;Cho, Wan-Sup;Nasridinov, Aziz;Park, Sungho;Cho, Youngbeen;Yoo, Kwan-Hee
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.11
    • /
    • pp.30-37
    • /
    • 2018
  • The social media data and the broadcasting data related to onion as well as agri-food consumer panel data were collected and investigated if the amount of money spent to purchase onion in year 2014 when onion price plunged latest were correlated with the frequencies of onion-related keywords in the social media data and the broadcasting programs because onion price in year 2018 is expected to plunge due to overproduction and there has been needs to analyze impacts of social media and broadcasting program on onion purchase in the previous similar events, and identify potential factors that can promote onion consumption in advance. What we identified from our study include a) broadcasting news programs mentioning words "onion," were correlated with onion purchase with 3 - 6 weeks in advance; b) broadcasting entertainment programs mentioning words "onion and health," were correlated with onion purchase with 11 weeks in advance; c) blog mentioning words "onion and efficacy," were correlated with onion purchase with 5 weeks in advance. Our study provided a case on how social media and broadcasting programs could be analyzed for their effects on consumer purchase behavior using big data collection and analysis in the field of agriculture. We propose to use the findings from the study may be applied to promote onion consumption.

Social Big Data-based Co-occurrence Analysis of the Main Person's Characteristics and the Issues in the 2016 Rio Olympics Men's Soccer Games (소셜 빅데이터 기반 2016리우올림픽 축구 관련 이슈 및 인물에 대한 연관단어 분석)

  • Park, SungGeon;Lee, Soowon;Hwang, YoungChan
    • 한국체육학회지인문사회과학편
    • /
    • v.56 no.2
    • /
    • pp.303-320
    • /
    • 2017
  • This paper seeks to better understand the focal issues and persons related to Rio Olympic soccer games through social data science and analytics. This study collected its data from online news articles and comments specific to KOR during the Olympic football games. In order to investigate the public interests for each game and target persons, this study performed the co-occurrence words analysis. Then after, the study applied the NodeXL software to perform its visualization of the results. Through this application and process, the study found several major issues during the Rio Olympic men's football game including the following: the match between KOR and PIJ, KOR player Heungmin Son, commentator Young-Pyo Lee, sportscaster Woo-Jong Jo. The study also showed the general public opinion expressed positive words towards the South Korean national football team during the Rio Olympics, though there existed negative words as well. Furthermore the study revealed positive attitude towards the commentators and casters. In conclusion, the way to increase the public's interest in big sporting events can be achieved by providing the following: contents that include various professional sports analysis, a capable domain expert with thorough preparation, a commentator and/or caster with artistic sense as well as well-spoken, explanatory power and so on. Multidisciplinary research combined with sports science, social science, information technology and media can contribute to a wide range of theoretical studies and practical developments within the sports industry.

Building a Korean Sentiment Lexicon Using Collective Intelligence (집단지성을 이용한 한글 감성어 사전 구축)

  • An, Jungkook;Kim, Hee-Woong
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.49-67
    • /
    • 2015
  • Recently, emerging the notion of big data and social media has led us to enter data's big bang. Social networking services are widely used by people around the world, and they have become a part of major communication tools for all ages. Over the last decade, as online social networking sites become increasingly popular, companies tend to focus on advanced social media analysis for their marketing strategies. In addition to social media analysis, companies are mainly concerned about propagating of negative opinions on social networking sites such as Facebook and Twitter, as well as e-commerce sites. The effect of online word of mouth (WOM) such as product rating, product review, and product recommendations is very influential, and negative opinions have significant impact on product sales. This trend has increased researchers' attention to a natural language processing, such as a sentiment analysis. A sentiment analysis, also refers to as an opinion mining, is a process of identifying the polarity of subjective information and has been applied to various research and practical fields. However, there are obstacles lies when Korean language (Hangul) is used in a natural language processing because it is an agglutinative language with rich morphology pose problems. Therefore, there is a lack of Korean natural language processing resources such as a sentiment lexicon, and this has resulted in significant limitations for researchers and practitioners who are considering sentiment analysis. Our study builds a Korean sentiment lexicon with collective intelligence, and provides API (Application Programming Interface) service to open and share a sentiment lexicon data with the public (www.openhangul.com). For the pre-processing, we have created a Korean lexicon database with over 517,178 words and classified them into sentiment and non-sentiment words. In order to classify them, we first identified stop words which often quite likely to play a negative role in sentiment analysis and excluded them from our sentiment scoring. In general, sentiment words are nouns, adjectives, verbs, adverbs as they have sentimental expressions such as positive, neutral, and negative. On the other hands, non-sentiment words are interjection, determiner, numeral, postposition, etc. as they generally have no sentimental expressions. To build a reliable sentiment lexicon, we have adopted a concept of collective intelligence as a model for crowdsourcing. In addition, a concept of folksonomy has been implemented in the process of taxonomy to help collective intelligence. In order to make up for an inherent weakness of folksonomy, we have adopted a majority rule by building a voting system. Participants, as voters were offered three voting options to choose from positivity, negativity, and neutrality, and the voting have been conducted on one of the largest social networking sites for college students in Korea. More than 35,000 votes have been made by college students in Korea, and we keep this voting system open by maintaining the project as a perpetual study. Besides, any change in the sentiment score of words can be an important observation because it enables us to keep track of temporal changes in Korean language as a natural language. Lastly, our study offers a RESTful, JSON based API service through a web platform to make easier support for users such as researchers, companies, and developers. Finally, our study makes important contributions to both research and practice. In terms of research, our Korean sentiment lexicon plays an important role as a resource for Korean natural language processing. In terms of practice, practitioners such as managers and marketers can implement sentiment analysis effectively by using Korean sentiment lexicon we built. Moreover, our study sheds new light on the value of folksonomy by combining collective intelligence, and we also expect to give a new direction and a new start to the development of Korean natural language processing.