• Title/Summary/Keyword: News contents analysis

Search Result 246, Processing Time 0.03 seconds

Genetic Clustering with Semantic Vector Expansion (의미 벡터 확장을 통한 유전자 클러스터링)

  • Song, Wei;Park, Soon-Cheol
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.3
    • /
    • pp.1-8
    • /
    • 2009
  • This paper proposes a new document clustering system using fuzzy logic-based genetic algorithm (GA) and semantic vector expansion technology. It has been known in many GA papers that the success depends on two factors, the diversity of the population and the capability to convergence. We use the fuzzy logic-based operators to adaptively adjust the influence between these two factors. In traditional document clustering, the most popular and straightforward approach to represent the document is vector space model (VSM). However, this approach not only leads to a high dimensional feature space, but also ignores the semantic relationships between some important words, which would affect the accuracy of clustering. In this paper we use latent semantic analysis (LSA)to expand the documents to corresponding semantic vectors conceptually, rather than the individual terms. Meanwhile, the sizes of the vectors can be reduced drastically. We test our clustering algorithm on 20 news groups and Reuter collection data sets. The results show that our method outperforms the conventional GA in various document representation environments.

A domain-specific sentiment lexicon construction method for stock index directionality (주가지수 방향성 예측을 위한 도메인 맞춤형 감성사전 구축방안)

  • Kim, Jae-Bong;Kim, Hyoung-Joong
    • Journal of Digital Contents Society
    • /
    • v.18 no.3
    • /
    • pp.585-592
    • /
    • 2017
  • As development of personal devices have made everyday use of internet much easier than before, it is getting generalized to find information and share it through the social media. In particular, communities specialized in each field have become so powerful that they can significantly influence our society. Finally, businesses and governments pay attentions to reflecting their opinions in their strategies. The stock market fluctuates with various factors of society. In order to consider social trends, many studies have tried making use of bigdata analysis on stock market researches as well as traditional approaches using buzz amount. In the example at the top, the studies using text data such as newspaper articles are being published. In this paper, we analyzed the post of 'Paxnet', a securities specialists' site, to supplement the limitation of the news. Based on this, we help researchers analyze the sentiment of investors by generating a domain-specific sentiment lexicon for the stock market.

"Dangerous Media vs. Reliable Childcare Helper" : Discursive Analysis of Infants' Smart Media Use ('위험한 미디어 vs 든든한 육아 도우미' : 영유아 스마트 미디어 이용 담론에 대한 탐구)

  • Choi, Yisook;Kim, Banya
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.8
    • /
    • pp.515-525
    • /
    • 2021
  • The study examines how the discourse on infants' smart media use has been constructed under media-saturated situations. Infants' use of smart media has been regarded as a dangerous activity, rate of overdependence has been increasing. Newspapers during the recent three years (2018-20) were analyzed. The most prominent speakers in the news field were smart media content producers and platform operators. There were negative views and concerns about infants' smart media use by academics, civic groups, and parents. However, the industry went beyond these risk discourses and gave positive meaning: Smart media was redefined as safe media for infants and reliable childcare helpers for parents. Parents were portrayed as those responsible for their children's media use and in need of help for childcare, rather than being blamed for their children's overdependence on smart media. Digital parenting seems to be emerging as an acceptable and practicable way of childcare rather than harmful and incomplete parenting.

Big Data Analysis on Daegu-Gyeongbuk Administrative Integration (대구·경북 행정통합에 대한 빅데이터 분석)

  • Song, Hwa Young;Park, Han Woo
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.5
    • /
    • pp.139-148
    • /
    • 2021
  • The study examines public attitude and reaction regarding administrative integration in Daegu and Gyeongbuk area. Specifically, it employs social big data including textual comments on online news articles and YouTube video clips. The collected data are analyzed in order to compare two periods, that is, before and after the inauguration of the Public Opinion Committee for One Daegu-Gyeongbuk. As a result, we have found that people's favorable response to administrative integration has gradually increased since the launch of the Committee. However, it still lacks specific administrative procedures and discussion topics among the frequently used words in the collected data. Thus, the Committee needs to provide a variety of information and materials related to administrative integration.

YouTube Users' Awareness of False Information Regulation and Exposure to Disinformation (유튜브 이용자들의 허위정보 노출경험 및 규제에 대한 인식 차이)

  • Kim, Sora
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.8
    • /
    • pp.14-32
    • /
    • 2022
  • This study aims to examine the perception of false information and deepfakes according to the experience of being exposed to false information and deepfake images for YouTube content users. The study used the data from 'YouTube Use and False Information Exposure Experience' conducted by the Korea Press Foundation in 2018. For the statistical analysis, correspondent analysis was employed. The main results followed as: First, it was found that men who have been exposed to false information are most seriously aware of the problems caused by false information on YouTube. Second, regarding the need for regulation on deepfake images, women who have experienced exposure to deepfake images tended to agree, and women had a stronger awareness of the need for regulation due to damage to deepfake images than men. While YouTube users generally agree that regulation is necessary, it is required to educate YouTube users about the types of disinformation and deepfakes. In particular, it is considered to be desirable to create an environment for the self-regulation of the producers and distributors.

A Study on Monitoring Method of Citizen Opinion based on Big Data : Focused on Gyeonggi Lacal Currency (Gyeonggi Money) (빅데이터 기반 시민의견 모니터링 방안 연구 : "경기지역화폐"를 중심으로)

  • Ahn, Soon-Jae;Lee, Sae-Mi;Ryu, Seung-Ei
    • Journal of Digital Convergence
    • /
    • v.18 no.7
    • /
    • pp.93-99
    • /
    • 2020
  • Text mining is one of the big data analysis methods that extracts meaningful information from atypical large-scale text data. In this study, text mining was used to monitor citizens' opinions on the policies and systems being implemented. We collected 5,108 newspaper articles and 748 online cafe posts related to 'Gyeonggi Lacal Currency' and performed frequency analysis, TF-IDF analysis, association analysis, and word tree visualization analysis. As a result, many articles related to the purpose of introducing local currency, the benefits provided, and the method of use. However, the contents related to the actual use of local currency were written in the online cafe posts. In order to revitalize local currency, the news was involved in the promotion of local currency as an informant. Online cafe posts consisted of the opinions of citizens who are local currency users. SNS and text mining are expected to effectively activate various policies as well as local currency.

Visualizing the Results of Opinion Mining from Social Media Contents: Case Study of a Noodle Company (소셜미디어 콘텐츠의 오피니언 마이닝결과 시각화: N라면 사례 분석 연구)

  • Kim, Yoosin;Kwon, Do Young;Jeong, Seung Ryul
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.4
    • /
    • pp.89-105
    • /
    • 2014
  • After emergence of Internet, social media with highly interactive Web 2.0 applications has provided very user friendly means for consumers and companies to communicate with each other. Users have routinely published contents involving their opinions and interests in social media such as blogs, forums, chatting rooms, and discussion boards, and the contents are released real-time in the Internet. For that reason, many researchers and marketers regard social media contents as the source of information for business analytics to develop business insights, and many studies have reported results on mining business intelligence from Social media content. In particular, opinion mining and sentiment analysis, as a technique to extract, classify, understand, and assess the opinions implicit in text contents, are frequently applied into social media content analysis because it emphasizes determining sentiment polarity and extracting authors' opinions. A number of frameworks, methods, techniques and tools have been presented by these researchers. However, we have found some weaknesses from their methods which are often technically complicated and are not sufficiently user-friendly for helping business decisions and planning. In this study, we attempted to formulate a more comprehensive and practical approach to conduct opinion mining with visual deliverables. First, we described the entire cycle of practical opinion mining using Social media content from the initial data gathering stage to the final presentation session. Our proposed approach to opinion mining consists of four phases: collecting, qualifying, analyzing, and visualizing. In the first phase, analysts have to choose target social media. Each target media requires different ways for analysts to gain access. There are open-API, searching tools, DB2DB interface, purchasing contents, and so son. Second phase is pre-processing to generate useful materials for meaningful analysis. If we do not remove garbage data, results of social media analysis will not provide meaningful and useful business insights. To clean social media data, natural language processing techniques should be applied. The next step is the opinion mining phase where the cleansed social media content set is to be analyzed. The qualified data set includes not only user-generated contents but also content identification information such as creation date, author name, user id, content id, hit counts, review or reply, favorite, etc. Depending on the purpose of the analysis, researchers or data analysts can select a suitable mining tool. Topic extraction and buzz analysis are usually related to market trends analysis, while sentiment analysis is utilized to conduct reputation analysis. There are also various applications, such as stock prediction, product recommendation, sales forecasting, and so on. The last phase is visualization and presentation of analysis results. The major focus and purpose of this phase are to explain results of analysis and help users to comprehend its meaning. Therefore, to the extent possible, deliverables from this phase should be made simple, clear and easy to understand, rather than complex and flashy. To illustrate our approach, we conducted a case study on a leading Korean instant noodle company. We targeted the leading company, NS Food, with 66.5% of market share; the firm has kept No. 1 position in the Korean "Ramen" business for several decades. We collected a total of 11,869 pieces of contents including blogs, forum contents and news articles. After collecting social media content data, we generated instant noodle business specific language resources for data manipulation and analysis using natural language processing. In addition, we tried to classify contents in more detail categories such as marketing features, environment, reputation, etc. In those phase, we used free ware software programs such as TM, KoNLP, ggplot2 and plyr packages in R project. As the result, we presented several useful visualization outputs like domain specific lexicons, volume and sentiment graphs, topic word cloud, heat maps, valence tree map, and other visualized images to provide vivid, full-colored examples using open library software packages of the R project. Business actors can quickly detect areas by a swift glance that are weak, strong, positive, negative, quiet or loud. Heat map is able to explain movement of sentiment or volume in categories and time matrix which shows density of color on time periods. Valence tree map, one of the most comprehensive and holistic visualization models, should be very helpful for analysts and decision makers to quickly understand the "big picture" business situation with a hierarchical structure since tree-map can present buzz volume and sentiment with a visualized result in a certain period. This case study offers real-world business insights from market sensing which would demonstrate to practical-minded business users how they can use these types of results for timely decision making in response to on-going changes in the market. We believe our approach can provide practical and reliable guide to opinion mining with visualized results that are immediately useful, not just in food industry but in other industries as well.

Qualitative Analysis of Food and Nutrition Informations offered in Television Programs(year 2002-2003) -Newscastings, Health Information Programs and Dramas (지상파 TV 방송프로그램에 나타난 식품영양정보의 질적 분석(2002-2003년) - 뉴스, 건강정보 프로그램, 드라마)

  • Mun, Hyeon-Gyeong;Jang, Yeong-Ju
    • Journal of the Korean Dietetic Association
    • /
    • v.11 no.1
    • /
    • pp.67-85
    • /
    • 2005
  • The study aimed to perform the qualitative analysis of food and nutrition informations offered in TV program by monitoring newscastings, health-related programs giving food and nutrition information, dramas for family, education programs for children, and information programs for elderly in major TV broadcasting station(KBS, MBC, SBS, EBS). In this study, statistical analysis were done for numbers of information items related to health or food and nutrition informations. Duration of program the main, subject, sources, evaluation criteria of the contents. Results of qualitative monitoring for TV program are as follows. For health-related informations major propotions of subjects for the newscastings were about diseases. Those for health information programs were about foods. Those for children-education programs were about groceries. Those for seniors’ information programs were about eating habits. The analysis of food and nutrition information sources for most of programs were interviews with specialist and normal person, and on-the-spot-investingations. For food and nutrition informations those were evaluated as inappropriate, the propotion of news was increased to 72.2% in 2003 from 49.3% in 2002. For health information programs, it was increased to 67.7% in 2003 from 54.0% in 2002. But, in drama the propotion of inappropriate scenes were decreased to 16.2% in 2003 from 63.2% in 2002. In children-education programs, it was 40.0%. In seniors’ information programs, it was 17.9% in 2002. The propotion of cases that the quantity of foods is inappropriate in the food scene of serial drama, decreased to 15.8% in 2003 from 28.6% in 2002. The rate of drinking scenes increased to 11.5% from 10.7%. The rate of smoking scenes decreased to 0.2% from 1.6% due to the broadcasting self-regulation of smoking scenes in dramas. In the newscatings and information programs, reasons of being evaluated as inappropriate was that they didn’t have any practical suggestions and proper intakes. There were also insufficient explanation for technical terminology, different comparison standard of nutritive value, and exaggeration for physiological effect of food. The drama contained a lot of unnecessary scenes of alcohol drinking, coffee drinking, midnight meal, and had more quantity of foods than the quantity needed for persons to the scene. As the result of this study, the rate of food and nutrition information were high, but the rate of information which was evaluate as appropriate was not sufficient. There are need to improve contents of information and to moniter the contents for consumer.

  • PDF

Sports Celebrities as a Determinant of Sport Media Distribution Contents: Focusing on Tacit Premise of Agenda Setting Theory (스포츠미디어의 유통 콘텐츠 결정요인으로서 스포츠 스타: 의제설정 이론의 암묵적 전제를 중심으로)

  • YOO, Sang-Keon;KIM, Yong-Eun;SEO, Won-Jae
    • Journal of Distribution Science
    • /
    • v.17 no.10
    • /
    • pp.83-91
    • /
    • 2019
  • Purpose - Media is a significant distributional channel in sport. In terms of determining the influencer in building sport media contents, recent sport media studies have employed agenda-setting theory, assuming media itself as the agenda provider. In a real-world situation, however, sports stars have been deemed key factor determining distribution contents in sport. The starting point of this study is the "tacit premise" of agenda-setting theory. Given the agenda-setting theory, the current study attempted to explore the function of sport stars as an agenda provider, which is a key determinant of sport distribution. Research design, data, and methodology - This study has reviewed articles of Yuna Kim, Sang-hwa Lee, and Hyun-jin Ryu from daily newspapers including as dong-a ilbo and joongang ilbo (2013 to 2017). The study collected data, portable document format (PDF), from the online archive of dong-a ilbo and joongang ilbo. We coded the length of the article, the frequency, the size of the picture, and the structural form of the article. Inter-coder reliability was compared with data previously investigated by the researcher. Inter-coder reliabilities for study 1 and 2 was .89 and .85. To examine hypotheses, descriptive analysis, correlations, and cross-tap analysis were performed. Results - The results partially supported the hypotheses proposing the significant role of sports stars as the agenda setters in distributing sport media contents. In specific, the study found that the number of articles about sports stars prevailed the number of articles about regular athletes. Besides, studies found that the use of photos was more frequent in articles of sports starts than that of regular athletes. In sports newspaper articles, featured story articles were used more than straight-articles for news relating to sports stars. Also, sports newspaper of sports stars contained more information associated within an event rather than outside of an event. Conclusions - In sports journalism, this study challenges the current theory that the media affects the composition and the content of sports coverages. As the principle of the agenda-setting of sports media, the influence of sports stars must be continuously studied along with a follow-up study.

An Analysis of Newspaper Coverage of Korean Movie Stars : Focusing on the Image of Movie Stars and Reporting Trend (신문의 한국 영화스타 보도 내용분석 : 영화스타의 이미지와 보도 경향 중심으로)

  • Tae, Bo-Ra
    • The Journal of the Korea Contents Association
    • /
    • v.19 no.9
    • /
    • pp.535-549
    • /
    • 2019
  • The purpose of this study was to examine what kind of images were presented on movie stars in the newspaper. For the purpose, we classified the time period according to the movie industry and media trend, selected representative stars by period, and collected 798 related articles reported in newspapers. As a result of analyzing the reporting trend, domestic and foreign topics, news format, and gender difference in collected movie star articles, it was found that the image of movie stars reproduced in newspaper articles had mostly neutral images that do not represent specific gender. Since the 2000s, news coverage was changed to reproduce various images rather than being fixed to particular images, and the subject of report became more diversified through comparison of domestic and foreign topics. In addition, articles in the form of book review decreased and the interview-type articles increased in number, and in the case of male movie stars, the proportion of articles based on works was high in comparison to female movie stars. This study has significance in that it explored the changes in the process of reproducing star images diachronically from the initial stage of stars to the modern times. And it is hoped that this study will serve as basic data for the follow-up studies on the process of reproducing various images in the multi-media era.