• Title/Summary/Keyword: Online News Content

Search Result 66, Processing Time 0.021 seconds

Analysis on Dynamics of Korea Startup Ecosystems Based on Topic Modeling (토픽 모델링을 활용한 한국의 창업생태계 트렌드 변화 분석)

  • Heeyoung Son;Myungjong Lee;Youngjo Byun
    • Knowledge Management Research
    • /
    • v.23 no.4
    • /
    • pp.315-338
    • /
    • 2022
  • In 1986, Korea established legal systems to support small and medium-sized start-ups, which becomes the main pillars of national development. The legal systems have stimulated start-up ecosystems to have more than 1 million new start-up companies founded every year during the past 30 years. To analyze the trend of Korea's start-up ecosystem, in this study, we collected 1.18 million news articles from 1991 to 2020. Then, we extracted news articles that have the keywords "start-up", "venture", and "start-up". We employed network analysis and topic modeling to analyze collected news articles. Our analysis can contribute to analyzing the government policy direction shown in the history of start-up support policy. Specifically, our analysis identifies the dynamic characteristics of government influenced by external environmental factors (e.g., society, economy, and culture). The results of our analysis suggest that the start-up ecosystems in Korea have changed and developed mainly by the government policies for corporation governance, industrial development planning, deregulation, and economic prosperity plan. Our frequency keyword analysis contributes to understanding entrepreneurial productivity attributed to activities among the networked components in industrial ecosystems. Our analyses and results provide practitioners and researchers with practical and academic implications that can help to establish dedicated support policies through forecast tasks of the economic environment surrounding the start-ups. Korean entrepreneurial productivity has been empowered by growing numbers of large companies in the mobile phone industry. The spectrum of large companies incorporates content startups, platform providers, online shopping malls, and youth-oriented start-ups. In addition, economic situational factors contribute to the growth of Korean entrepreneurial productivity the economic, which are related to the global expansions of the mobile industry, and government efforts to foster start-ups. Our research is methodologically implicative. We employ natural language processes for 30 years of media articles, which enables more rigorous analysis compared to the existing studies which only observe changes in government and policy based on a qualitative manner.

Sports Celebrities as a Determinant of Sport Media Distribution Contents: Focusing on Tacit Premise of Agenda Setting Theory (스포츠미디어의 유통 콘텐츠 결정요인으로서 스포츠 스타: 의제설정 이론의 암묵적 전제를 중심으로)

  • YOO, Sang-Keon;KIM, Yong-Eun;SEO, Won-Jae
    • Journal of Distribution Science
    • /
    • v.17 no.10
    • /
    • pp.83-91
    • /
    • 2019
  • Purpose - Media is a significant distributional channel in sport. In terms of determining the influencer in building sport media contents, recent sport media studies have employed agenda-setting theory, assuming media itself as the agenda provider. In a real-world situation, however, sports stars have been deemed key factor determining distribution contents in sport. The starting point of this study is the "tacit premise" of agenda-setting theory. Given the agenda-setting theory, the current study attempted to explore the function of sport stars as an agenda provider, which is a key determinant of sport distribution. Research design, data, and methodology - This study has reviewed articles of Yuna Kim, Sang-hwa Lee, and Hyun-jin Ryu from daily newspapers including as dong-a ilbo and joongang ilbo (2013 to 2017). The study collected data, portable document format (PDF), from the online archive of dong-a ilbo and joongang ilbo. We coded the length of the article, the frequency, the size of the picture, and the structural form of the article. Inter-coder reliability was compared with data previously investigated by the researcher. Inter-coder reliabilities for study 1 and 2 was .89 and .85. To examine hypotheses, descriptive analysis, correlations, and cross-tap analysis were performed. Results - The results partially supported the hypotheses proposing the significant role of sports stars as the agenda setters in distributing sport media contents. In specific, the study found that the number of articles about sports stars prevailed the number of articles about regular athletes. Besides, studies found that the use of photos was more frequent in articles of sports starts than that of regular athletes. In sports newspaper articles, featured story articles were used more than straight-articles for news relating to sports stars. Also, sports newspaper of sports stars contained more information associated within an event rather than outside of an event. Conclusions - In sports journalism, this study challenges the current theory that the media affects the composition and the content of sports coverages. As the principle of the agenda-setting of sports media, the influence of sports stars must be continuously studied along with a follow-up study.

A Study on Media Coverage and Recognition Type of Users about Ubiquitous Environment (유비쿼터스 환경에 대한 언론보도와 수용자의 인식유형에 관한 연구)

  • Ryu, Seung-Kwan;Lee, Jei-Young
    • Korean journal of communication and information
    • /
    • v.32
    • /
    • pp.169-207
    • /
    • 2006
  • This study conducted both a content analysis and Q-methods analysis in order to find media coverage and recognition type of users about ubiquitous environment in digital online age. The perception type of this study were divided into four types in all through Q-methodology. Above all, this study investigated three research problems. First, how is the type of the perception divided into the subjectivity by recognition-characteristic on ubiquitous environment in Korea? Second, what is a trait of character in these same or different types? Third, how did news media cover ubiquitous phenomena overall? Fourth, is there any difference between media coverage and audiences' perception about ubiquitous phenomena? The results show that audiences seem to generally follow the frames that mass media provide. In addition, however, the audiences tend to recognize ubiquitous environment based upon their own schema as following four types: 1. Positive Universality Leading Type, 2. Uncertain Prudence Preference Type, 3. Future Cultural Enjoyment Type, 4. Negative Effect View Type. This study suggests that every efforts such as technology and policies that can improve the quality of ubiquitous environment in Korea should be accompanied.

  • PDF

A Research on the Books Selected in 'One Book, One City' Community Reading Promotion Campaign in Korea (국내 '한 책, 한 도시' 독서운동의 선정책에 관한 연구)

  • Yoon, Cheong-Ok
    • Journal of Korean Library and Information Science Society
    • /
    • v.53 no.2
    • /
    • pp.165-188
    • /
    • 2022
  • The purpose of this research is to document the current state of 'One Book, One City' community reading campaign (Hereafter called 'One Book' reading campaign), launched in 2003 in Korea, and the characteristics of the selected books. For this research, the homepages, news and reports of a total of 1,170 public libraries and their local government, and several major institutions and organizations related to reading and culture were analyzed with the research method of content analysis and literature review. Also, online catalogs of the National Library of Korea and the National Library for Children and Young Adults were examined to identify the characteristics of 729 titles and 1,179 volumes of books selected in 57 'One Book' programs, as of 2021. The analysis of 57 'One Book' programs and those selected books shows the selection of more than one books in different age groups in more and more 'One Book' programs, lack of consistency in themes of those selected books, and preference for young adult books, new publications and bestselling novels. This trend has weakened individual 'One Book' programs' concentration on one book or one subject, but helped invite a diverse group of people with various interests. More in-depth analysis and explanation of the process of book selection and its appropriateness with the stated goals of 'One Book' programs are needed.

A MVC Framework for Visualizing Text Data (텍스트 데이터 시각화를 위한 MVC 프레임워크)

  • Choi, Kwang Sun;Jeong, Kyo Sung;Kim, Soo Dong
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.39-58
    • /
    • 2014
  • As the importance of big data and related technologies continues to grow in the industry, it has become highlighted to visualize results of processing and analyzing big data. Visualization of data delivers people effectiveness and clarity for understanding the result of analyzing. By the way, visualization has a role as the GUI (Graphical User Interface) that supports communications between people and analysis systems. Usually to make development and maintenance easier, these GUI parts should be loosely coupled from the parts of processing and analyzing data. And also to implement a loosely coupled architecture, it is necessary to adopt design patterns such as MVC (Model-View-Controller) which is designed for minimizing coupling between UI part and data processing part. On the other hand, big data can be classified as structured data and unstructured data. The visualization of structured data is relatively easy to unstructured data. For all that, as it has been spread out that the people utilize and analyze unstructured data, they usually develop the visualization system only for each project to overcome the limitation traditional visualization system for structured data. Furthermore, for text data which covers a huge part of unstructured data, visualization of data is more difficult. It results from the complexity of technology for analyzing text data as like linguistic analysis, text mining, social network analysis, and so on. And also those technologies are not standardized. This situation makes it more difficult to reuse the visualization system of a project to other projects. We assume that the reason is lack of commonality design of visualization system considering to expanse it to other system. In our research, we suggest a common information model for visualizing text data and propose a comprehensive and reusable framework, TexVizu, for visualizing text data. At first, we survey representative researches in text visualization era. And also we identify common elements for text visualization and common patterns among various cases of its. And then we review and analyze elements and patterns with three different viewpoints as structural viewpoint, interactive viewpoint, and semantic viewpoint. And then we design an integrated model of text data which represent elements for visualization. The structural viewpoint is for identifying structural element from various text documents as like title, author, body, and so on. The interactive viewpoint is for identifying the types of relations and interactions between text documents as like post, comment, reply and so on. The semantic viewpoint is for identifying semantic elements which extracted from analyzing text data linguistically and are represented as tags for classifying types of entity as like people, place or location, time, event and so on. After then we extract and choose common requirements for visualizing text data. The requirements are categorized as four types which are structure information, content information, relation information, trend information. Each type of requirements comprised with required visualization techniques, data and goal (what to know). These requirements are common and key requirement for design a framework which keep that a visualization system are loosely coupled from data processing or analyzing system. Finally we designed a common text visualization framework, TexVizu which is reusable and expansible for various visualization projects by collaborating with various Text Data Loader and Analytical Text Data Visualizer via common interfaces as like ITextDataLoader and IATDProvider. And also TexVisu is comprised with Analytical Text Data Model, Analytical Text Data Storage and Analytical Text Data Controller. In this framework, external components are the specifications of required interfaces for collaborating with this framework. As an experiment, we also adopt this framework into two text visualization systems as like a social opinion mining system and an online news analysis system.

Sentiment Analysis of Movie Review Using Integrated CNN-LSTM Mode (CNN-LSTM 조합모델을 이용한 영화리뷰 감성분석)

  • Park, Ho-yeon;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.141-154
    • /
    • 2019
  • Rapid growth of internet technology and social media is progressing. Data mining technology has evolved to enable unstructured document representations in a variety of applications. Sentiment analysis is an important technology that can distinguish poor or high-quality content through text data of products, and it has proliferated during text mining. Sentiment analysis mainly analyzes people's opinions in text data by assigning predefined data categories as positive and negative. This has been studied in various directions in terms of accuracy from simple rule-based to dictionary-based approaches using predefined labels. In fact, sentiment analysis is one of the most active researches in natural language processing and is widely studied in text mining. When real online reviews aren't available for others, it's not only easy to openly collect information, but it also affects your business. In marketing, real-world information from customers is gathered on websites, not surveys. Depending on whether the website's posts are positive or negative, the customer response is reflected in the sales and tries to identify the information. However, many reviews on a website are not always good, and difficult to identify. The earlier studies in this research area used the reviews data of the Amazon.com shopping mal, but the research data used in the recent studies uses the data for stock market trends, blogs, news articles, weather forecasts, IMDB, and facebook etc. However, the lack of accuracy is recognized because sentiment calculations are changed according to the subject, paragraph, sentiment lexicon direction, and sentence strength. This study aims to classify the polarity analysis of sentiment analysis into positive and negative categories and increase the prediction accuracy of the polarity analysis using the pretrained IMDB review data set. First, the text classification algorithm related to sentiment analysis adopts the popular machine learning algorithms such as NB (naive bayes), SVM (support vector machines), XGboost, RF (random forests), and Gradient Boost as comparative models. Second, deep learning has demonstrated discriminative features that can extract complex features of data. Representative algorithms are CNN (convolution neural networks), RNN (recurrent neural networks), LSTM (long-short term memory). CNN can be used similarly to BoW when processing a sentence in vector format, but does not consider sequential data attributes. RNN can handle well in order because it takes into account the time information of the data, but there is a long-term dependency on memory. To solve the problem of long-term dependence, LSTM is used. For the comparison, CNN and LSTM were chosen as simple deep learning models. In addition to classical machine learning algorithms, CNN, LSTM, and the integrated models were analyzed. Although there are many parameters for the algorithms, we examined the relationship between numerical value and precision to find the optimal combination. And, we tried to figure out how the models work well for sentiment analysis and how these models work. This study proposes integrated CNN and LSTM algorithms to extract the positive and negative features of text analysis. The reasons for mixing these two algorithms are as follows. CNN can extract features for the classification automatically by applying convolution layer and massively parallel processing. LSTM is not capable of highly parallel processing. Like faucets, the LSTM has input, output, and forget gates that can be moved and controlled at a desired time. These gates have the advantage of placing memory blocks on hidden nodes. The memory block of the LSTM may not store all the data, but it can solve the CNN's long-term dependency problem. Furthermore, when LSTM is used in CNN's pooling layer, it has an end-to-end structure, so that spatial and temporal features can be designed simultaneously. In combination with CNN-LSTM, 90.33% accuracy was measured. This is slower than CNN, but faster than LSTM. The presented model was more accurate than other models. In addition, each word embedding layer can be improved when training the kernel step by step. CNN-LSTM can improve the weakness of each model, and there is an advantage of improving the learning by layer using the end-to-end structure of LSTM. Based on these reasons, this study tries to enhance the classification accuracy of movie reviews using the integrated CNN-LSTM model.