• Title/Summary/Keyword: news big data

Search Result 290, Processing Time 0.028 seconds

A Study on Public Policy through Semantic Network Analysis of Public Data related News in Korea (국내 공공데이터 관련 뉴스 의미망 분석을 통한 공공정책 연구)

  • Moon, HyeJung;Lee, Kyungseo
    • Journal of Broadcast Engineering
    • /
    • v.23 no.4
    • /
    • pp.536-548
    • /
    • 2018
  • Public data has been transformed from provider-oriented information disclosure to a form of personalized information sharing centered on individual citizens since government 3.0. As a result, the government is implementing policies and projects to maximize the value of public data and increase reuse. This study analyzes the issues related to public data in the news and seeks the status of government agencies and government projects by issue. We conducted semantic analysis on domestic online news and public agency bidding information including public data and conducted the work of linking major key words derived with social and economic values inherent in public data. As a result, major issues related to public data were divided into broader access to public data, growth of new technology, cooperation and conflict among stakeholders, and utilization of the private sector, which were closely related to transparency, efficiency, participation, and innovation mechanisms. Also major agencies of four issues include the Ministry of Strategy and Finance and Seoul, Ministry of Culture, Sports and Tourism and Gyeonggi-do, Ministry of Trade, Industry and Energy and Incheon, and Ministry of Land, Infrastructure and Transport and Gyeongsangbuk-do. Most of the issues are being led by the government.

Topic Model Analysis of Research Trend on Renewable Energy (신재생에너지 동향 파악을 위한 토픽 모형 분석)

  • Shin, KyuSik;Choi, HoeRyeon;Lee, HongChul
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.16 no.9
    • /
    • pp.6411-6418
    • /
    • 2015
  • To respond the climate change and environmental pollution, the studies on renewable energy policies are increasing. The renewable energy is a new growth engine technology represented by the green industry and green technology. At present, the investments for the renewable energy supply and technology development projects of three main strategy sectors such as sunlight, wind power and hydrogen fuel cell are implemented in our country, while they are still in the early stage, accordingly reducing those uncertainty for the research direction and investment fields is the most urgent issue among others. Thus, this study applied text mining method and multinominal topic model among the big data analysis methods on our country's newspaper articles concerning the renewable energy over the last 10 years, and then analyzed the core issues and global research trend, forecasting the renewable energy fields with the growth potential. It is predicted that these results of the study based on information and communication technology will be actively applied on the renewable energy fields.

A Study on Applying Novel Reverse N-Gram for Construction of Natural Language Processing Dictionary for Healthcare Big Data Analysis (헬스케어 분야 빅데이터 분석을 위한 개체명 사전구축에 새로운 역 N-Gram 적용 연구)

  • KyungHyun Lee;RackJune Baek;WooSu Kim
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.3
    • /
    • pp.391-396
    • /
    • 2024
  • This study proposes a novel reverse N-Gram approach to overcome the limitations of traditional N-Gram methods and enhance performance in building an entity dictionary specialized for the healthcare sector. The proposed reverse N-Gram technique allows for more precise analysis and processing of the complex linguistic features of healthcare-related big data. To verify the efficiency of the proposed method, big data on healthcare and digital health announced during the Consumer Electronics Show (CES) held each January was collected. Using the Python programming language, 2,185 news titles and summaries mentioned from January 1 to 31 in 2010 and from January 1 to 31 in 2024 were preprocessed with the new reverse N-Gram method. This resulted in the stable construction of a dictionary for natural language processing in the healthcare field.

Intelligent Web Crawler for Supporting Big Data Analysis Services (빅데이터 분석 서비스 지원을 위한 지능형 웹 크롤러)

  • Seo, Dongmin;Jung, Hanmin
    • The Journal of the Korea Contents Association
    • /
    • v.13 no.12
    • /
    • pp.575-584
    • /
    • 2013
  • Data types used for big-data analysis are very widely, such as news, blog, SNS, papers, patents, sensed data, and etc. Particularly, the utilization of web documents offering reliable data in real time is increasing gradually. And web crawlers that collect web documents automatically have grown in importance because big-data is being used in many different fields and web data are growing exponentially every year. However, existing web crawlers can't collect whole web documents in a web site because existing web crawlers collect web documents with only URLs included in web documents collected in some web sites. Also, existing web crawlers can collect web documents collected by other web crawlers already because information about web documents collected in each web crawler isn't efficiently managed between web crawlers. Therefore, this paper proposed a distributed web crawler. To resolve the problems of existing web crawler, the proposed web crawler collects web documents by RSS of each web site and Google search API. And the web crawler provides fast crawling performance by a client-server model based on RMI and NIO that minimize network traffic. Furthermore, the web crawler extracts core content from a web document by a keyword similarity comparison on tags included in a web documents. Finally, to verify the superiority of our web crawler, we compare our web crawler with existing web crawlers in various experiments.

A study on the effect of tax evasion controversy on corporate values in internet news portals through big data analysis (빅데이터 분석을 통한 인터넷 뉴스 포털에서의 탈세 논란이 기업 가치에 미치는 영향 연구)

  • Lee, Sang-Min;Park, Myung-Ho;Kim, Byung-Jun;Park, Dae-Keun
    • Journal of Internet Computing and Services
    • /
    • v.22 no.6
    • /
    • pp.51-57
    • /
    • 2021
  • If a company's actions to save or avoid taxes are judged to be tax evasion rather than legal tax action by the tax authorities, the company will not only pay tax but also non-tax costs such as damage to corporate image and stock price decline due to a series of tax evasion-related news articles. Therefore, this study measures the frequency of occurrence of tax evasion controversial keywords in internet news portal as a factor to measure the severity of the case, and analyzes the effect of the frequency of occurrence on corporate value. In the Korean stock market, we crawl related articles from internet news portal by using keywords that are controversial for tax evasion targeting top companies based on market capitalization, and generate a time series of the frequency of occurrence of keywords about tax evasion by company and analyze the effect of frequency of appearance on book value versus market capitalization. Through panel regression and impulse response analysis, it is analyzed that the frequency of appearance has a negative effect on the market capitalization and the effect gradually decreases until 12 months. This study examines whether the tax evasion issue affects the corporate value of Korean companies and suggests that it is necessary to take these influences into account when entrepreneurs set up tax-planning schemes.

Exploring the Issue Structure of Drone Crime in Newspaper Articles: Focusing on Language Network Analysis (신문 기사에서의 드론 범죄 관련 이슈구조 탐색: 언어 네트워크 분석을 중심으로)

  • Park, Hee-Young;Lee, Soo-Bum
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.11
    • /
    • pp.20-29
    • /
    • 2021
  • This study aims to explore the issue of drones and crime in newspaper articles. BIG KINDS, an online news archive of the Korea Press Foundation, collected 1,213 newspaper articles that met the terms of "drone" and "crime" in 11 central and 28 regional comprehensive newspapers between January 1, 1990 and May 1, 2021. Among them, we perform keyword frequency, centrality analysis, network structure construction, CONCOR analysis, and density matrix analysis on 117 key keywords. According to the analysis, the main issues were classified into eight, and the report analysis on drones and crimes in newspaper articles showed that the government's policy-making and social problems on protecting people's privacy, preventing illegal filming, securing navigation safety, social security and resolution. This study attempts to expand the field of humanities and social studies related to drones and crime, and specifically suggests the current status and counterplan against drone-related crimes as policy implications and media implications.

Evaluation of Major Projects of the 5th Basic Forest Plan Utilizing Big Data Analysis (빅데이터 분석을 활용한 제5차 산림기본계획 주요 사업에 대한 평가)

  • Byun, Seung-Yeon;Koo, Ja-Choon;Seok, Hyun-Deok
    • Journal of Korean Society of Forest Science
    • /
    • v.106 no.3
    • /
    • pp.340-352
    • /
    • 2017
  • In This study, we examined the gap between supply and demand of forest policy by year through big data analysis for macroscopic evaluation of the 5th Basic Forest Plan. We collected unstructured data based on keywords related to the projects mentioned in the news, SNS and so on in the relevant year for the policy demand side; and based on the documents published by the Korea Forest Service for the policy supply side. based on the collected data, we specified the network structure through the social network analysis technique, and identified the gap between supply and demand of the Korea Forest Service's policies by comparing the network of the demand side and that of the supply side. The results of big data analysis indicated that the network of the supply side is less radial than that of the demand side, implying that various keywords other than forest could considerably influence on the network. Also we compared the trends of supply and demand for 33 keywords related to 27 major projects. The results showed that 7 keywords shows increasing demand but decreasing supply: sustainable, forest management, forest biota, forest protection, forest disease and pest, urban forest, and North Korea. Since the supply-demand gap is confirmed for the 7 keywords, it is necessary to strengthen the forest policy regarding the 7 keywords in the 6th Basic Plan.

Machine Learning Method in Medical Education: Focusing on Research Case of Press Frame on Asbestos (의학교육에서 기계학습방법 교육: 석면 언론 프레임 연구사례를 중심으로)

  • Kim, Junhewk;Heo, So-Yun;Kang, Shin-Ik;Kim, Geon-Il;Kang, Dongmug
    • Korean Medical Education Review
    • /
    • v.19 no.3
    • /
    • pp.158-168
    • /
    • 2017
  • There is a more urgent call for educational methods of machine learning in medical education, and therefore, new approaches of teaching and researching machine learning in medicine are needed. This paper presents a case using machine learning through text analysis. Topic modeling of news articles with the keyword 'asbestos' were examined. Two hypotheses were tested using this method, and the process of machine learning of texts is illustrated through this example. Using an automated text analysis method, all the news articles published from January 1, 1990 to November 15, 2016 in South Korea which included 'asbestos' in the title and the body were collected by web scraping. Differences in topics were analyzed by structured topic modelling (STM) and compared by press companies and periods. More articles were found in liberal media outlets. Differences were found in the number and types of topics in the articles according to the partisanship and period. STM showed that the conservative press views asbestos as a personal problem, while the progressive press views asbestos as a social problem. A divergence in the perspective for emphasizing the issues of asbestos between the conservative press and progressive press was also found. Social perspective influences the main topics of news stories. Thus, the patients' uneasiness and pain are not presented by both sources of media. In addition, topics differ between news media sources based on partisanship, and therefore cause divergence in readers' framing. The method of text analysis and its strengths and weaknesses are explained, and an application for the teaching and researching of machine learning in medical education using the methodology of text analysis is considered. An educational method of machine learning in medical education is urgent for future generations.

A Study on Corporate Reputation and Profitability Focus on Online News and Comments (기업평판과 수익성에 관한 연구 온라인 뉴스와 뉴스댓글을 중심으로)

  • Jin, Zhilong;Han, Eun-Kyoung
    • Journal of Digital Convergence
    • /
    • v.17 no.9
    • /
    • pp.399-406
    • /
    • 2019
  • The purpose of this study is to examine the relationship between corporate reputation and the profitability. In this study, Big Data Analysis was conducted for Hyundai Motor, Shinsegae Department Store, SK Telecom, and Amorepacific to solve research problems. The results of this study show that the effect of each corporate reputation on the profitability is different according to the company. For products such as Hyundai Motor and Amorepacific that are used directly by consumers, the corporate reputation formed by the comments was more influential. In addition, distribution Service company such as Shinsegae Department Store showed more influence by online news. On the other hand, SK Telecom did not have a significant effect on profitability. Based on the results, this study emphasizes the importance of online news and comments on corporate reputation management, and aims to contribute to establishing an efficient reputation management strategy by examining the relationship between corporate reputation and profitability.

A Study on Conversational Public Administration Service of the Chatbot Based on Artificial Intelligence (인공지능 기반 대화형 공공 행정 챗봇 서비스에 관한 연구)

  • Park, Dong-ah
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.8
    • /
    • pp.1347-1356
    • /
    • 2017
  • Artificial intelligence-based services are expanding into a new industrial revolution. There is artificial intelligence technology applied in real life due to the development of big data and deep learning related technology. And data analysis and intelligent assistant services that integrate information from various fields have also been commercialized. Chatbot with interactive artificial intelligence provide shopping, news or information. Chatbot service, which has begun to be adopted by some public institutions, is now just a first step in the steps. This study summarizes the services and technical analysis of chatbot. and the direction of public administration service chatbot was presented.