• Title/Summary/Keyword: news data

Search Result 888, Processing Time 0.029 seconds

News Clipping System Through Dynamic Data Extraction (동적 데이터 추출을 통한 뉴스 클리핑 시스템)

  • 전호철;신성혁
    • Proceedings of the Korea Multimedia Society Conference
    • /
    • 2003.11b
    • /
    • pp.727-730
    • /
    • 2003
  • 인터넷의 빠른 보급으로 많은 양의 정보가 유통되기 시작했다. 그러나 사용자들은 필요한 정보들을 취사 선택하기엔 정보들은 양이 너무 방대하다. 각종 사이트에 있는 뉴스들을 실시간으로 사용자들에게 필요한 정보를 제공할 수 있는 뉴스 클리핑은 이러한 사용자들의 요구를 충족할수 있다 하지만 기존의 뉴스 클리핑 시스템은 각 사이트에 접근 후, 수동적인 분석을 통해 뉴스 정보 및 뉴스 기사의 위치를 파악하고 이를 추출하도록 하는 시스템들이다. 본 논문에서 제안하고자 하는 시스템은 사이트의 구조를 파악하고, 뉴스 기사들을 동적으로 추출함으로써 기존 시스템의 단점을 극복하고, 내용 기반의 뉴스기사 검색이 가능하도록 한다.

  • PDF

Design of Web Content Model (웹 컨텐트 저장소)

  • Abbass, Onytra;Koo, Heung-Seo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2002.11c
    • /
    • pp.1915-1918
    • /
    • 2002
  • Managing semistructured data needs fine granularity such as markup elements. XML has major effect in managing web content, it enables content reusability, enriches information with metadata, ensures valid document links, etc. We introduce our content model as an integrated work which handles content objects as controllable units. The paper concerns on modeling news site and how the content is classified due to the site structure, aggregated content and reusability. The model stores instance XML document into relation database using fragmentation strategy.

  • PDF

New Data Buffering Scheme for News On Demand (NOD 데이터를 위한 새로운 버퍼링 기법)

  • 박용운;백건효;서원일;김영주;정기동
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1997.11a
    • /
    • pp.173-179
    • /
    • 1997
  • 본 연구에서는 실시간 데이터와 비 실시간 데이터가 복합적으로 존재하는 뉴스 데이터에 적합하도록 버퍼 캐쉬를 실시간 데이터와 비 실시간 데이터 영역으로 분할 한 후, 로그 데이터를 이용하여 접근 가능성이 높은 실시간 뉴스데이터를 프리팻칭하여 둠으로써 실시간 뉴스 데이터의 운영을 효과적으로 할 수 있는 새로운 버퍼 캐쉬 알고리즘을 제안한다. 이 방식을 이용함으로써 전체 뉴스 요청 건수 중 30% 이상의 요청 건수들이 디스크를 접근하지 않고 버퍼의 데이터를 접근함으로써 버퍼링 기법을 사용하지 않은 경우보다 실시간 지원에 필요한 디스크 접근 수를 줄일 수 있다.

  • PDF

Trump Tariffs and Roundabout Trade

  • TADASHI ITO
    • KDI Journal of Economic Policy
    • /
    • v.46 no.3
    • /
    • pp.25-47
    • /
    • 2024
  • Although there are many news articles of tariff dodging via the rerouting of made-in-China goods through third countries, relabeling these goods as made in Mexico or made in Vietnam, there have been no scientific studies on this issue. This paper provides statistical evidence regarding whether such practices are taking place. Using monthly trade statistics at the most disaggregated level and analyzing data up to 2019, the year before the COVID-19 shock, little evidence of roundabout trade is found. With an extended dataset up to 2023, overall there is little sign of roundabout trade, although some slight signs of roundabout trade are found for Mexico and Vietnam.

Bankruptcy Prediction Modeling Using Qualitative Information Based on Big Data Analytics (빅데이터 기반의 정성 정보를 활용한 부도 예측 모형 구축)

  • Jo, Nam-ok;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.2
    • /
    • pp.33-56
    • /
    • 2016
  • Many researchers have focused on developing bankruptcy prediction models using modeling techniques, such as statistical methods including multiple discriminant analysis (MDA) and logit analysis or artificial intelligence techniques containing artificial neural networks (ANN), decision trees, and support vector machines (SVM), to secure enhanced performance. Most of the bankruptcy prediction models in academic studies have used financial ratios as main input variables. The bankruptcy of firms is associated with firm's financial states and the external economic situation. However, the inclusion of qualitative information, such as the economic atmosphere, has not been actively discussed despite the fact that exploiting only financial ratios has some drawbacks. Accounting information, such as financial ratios, is based on past data, and it is usually determined one year before bankruptcy. Thus, a time lag exists between the point of closing financial statements and the point of credit evaluation. In addition, financial ratios do not contain environmental factors, such as external economic situations. Therefore, using only financial ratios may be insufficient in constructing a bankruptcy prediction model, because they essentially reflect past corporate internal accounting information while neglecting recent information. Thus, qualitative information must be added to the conventional bankruptcy prediction model to supplement accounting information. Due to the lack of an analytic mechanism for obtaining and processing qualitative information from various information sources, previous studies have only used qualitative information. However, recently, big data analytics, such as text mining techniques, have been drawing much attention in academia and industry, with an increasing amount of unstructured text data available on the web. A few previous studies have sought to adopt big data analytics in business prediction modeling. Nevertheless, the use of qualitative information on the web for business prediction modeling is still deemed to be in the primary stage, restricted to limited applications, such as stock prediction and movie revenue prediction applications. Thus, it is necessary to apply big data analytics techniques, such as text mining, to various business prediction problems, including credit risk evaluation. Analytic methods are required for processing qualitative information represented in unstructured text form due to the complexity of managing and processing unstructured text data. This study proposes a bankruptcy prediction model for Korean small- and medium-sized construction firms using both quantitative information, such as financial ratios, and qualitative information acquired from economic news articles. The performance of the proposed method depends on how well information types are transformed from qualitative into quantitative information that is suitable for incorporating into the bankruptcy prediction model. We employ big data analytics techniques, especially text mining, as a mechanism for processing qualitative information. The sentiment index is provided at the industry level by extracting from a large amount of text data to quantify the external economic atmosphere represented in the media. The proposed method involves keyword-based sentiment analysis using a domain-specific sentiment lexicon to extract sentiment from economic news articles. The generated sentiment lexicon is designed to represent sentiment for the construction business by considering the relationship between the occurring term and the actual situation with respect to the economic condition of the industry rather than the inherent semantics of the term. The experimental results proved that incorporating qualitative information based on big data analytics into the traditional bankruptcy prediction model based on accounting information is effective for enhancing the predictive performance. The sentiment variable extracted from economic news articles had an impact on corporate bankruptcy. In particular, a negative sentiment variable improved the accuracy of corporate bankruptcy prediction because the corporate bankruptcy of construction firms is sensitive to poor economic conditions. The bankruptcy prediction model using qualitative information based on big data analytics contributes to the field, in that it reflects not only relatively recent information but also environmental factors, such as external economic conditions.

A Study on the Semantic Network Analysis of "Cooking Academy" through the Big Data (빅데이터를 활용한 "조리학원"의 의미연결망 분석에 관한 연구)

  • Lee, Seung-Hoo;Kim, Hak-Seon
    • Culinary science and hospitality research
    • /
    • v.24 no.3
    • /
    • pp.167-176
    • /
    • 2018
  • In this study, Big Data was used to collect the information related to 'Cooking Academy' keywords. After collecting all the data, we calculated the frequency through the text mining and selected the main words for future data analysis. Data collection was conducted from Google Web and News during the period from January 1, 2013 to December 31, 2017. The selected 64 words were analyzed by using UCINET 6.0 program, and the analysis results were visualized with NetDraw in order to present the relationship of main words. As a result, it was found that the most important goal for the students from cooking school is to work as a cook, likewise to have practical classes. In addition, we obtained the result that SNS marketing system that the social sites, such as Facebook, Twitter, and Instagram are actively utilized as a marketing strategy of the institute. Therefore, the results can be helpful in searching for the method of utilizing big data and can bring brand-new ideas for the follow-up studies. In practical terms, it will be remarkable material about the future marketing directions and various programs that are improved by the detailed curriculums through semantic network of cooking school by using big data.

A Semantic Network Analysis of Big Data regarding Food Exhibition at Convention Center (전시컨벤션센터 식품박람회와 관련된 빅데이터의 의미연결망 분석)

  • Kim, Hak-Seon
    • Culinary science and hospitality research
    • /
    • v.23 no.3
    • /
    • pp.257-270
    • /
    • 2017
  • The purpose of this study was to visualize the semantic network with big data related to food exhibition at convention center. For this, this study collected data containing 'coex food exhibition/bexco food exhibition' keywords from web pages and news on Google during one year from January 1 to December 31, 2016. Data were collected by using TEXTOM, a data collecting and processing program. From those data, degree centrality, closeness centrality, betweenness centrality and eigenvector centrality were analyzed by utilizing packaged NetDraw along with UCINET 6. The result showed that the web visibility of hospitality and destinations was high. In addition, the web visibility was also high for convention center programs, such as festival, exhibition, k-pop and event; hospitality related words, such as tourists, service, hotel, cruise, cuisine, travel. Convergence of iterated correlations showed 4 clustered named "Coex", "Bexco", "Nations" and "Hospitality". It is expected that this diagnosis on food exhibition at convention center according to changes in domestic environment by using these web information will be a foundation of baseline data useful for establishing convention marketing strategies.

An Exploratory Study on the Semantic Network Analysis of Food Tourism through the Big Data (빅데이터를 활용한 음식관광관련 의미연결망 분석의 탐색적 적용)

  • Kim, Hak-Seon
    • Culinary science and hospitality research
    • /
    • v.23 no.4
    • /
    • pp.22-32
    • /
    • 2017
  • The purpose of this study was to explore awareness of food tourism using big data analysis. For this, this study collected data containing 'food tourism' keywords from google web search, google news, and google scholar during one year from January 1 to December 31, 2016. Data were collected by using SCTM (Smart Crawling & Text Mining), a data collecting and processing program. From those data, degree centrality and eigenvector centrality were analyzed by utilizing packaged NetDraw along with UCINET 6. The result showed that the web visibility of 'core service' and 'social marketing' was high. In addition, the web visibility was also high for destination, such as rural, place, ireland and heritage; 'socioeconomic circumstance' related words, such as economy, region, public, policy, and industry. Convergence of iterated correlations showed 4 clustered named 'core service', 'social marketing', 'destinations' and 'social environment'. It is expected that this diagnosis on food tourism according to changes in international business environment by using these web information will be a foundation of baseline data useful for establishing food tourism marketing strategies.

Content Analysis on Newspaper Public Opinion Survey - The 17th Presidential Election of Korea -

  • Choi, Kyung-Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.19 no.2
    • /
    • pp.431-441
    • /
    • 2008
  • A public opinion poll's importance is at this time increasing now. Especially, a news report with a fair and objective execution and investigative reporting Moral Code is very important. But a research on the basis of investigative reporting Moral Code is not yet carried out. In this paper, with the center of a public opinion poll involved in the 17th Presidential Election of Korea, investigative reporting Moral Code has been analyzed measurably how well observed in the Press. Furthermore, it has been compared with findings carried out in the year 2002. Finally, through comparing response rate with actual results acquired in a survey of public opinion, I proposed a response rate acquisition.

  • PDF

Herd behavior and volatility in financial markets

  • Park, Beum-Jo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.6
    • /
    • pp.1199-1215
    • /
    • 2011
  • Relaxing an unrealistic assumption of a representative percolation model, this paper demonstrates that herd behavior leads to a high increase in volatility but not trading volume, in contrast with information flows that give rise to increases in both volatility and trading volume. Although detecting herd behavior has posed a great challenge due to its empirical difficulty, this paper proposes a new methodology for detecting trading days with herding. Furthermore, this paper suggests a herd-behavior-stochastic-volatility model, which accounts for herding in financial markets. Strong evidence in favor of the model specification over the standard stochastic volatility model is based on empirical application with high frequency data in the Korean equity market, strongly supporting the intuition that herd behavior causes excess volatility. In addition, this research indicates that strong persistence in volatility, which is a prevalent feature in financial markets, is likely attributed to herd behavior rather than news.