• Title/Summary/Keyword: lagged cross-correlation


Relations Between Paprika Consumption and Unstructured Big Data, and Paprika Consumption Prediction

  • Cho, Yongbeen;Oh, Eunhwa;Cho, Wan-Sup;Nasridinov, Aziz;Yoo, Kwan-Hee;Rah, HyungChul
    • International Journal of Contents / v.15 no.4 / pp.113-119 / 2019
  • Large amounts of information on agri-foods have been reported to reach consumers through television and social networks, and this information may influence consumer behavior. This paper had two aims: first, to analyze how social network services and broadcast programs relate to paprika consumption, measured as purchase amounts, and to identify potential factors that can promote paprika consumption; second, to develop prediction models of paprika consumption using structured and unstructured big data. Using data from 2010 to 2017, cross-correlation, and time-series prediction algorithms (an autoregressive exogenous model and a vector error correction model), statistically significant lagged correlations were identified between paprika consumption and television programs/shows and blogs mentioning paprika and diet. Adding the paprika- and diet-related data improved the predictive performance of the models. This is the first report to predict paprika consumption using structured and unstructured data.
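
The lagged cross-correlation mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' code: the series names and values below are hypothetical, and the function simply correlates one series with a copy of the other shifted forward by each lag.

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5


def lagged_cross_correlation(x, y, max_lag):
    """Return {lag: corr(x[t], y[t + lag])} for lag = 0..max_lag."""
    result = {}
    for lag in range(max_lag + 1):
        xs = x[: len(x) - lag]  # drop the last `lag` points of x
        ys = y[lag:]            # drop the first `lag` points of y
        result[lag] = pearson(xs, ys)
    return result


# Hypothetical data: purchases echo media mentions two periods later.
mentions = [3, 7, 2, 9, 4, 8, 1, 6, 5, 10, 2, 7]
purchases = [0, 0] + mentions[:-2]

ccf = lagged_cross_correlation(mentions, purchases, max_lag=4)
best_lag = max(ccf, key=ccf.get)  # lag with the highest correlation
```

In practice each lag's correlation would also be tested for statistical significance, which this sketch leaves out.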

Drought Forecasting with Regionalization of Climate Variables and Generalized Linear Model

  • Yejin Kong;Taesam Lee;Joo-Heon Lee;Sejeong Lee
    • Proceedings of the Korea Water Resources Association Conference / 2023.05a / pp.249-249 / 2023
  • Spring drought forecasting in South Korea is essential because the skewness of rainfall can lead to water shortages, especially in spring, when water is managed without prediction. Therefore, drought forecasting over South Korea was performed in the current study by thoroughly searching for appropriate predictors in a lagged global climate variable, mean sea level pressure (MSLP), specifically in the winter season to provide a forecasting time lag. The target predictand, accumulated spring precipitation (ASP), was derived from the median of 93 weather stations in South Korea. A number of points in the MSLP data were found to be significantly cross-correlated with the ASP, and the points with high correlation were grouped regionally. Three regions were determined for the grouped variables: the Arctic Ocean (R1), South Pacific (R2), and South Africa (R3). The generalized linear model (GLM) was then applied to handle the skewed marginal distribution in drought prediction. The applied GLM showed reasonable performance in forecasting ASP. The results indicate that the presented regionalization of the climate variable MSLP can be a good alternative for forecasting spring drought.
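
As a rough illustration of fitting a GLM to a skewed, positive predictand, the sketch below uses a Gamma family with a log link. That choice is an assumption: the abstract does not state the family or link, and the data here are synthetic. For Gamma with a log link the iteratively reweighted least squares (IRLS) weights are all equal to one, so each iteration reduces to an ordinary least squares fit on a working response.

```python
import math


def fit_gamma_glm_log(x, y, iters=50):
    """Fit log(E[y]) = b0 + b1 * x by IRLS (Gamma family, log link assumed)."""
    n = len(x)
    b0, b1 = math.log(sum(y) / n), 0.0  # start from an intercept-only model
    mx = sum(x) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    for _ in range(iters):
        eta = [b0 + b1 * xi for xi in x]
        mu = [math.exp(e) for e in eta]
        # Working response; IRLS weights are 1 for Gamma with a log link.
        z = [e + (yi - m) / m for e, yi, m in zip(eta, y, mu)]
        mz = sum(z) / n
        sxz = sum((xi - mx) * (zi - mz) for xi, zi in zip(x, z))
        b1 = sxz / sxx
        b0 = mz - b1 * mx
    return b0, b1


# Synthetic, noise-free example: a hypothetical regional pressure index
# versus a positive precipitation-like response.
pressure_index = [-2.0, -1.0, 0.0, 1.0, 2.0, 3.0]
precip = [math.exp(0.5 + 0.3 * v) for v in pressure_index]

b0, b1 = fit_gamma_glm_log(pressure_index, precip)
forecast = math.exp(b0 + b1 * 1.5)  # predicted mean at a new index value
```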


A development of stochastic simulation model based on vector autoregressive model (VAR) for groundwater and river water stages (벡터자기회귀(VAR) 모형을 이용한 지하수위와 하천수위의 추계학적 모의기법 개발)

  • Kwon, Yoon Jeong;Won, Chang-Hee;Choi, Byoung-Han;Kwon, Hyun-Han
    • Journal of Korea Water Resources Association / v.55 no.12 / pp.1137-1147 / 2022
  • River and groundwater stages are the main elements in the hydrologic cycle. They are spatially correlated and can be used to evaluate hydrological and agricultural drought. Stochastic simulation is often performed independently on hydrological variables that are spatiotemporally correlated. In this setting, the interdependency among the variables may not be preserved. This study proposes a Bayesian vector autoregression (VAR) model to capture the interdependency between multiple variables over time. VAR models systematically consider both the lagged stages of each variable and the lagged values of the other variables. Further, an autoregressive (AR) model was built and compared with the VAR model. The VAR model was confirmed to be more effective in reproducing the observed interdependency (or cross-correlation) between river and groundwater stages, while the AR model generally underestimated the observed interdependency.
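
The structural difference between VAR and AR described here can be sketched with a one-step, first-order update. The coefficients below are hypothetical, and the Bayesian estimation used in the study is not reproduced. The off-diagonal entries of the coefficient matrix carry the lagged influence of the other variable, and an independent AR model is the special case where they are zero.

```python
def var1_step(prev, coef, intercept):
    """One deterministic VAR(1) step: next[i] = c[i] + sum_j A[i][j] * prev[j]."""
    k = len(prev)
    return [
        intercept[i] + sum(coef[i][j] * prev[j] for j in range(k))
        for i in range(k)
    ]


# Hypothetical coefficients; order of variables: [river stage, groundwater stage].
A_var = [[0.6, 0.3],
         [0.2, 0.7]]
# Independent AR(1) models keep only each variable's own lag (diagonal terms).
A_ar = [[0.6, 0.0],
        [0.0, 0.7]]
c = [0.1, 0.0]

prev_stage = [1.0, 2.0]
next_var = var1_step(prev_stage, A_var, c)  # uses cross-variable lags
next_ar = var1_step(prev_stage, A_ar, c)    # ignores the other variable
```

A stochastic simulation would add a correlated noise term at each step; it is left out so the cross-variable terms stay easy to see.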

Estimation of river discharge using satellite-derived flow signals and artificial neural network model: application to imjin river (Satellite-derived flow 시그널 및 인공신경망 모형을 활용한 임진강 유역 유출량 산정)

  • Li, Li;Kim, Hyunglok;Jun, Kyungsoo;Choi, Minha
    • Journal of Korea Water Resources Association / v.49 no.7 / pp.589-597 / 2016
  • In this study, we investigated the use of satellite-derived flow (SDF) signals and a data-based model to estimate outflow for a river reach where in situ measurements are either completely unavailable or difficult to access for hydraulic and hydrologic analysis, such as the upper basin of the Imjin River. Many studies have demonstrated that SDF signals can be used as river width estimates and that the correlation between SDF signals and river width is related to the shape of the cross sections. To extract the nonlinear relationship between SDF signals and river outflow, an Artificial Neural Network (ANN) model with SDF signals as its inputs was applied to compute flow discharge at Imjin Bridge on the Imjin River. Fifteen pixels were considered for extracting SDF signals, and the Partial Mutual Information (PMI) algorithm was applied to identify the most relevant input variables among 150 candidate SDF signals (including 0~10 day lagged observations). The discharges estimated by the ANN model were compared with those measured at the Imjin Bridge gauging station; the correlation coefficients for training and validation were 0.86 and 0.72, respectively. When the previous day's discharge at Imjin Bridge was added as an input variable for the ANN model, the correlation coefficients improved to 0.90 and 0.83, respectively. Based on these results, SDF signals combined with some locally measured data can play a useful role in river flow estimation, and especially in flood forecasting for data-scarce regions, as the approach can simulate the peak discharge and peak time of flood events with satisfactory accuracy.
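
One way to assemble candidate inputs from several pixels and several lags is sketched below. The pixel names, values, and lag range are hypothetical, and the PMI selection step and the ANN itself are not shown.

```python
def build_lagged_inputs(signals, max_lag):
    """Build candidate inputs from per-pixel daily series.

    signals: {pixel_id: [v_0, v_1, ...]}, all series of equal length.
    Returns one row per day t >= max_lag; each row maps
    (pixel_id, lag) -> signals[pixel_id][t - lag].
    """
    n_days = len(next(iter(signals.values())))
    rows = []
    for t in range(max_lag, n_days):
        row = {
            (pixel, lag): series[t - lag]
            for pixel, series in signals.items()
            for lag in range(max_lag + 1)
        }
        rows.append(row)
    return rows


# Hypothetical SDF signals for two pixels over four days.
signals = {"px_a": [10, 11, 12, 13], "px_b": [20, 21, 22, 23]}
rows = build_lagged_inputs(signals, max_lag=2)
n_candidates = len(rows[0])  # pixels x (max_lag + 1) candidate inputs per day
```

Each row can then be passed to an input-selection step before training, so that only the most relevant (pixel, lag) pairs reach the model.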

Construction of Consumer Confidence index based on Sentiment analysis using News articles (뉴스기사를 이용한 소비자의 경기심리지수 생성)

  • Song, Minchae;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems / v.23 no.3 / pp.1-27 / 2017
  • It is known that the economic sentiment index and macroeconomic indicators are closely related because economic agents' judgments and forecasts of business conditions affect economic fluctuations. For this reason, consumer sentiment or confidence provides steady fodder for business and is treated as an important piece of economic information. In Korea, the consumer sentiment index is highly relevant to both private consumption and GDP, which makes it a very important economic indicator for evaluating and forecasting the domestic economic situation. However, despite offering relevant insights into private consumption and GDP, the traditional survey-based approach to measuring consumer confidence has several limits. One weakness is that it takes considerable time to research, collect, and aggregate the data. If urgent issues arise, timely information is not announced until the end of each month. In addition, the survey only contains information derived from questionnaire items, so it can be difficult to capture the direct effects of newly arising issues. The survey also faces potential declines in response rates and erroneous responses. Therefore, it is necessary to find a way to complement it. For this purpose, we construct and assess an index that measures consumer economic sentiment using sentiment analysis. Unlike the survey-based measures, our index relies on textual analysis to extract sentiment from economic and financial news articles. In particular, text data such as news articles and SNS are timely and cover a wide range of issues; because such sources can quickly capture the economic impact of specific economic issues, they have great potential as economic indicators. Of the two main approaches to the automatic extraction of sentiment from text, we apply the lexicon-based approach, using sentiment lexicon dictionaries of words annotated with their semantic orientations.
In creating the sentiment lexicon dictionaries, we enter the semantic orientation of individual words manually, though we do not attempt a full linguistic analysis (one that involves analysis of word senses or argument structure); this is a limitation of our research, and further work in that direction remains possible. In this study, we generate a time series index of economic sentiment in the news. The construction of the index consists of three broad steps: (1) collecting a large corpus of economic news articles on the web, (2) applying lexicon-based sentiment analysis to score each article by sentiment orientation (positive, negative, or neutral), and (3) constructing an economic sentiment index of consumers by aggregating monthly time series for each sentiment word. In line with existing scholarly assessments of the relationship between the consumer confidence index and macroeconomic indicators, any new index should be assessed for its usefulness. We examine the new index's usefulness by comparing other economic indicators to the CSI. To check the usefulness of the new sentiment-based index, trend and cross-correlation analyses are carried out to examine the relations and lag structure. Finally, we analyze forecasting power using one-step-ahead out-of-sample prediction. As a result, the news sentiment index correlates strongly with related contemporaneous key indicators in almost all experiments. We also find that news sentiment shocks predict future economic activity in most cases; in head-to-head comparisons, the news sentiment measures outperform the survey-based sentiment index (CSI). Policy makers want to understand consumer or public opinions about existing or proposed policies.
Monitoring various web media, SNS, or news articles enables relevant government decision-makers to respond quickly to such opinions. Textual data, such as news articles and social networks (Twitter, Facebook, and blogs), are generated at high speed and cover a wide range of issues; because such sources can quickly capture the economic impact of specific economic issues, they have great potential as economic indicators. Although research using unstructured data in economic analysis is in its early stages, the utilization of such data is expected to increase greatly once its usefulness is confirmed.
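
The three construction steps in this abstract can be sketched in a simplified form. The lexicon, articles, and aggregation rule below are hypothetical: each article is labelled positive, negative, or neutral from its summed word orientations, and the labels are averaged by month. The paper's aggregation per sentiment word is not reproduced.

```python
from collections import defaultdict

# Hypothetical hand-annotated lexicon: word -> semantic orientation.
LEXICON = {"growth": 1, "recovery": 1, "recession": -1, "layoffs": -1}


def score_article(text, lexicon=LEXICON):
    """Label an article 1 (positive), -1 (negative), or 0 (neutral)."""
    total = sum(
        lexicon.get(token.strip(".,;:!?").lower(), 0) for token in text.split()
    )
    if total > 0:
        return 1
    if total < 0:
        return -1
    return 0


def monthly_index(articles):
    """articles: iterable of ('YYYY-MM', text). Returns {month: mean label}."""
    by_month = defaultdict(list)
    for month, text in articles:
        by_month[month].append(score_article(text))
    return {m: sum(v) / len(v) for m, v in sorted(by_month.items())}


articles = [
    ("2017-01", "Exports signal growth and recovery."),
    ("2017-01", "Factory layoffs continue."),
    ("2017-02", "Fears of recession and layoffs grow."),
    ("2017-02", "Markets were unchanged."),
]
index = monthly_index(articles)
```

The resulting monthly series could then be compared with a survey-based index using lagged cross-correlation.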