• Title/Summary/Keyword: statistical forecast model


How to improve oil consumption forecast using google trends from online big data?: the structured regularization methods for large vector autoregressive model

  • Choi, Ji-Eun;Shin, Dong Wan
    • Communications for Statistical Applications and Methods / v.29 no.1 / pp.41-51 / 2022
  • We forecast the US oil consumption level using Google Trends data, that is, the search volumes of specific terms that people search for on Google. We focus on whether proper selection of Google Trends terms improves forecast performance for oil consumption. As forecast models, we consider the least absolute shrinkage and selection operator (LASSO) regression and the structured regularization method for the large vector autoregressive (VAR-L) model of Nicholson et al. (2017), both of which automatically select the Google Trends terms and the lags of the predictors. An out-of-sample forecast comparison reveals that reducing the high-dimensional Google Trends data set to a low-dimensional one with the LASSO and VAR-L models yields better forecasts of oil consumption than frequently used forecast models such as the autoregressive model, the autoregressive distributed lag model, and the vector error correction model.
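
The automatic selection of predictors by the LASSO can be illustrated with a small sketch. This is not the authors' code: it is a plain coordinate-descent LASSO on synthetic data, where the six columns stand in for hypothetical candidate predictors (for example, lagged search-volume series) and only two of them actually drive the response. The penalty `lam`, the sample size, and all data are assumptions made for illustration.

```python
import random

def soft_threshold(z, gamma):
    """Shrink z toward zero by gamma; values inside [-gamma, gamma] become 0."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0

def lasso_coordinate_descent(X, y, lam, n_iter=200):
    """Minimise (1/2n)||y - Xb||^2 + lam * ||b||_1 by cyclic coordinate descent."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residual y - X beta (beta starts at zero)
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Covariance of column j with the partial residual (beta_j added back).
            rho = sum(X[i][j] * (resid[i] + X[i][j] * beta[j]) for i in range(n)) / n
            new_bj = soft_threshold(rho, lam) / col_sq[j]
            delta = new_bj - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new_bj
    return beta

# Synthetic data (assumption): 6 candidate predictors, only columns 0 and 3 matter.
random.seed(0)
n, p = 200, 6
X = [[random.gauss(0.0, 1.0) for _ in range(p)] for _ in range(n)]
y = [2.0 * row[0] - 1.5 * row[3] + random.gauss(0.0, 0.1) for row in X]

beta = lasso_coordinate_descent(X, y, lam=0.1)
selected = [j for j, b in enumerate(beta) if abs(b) > 1e-8]
print("coefficients:", [round(b, 3) for b in beta])
print("selected columns:", selected)
```

The L1 penalty shrinks the coefficients of irrelevant columns to zero, which is what turns a high-dimensional candidate set into a low-dimensional model. The retained coefficients are biased toward zero by roughly `lam`, so in practice the penalty would be tuned, for instance by out-of-sample forecast error.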

Binary Forecast of Heavy Snow Using Statistical Models

  • Sohn, Keon-Tae
    • Communications for Statistical Applications and Methods / v.13 no.2 / pp.369-378 / 2006
  • This study focuses on the binary forecast of the occurrence of heavy snow in the Honam area based on the MOS (model output statistics) method. We use the daily amount of snow cover at 17 stations during the cold season (November to March) from 2001 to 2005 and the corresponding 45 RDAPS outputs. A logistic regression model and neural networks are applied to predict the probability of occurrence of heavy snow. Based on the distribution of the estimated probabilities, optimal thresholds are determined via the true skill score. According to the comparison results, the logistic regression model is recommended.
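
Choosing a probability threshold by the true skill score can be sketched as follows. The score is the hit rate minus the false alarm rate. The probabilities, outcomes, and candidate thresholds below are invented for illustration and are not taken from the study.

```python
def true_skill_score(probs, events, threshold):
    """Hit rate minus false alarm rate when forecasting 'event' if prob >= threshold."""
    hits = misses = false_alarms = correct_negatives = 0
    for p, e in zip(probs, events):
        forecast_yes = p >= threshold
        if forecast_yes and e:
            hits += 1
        elif not forecast_yes and e:
            misses += 1
        elif forecast_yes and not e:
            false_alarms += 1
        else:
            correct_negatives += 1
    hit_rate = hits / (hits + misses) if hits + misses else 0.0
    false_alarm_rate = (
        false_alarms / (false_alarms + correct_negatives)
        if false_alarms + correct_negatives
        else 0.0
    )
    return hit_rate - false_alarm_rate

def best_threshold(probs, events, candidates):
    """Return the candidate threshold with the highest true skill score."""
    return max(candidates, key=lambda t: true_skill_score(probs, events, t))

# Hypothetical estimated probabilities and observed outcomes (1 = heavy snow).
probs = [0.05, 0.10, 0.20, 0.30, 0.35, 0.50, 0.60, 0.80, 0.90]
events = [0, 0, 0, 1, 0, 1, 0, 1, 1]
candidates = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]

threshold = best_threshold(probs, events, candidates)
score = true_skill_score(probs, events, threshold)
print(threshold, round(score, 3))  # 0.3 0.6
```

With this toy data, a threshold of 0.3 catches all four events at the cost of two false alarms out of five non-events, which gives the highest score among the candidates.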

Leave-one-out Bayesian model averaging for probabilistic ensemble forecasting

  • Kim, Yongdai;Kim, Woosung;Ohn, Ilsang;Kim, Young-Oh
    • Communications for Statistical Applications and Methods / v.24 no.1 / pp.67-80 / 2017
  • Over the last few decades, ensemble forecasts based on global climate models have become an important part of climate forecasting because of their ability to reduce uncertainty in prediction. Moreover, in ensemble forecasting, assessing the prediction uncertainty is as important as estimating the optimal weights, and this is achieved through a probabilistic forecast based on the predictive distribution of the future climate. Bayesian model averaging has received much attention as a tool for probabilistic forecasting because of its simplicity and superior prediction. In this paper, we propose a new Bayesian model averaging method for probabilistic ensemble forecasting. The proposed method combines a deterministic ensemble forecast based on a multivariate regression approach with Bayesian model averaging. We demonstrate that the proposed method predicts better than the standard Bayesian model averaging approach by analyzing monthly average precipitation and temperature for ten cities in Korea.
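
For orientation, the predictive distribution in standard Bayesian model averaging is a weighted mixture of member-centred distributions. The sketch below computes the mean and variance of such a mixture with Gaussian components. It illustrates the standard approach only, not the leave-one-out method proposed in the paper, and the member forecasts, weights, and spreads are made-up values.

```python
def bma_mean_and_variance(forecasts, weights, sigmas):
    """Mean and variance of the mixture sum_k w_k * N(f_k, sigma_k^2)."""
    total = sum(weights)
    w = [x / total for x in weights]  # normalise so the weights sum to 1
    mean = sum(wk * fk for wk, fk in zip(w, forecasts))
    second_moment = sum(
        wk * (sk ** 2 + fk ** 2) for wk, fk, sk in zip(w, forecasts, sigmas)
    )
    return mean, second_moment - mean ** 2

# Hypothetical ensemble: three member forecasts with weights and spreads.
forecasts = [10.0, 12.0, 15.0]
weights = [0.5, 0.3, 0.2]
sigmas = [1.0, 1.5, 2.0]

bma_mean, bma_var = bma_mean_and_variance(forecasts, weights, sigmas)
print(round(bma_mean, 3), round(bma_var, 3))  # 11.6 5.615
```

The variance combines the within-member spread with the disagreement between members, which is why a mixture can express more uncertainty than any single member does.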

A Model to Calibrate Expressway Traffic Forecasting Errors Considering Socioeconomic Characteristics and Road Network Structure (사회경제적 특성과 도로망구조를 고려한 고속도로 교통량 예측 오차 보정모형)

  • Yi, Yongju;Kim, Youngsun;Yu, Jeong Whon
    • International Journal of Highway Engineering / v.15 no.3 / pp.93-101 / 2013
  • PURPOSES: This study investigates the relationship of socioeconomic characteristics and road network structure with traffic growth patterns. The findings are to be used to adjust traffic forecasts produced by the traditional four-step process using relevant socioeconomic and road network data. METHODS: Comprehensive statistical analysis is used to identify key explanatory variables from historical observations of traffic forecasts, actual traffic counts, and surrounding environments. Based on the statistical results, a multiple regression model is developed to predict the effects of socioeconomic and road network attributes on traffic growth patterns. The proposed model is also validated using a different set of historical data. RESULTS: The statistical analysis indicates that several socioeconomic characteristics and the road network structure clearly affect the tendency to over- or under-estimate road traffic. Among them, land use is a key factor, as revealed by the fact that traffic forecasts for urban roads tend to be under-estimated while rural road traffic predictions are generally over-estimated. The model application suggests that adjusting the traffic forecast with the proposed model can reduce the discrepancy between predicted and actual traffic counts from 30.4% to 21.9%. CONCLUSIONS: Predicting road traffic growth patterns from surrounding socioeconomic and road network attributes can help develop an optimal road construction plan by improving the reliability of traffic forecasts as well as of estimated traffic growth trends.

Statistical Correction of Numerical Model Forecasts for Typhoon Tracks

  • Sohn, Keon-Tae
    • Communications for Statistical Applications and Methods / v.12 no.2 / pp.295-304 / 2005
  • This paper concentrates on the prediction of typhoon tracks using the dynamic linear model (DLM) for the statistical correction of the numerical model guidance used at the JMA. The DLM with the proposed forecast strategy is applied to reduce the systematic errors of the guidance using the latest observation. All parameters of the DLM are updated dynamically, and backward forecasting is performed to remove the effect of initial values.
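
The idea of correcting guidance with the latest observation can be sketched with the simplest DLM, a local-level model for the systematic error, updated by the Kalman filter. This is a simplified illustration and not the paper's model: the variances `W` and `V` are fixed assumptions here (the paper updates all parameters dynamically), backward forecasting is omitted, and the error series is synthetic.

```python
def local_level_update(m, C, error, W, V):
    """One Kalman step for bias_t = bias_{t-1} + w_t, error_t = bias_t + v_t.

    m, C: posterior mean and variance of the bias before seeing `error`.
    W, V: evolution and observation noise variances (assumed known).
    """
    R = C + W                 # prior variance after the bias evolves one step
    K = R / (R + V)           # Kalman gain
    m_new = m + K * (error - m)
    C_new = (1.0 - K) * R
    return m_new, C_new

# Synthetic case (assumption): guidance is consistently 2.0 units too high.
guidance_errors = [2.0] * 20  # guidance minus observation at each time step
m, C = 0.0, 10.0              # vague initial belief about the bias
for e in guidance_errors:
    m, C = local_level_update(m, C, e, W=0.1, V=1.0)

new_guidance = 135.0          # hypothetical next guidance value
corrected = new_guidance - m  # subtract the estimated systematic error
print(round(m, 3), round(corrected, 3))
```

Because the bias estimate is revised at every step, a drift in the systematic error would be tracked with a lag governed by the ratio of `W` to `V`.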

Robustness of Bayes forecast to Non-normality

  • Bansal, Ashok K.
    • Journal of the Korean Statistical Society / v.7 no.1 / pp.11-16 / 1978
  • Bayesian procedures are in vogue to revise the parameter estimates of the forecasting model in the light of actual time series data. In this paper, we study the Bayes forecast for demand and the risk when (a) 'noise' and (b) mean demand rate in a constant process model have moderately non-normal probability distributions.


Comparison of different post-processing techniques in real-time forecast skill improvement

  • Jabbari, Aida;Bae, Deg-Hyo
    • Proceedings of the Korea Water Resources Association Conference / 2018.05a / pp.150-150 / 2018
  • Numerical Weather Prediction (NWP) models provide information for weather forecasts. The highly nonlinear and complex interactions in the atmosphere are simplified in meteorological models through approximations and parameterization. These simplifications may lead to biases and errors in model results. Although the models have improved over time, their biased outputs are still a matter of concern in meteorological and hydrological studies. Thus, bias removal is an essential step prior to using the outputs of atmospheric models. The main idea of statistical bias correction methods is to develop a statistical relationship between modeled and observed variables over the same historical period. Model Output Statistics (MOS) is desirable for better matching real-time forecast data with observation records. Statistical post-processing methods relate model outputs to the observed values at the sites of interest. In this study, three methods are used to remove possible biases from the real-time outputs of the Weather Research and Forecast (WRF) model in the Imjin basin (North and South Korea). The post-processing techniques are the Linear Regression (LR), Linear Scaling (LS), and Power Scaling (PS) methods. The MOS techniques used in this study include three main steps: preprocessing of the historical data in the training set, development of the equations, and application of the equations to the validation set. The expected results show the accuracy improvement of the real-time forecast data before and after bias correction. The comparison of the different methods will clarify the best method for forecast skill enhancement in a real-time case study.
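
The three-step procedure (prepare training data, develop the equations, apply them to new data) can be sketched for the three techniques. The abstract does not give the equations, so the forms below are common textbook choices and should be read as assumptions: linear scaling multiplies by the ratio of observed to modeled means, linear regression fits `obs = a + b * model` by least squares, and power scaling fits `obs = a * model**b` by least squares on logarithms. The data are synthetic.

```python
import math

def mean(xs):
    return sum(xs) / len(xs)

def least_squares_line(xs, ys):
    """Return (intercept, slope) of the ordinary least-squares line through (xs, ys)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return my - slope * mx, slope

def fit_linear_scaling(model, obs):
    """LS (assumed form): multiply by the ratio of observed to modeled means."""
    factor = mean(obs) / mean(model)
    return lambda x: factor * x

def fit_linear_regression(model, obs):
    """LR: obs is approximated by intercept + slope * model."""
    intercept, slope = least_squares_line(model, obs)
    return lambda x: intercept + slope * x

def fit_power_scaling(model, obs):
    """PS (assumed form): obs = a * model**b, fitted on logs of positive pairs."""
    pairs = [(math.log(x), math.log(y)) for x, y in zip(model, obs) if x > 0 and y > 0]
    log_a, b = least_squares_line([p[0] for p in pairs], [p[1] for p in pairs])
    a = math.exp(log_a)
    return lambda x: a * x ** b if x > 0 else 0.0

# Synthetic training set (assumption): observations equal 0.5 * model**2.
train_model = [1.0, 2.0, 4.0, 8.0]
train_obs = [0.5, 2.0, 8.0, 32.0]

ls = fit_linear_scaling(train_model, train_obs)
lr = fit_linear_regression(train_model, train_obs)
ps = fit_power_scaling(train_model, train_obs)

new_value = 5.0  # a model output from the validation period
print(round(ls(new_value), 3), round(lr(new_value), 3), round(ps(new_value), 3))
```

On this deliberately nonlinear toy data only the power form recovers the underlying relationship, which shows why the three corrections can give different values for the same model output.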


Development of a Transfer Function Model to Forecast Ground-level Ozone Concentration in Seoul (서울지역의 지표오존농도 예보를 위한 전이함수모델 개발)

  • 김유근;손건태;문윤섭;오인보
    • Journal of Korean Society for Atmospheric Environment / v.15 no.6 / pp.779-789 / 1999
  • To support daily ground-level $O_3$ forecasting in Seoul, a transfer function model (TFM) was developed using surface meteorological data and pollutant data (previous-day [$O_3$] and [$NO_2$]) from 1 May to 31 August 1997. The forecast performance of the TFM was evaluated by statistical comparison with $O_3$ concentrations observed during September: the correlation coefficient (R), root mean squared error (RMSE), normalized mean squared error (NMSE), and mean relative error (MRE) were 0.73, 15.64, 0.006, and 0.101, respectively. The TFM appeared to have some difficulty forecasting very high $O_3$ concentrations. For comparison, a multiple regression model (MRM) was developed for the same period. According to a statistical comparison between the TFM and the MRM, the two models had similar predictive capability, but the TFM provided more accurate forecasts than the MRM for $O_3$ concentrations higher than 60 ppb. It was concluded that a statistical model based on the TFM can be useful for improving the accuracy of local $O_3$ forecasts.
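
Two of the verification statistics named above, R and RMSE, have unambiguous definitions and can be computed as in the sketch below. NMSE and MRE are left out on purpose because their normalisation varies between studies and the abstract does not state which form was used. The observed and forecast values are invented.

```python
import math

def rmse(obs, pred):
    """Root mean squared error between observations and forecasts."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))

def pearson_r(obs, pred):
    """Pearson correlation coefficient between observations and forecasts."""
    mo = sum(obs) / len(obs)
    mp = sum(pred) / len(pred)
    sxy = sum((o - mo) * (p - mp) for o, p in zip(obs, pred))
    sxx = sum((o - mo) ** 2 for o in obs)
    syy = sum((p - mp) ** 2 for p in pred)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical observed and forecast concentrations (ppb).
observed = [10.0, 20.0, 30.0, 40.0]
forecast = [12.0, 18.0, 33.0, 37.0]

error_rmse = rmse(observed, forecast)
error_r = pearson_r(observed, forecast)
print(round(error_rmse, 3), round(error_r, 3))
```

RMSE is in the units of the data, so it penalises large misses on high-concentration days, whereas R only measures how well the forecasts track the ups and downs of the observations.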


Markov Chain Approach to Forecast in the Binomial Autoregressive Models

  • Kim, Hee-Young;Park, You-Sung
    • Communications for Statistical Applications and Methods / v.17 no.3 / pp.441-450 / 2010
  • In this paper we consider the problem of forecasting binomial time series modelled by the binomial autoregressive model. This model was proposed by McKenzie (1985) and extended to a higher order by Weiß (2009). Since the binomial autoregressive model is a Markov chain, we can apply the earlier work of Bu and McCabe (2008) for the integer-valued autoregressive (INAR) model to the binomial autoregressive model. We discuss how to compute the h-step-ahead forecast of the conditional probabilities of $X_{T+h}$ when T periods are used in fitting. We then obtain the maximum likelihood estimator of the binomial autoregressive model and use it to derive the maximum likelihood estimator of the h-step-ahead forecast of the conditional probabilities of $X_{T+h}$. The methodology is illustrated by applying it to a data set previously analyzed by Weiß (2009).
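
The Markov chain view of the h-step-ahead forecast can be sketched for the first-order case. The code assumes the usual binomial AR(1) form, in which each of the `X_{t-1}` units in state 1 stays there with probability `alpha` and each of the remaining `n - X_{t-1}` units enters with probability `beta`. The forecast distribution of `X_{T+h}` is then the row of the h-th power of the transition matrix indexed by the last observation. Parameter values are illustrative, and estimation by maximum likelihood is not shown.

```python
from math import comb

def binom_pmf(k, n, p):
    """Binomial probability P(K = k) for K ~ Bin(n, p); 0 outside 0..n."""
    if k < 0 or k > n:
        return 0.0
    return comb(n, k) * p ** k * (1.0 - p) ** (n - k)

def transition_matrix(n, alpha, beta):
    """P[l][k] = P(X_t = k | X_{t-1} = l) for the binomial AR(1) model."""
    P = []
    for l in range(n + 1):
        row = []
        for k in range(n + 1):
            # m survivors out of l, plus k - m newcomers out of n - l.
            prob = sum(
                binom_pmf(m, l, alpha) * binom_pmf(k - m, n - l, beta)
                for m in range(0, min(k, l) + 1)
            )
            row.append(prob)
        P.append(row)
    return P

def h_step_distribution(P, x_T, h):
    """Conditional distribution of X_{T+h} given X_T = x_T."""
    size = len(P)
    dist = [0.0] * size
    dist[x_T] = 1.0
    for _ in range(h):
        dist = [sum(dist[l] * P[l][k] for l in range(size)) for k in range(size)]
    return dist

# Illustrative parameters (assumptions): n = 5, alpha = 0.7, beta = 0.2.
P = transition_matrix(5, 0.7, 0.2)
dist = h_step_distribution(P, x_T=4, h=3)
forecast_mean = sum(k * p for k, p in enumerate(dist))
print([round(p, 4) for p in dist], round(forecast_mean, 4))
```

As a check, the conditional mean of this model is linear in the previous value, so the 3-step forecast mean from `x_T = 4` should equal `2 + 0.5**3 * (4 - 2) = 2.25`, which the distribution reproduces.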

A Time Series-Based Statistical Approach for Trade Turnover Forecasting and Assessing: Evidence from China and Russia

  • DING, Xiao Wei
    • The Journal of Asian Finance, Economics and Business / v.9 no.4 / pp.83-92 / 2022
  • Due to the uncertainty in the order of the integrated model, the SARIMA-LSTM, SARIMA-SVR, LSTM-SARIMA, and SVR-SARIMA models are each constructed to determine the best combined model for forecasting the China-Russia trade turnover. Meanwhile, the effect of the order of the combined models on the prediction results is analyzed. Using indicators such as MAPE and RMSE, we compare and evaluate the predictive performance of the different models. The results show that the SARIMA-LSTM model combines the SARIMA model's short-term forecasting advantage with the LSTM model's long-term forecasting advantage, has the highest forecast accuracy of all the models, and can accurately predict the trend of China-Russia trade turnover in the post-epidemic period. Furthermore, the SARIMA-LSTM model has a higher forecast accuracy than the LSTM-ARIMA model. Nevertheless, the SARIMA-SVR model's forecast accuracy is lower than the SVR-SARIMA model's. As a result, the combined models' order has no bearing on the prediction outcomes for the China-Russia trade turnover time series.