• Title/Summary/Keyword: ARIMA Models

Search Result 188, Processing Time 0.027 seconds

Time series models for predicting the trend of voice phishing: seasonality and exogenous variables approaches (보이스피싱 발생 추이 예측을 위한 시계열 모형 연구: 계절성과 외생변수 활용)

  • Da-Yeon Kang;Seung-Yeon Lee;Eunju Hwang
    • Convergence Security Journal
    • /
    • v.24 no.2
    • /
    • pp.151-160
    • /
    • 2024
  • In recent years with high interest rates and inflations, which worsen people's lives, voice phishing crimes also increase along with damage. Voice phishing that becomes more evolved by technology developments causes serious financial and mental damage to victims. This work aims to study time series models for its accurate prediction. ARIMA, SARIMA and SARIMAX models are compared. As exogenous variables, the amount of damages and the numbers of arrests and criminals are adopted. Forecasting performances are evaluated. Prediction intervals are constructed along with empirical coverages, which justify the superiority of the model. Finally, the numbers of voice phishing up to December 2024 are predicted, through which we expect the establishment of future prevention strategies for voice phishing.

Short-term Predictive Models for Influenza-like Illness in Korea: Using Weekly ILI Surveillance Data and Web Search Queries (한국 인플루엔자 의사환자 단기 예측 모형 개발: 주간 ILI 감시 자료와 웹 검색 정보의 활용)

  • Jung, Jae Un
    • Journal of Digital Convergence
    • /
    • v.16 no.9
    • /
    • pp.147-157
    • /
    • 2018
  • Since Google launched a prediction service for influenza-like illness(ILI), studies on ILI prediction based on web search data have proliferated worldwide. In this regard, this study aims to build short-term predictive models for ILI in Korea using ILI and web search data and measure the performance of the said models. In these proposed ILI predictive models specific to Korea, ILI surveillance data of Korea CDC and Korean web search data of Google and Naver were used along with the ARIMA model. Model 1 used only ILI data. Models 2 and 3 added Google and Naver search data to the data of Model 1, respectively. Model 4 included a common query used in Models 2 and 3 in addition to the data used in Model 1. In the training period, the goodness of fit of all predictive models was higher than 95% ($R^2$). In predictive periods 1 and 2, Model 1 yielded the best predictions (99.98% and 96.94%, respectively). Models 3(a), 4(b), and 4(c) achieved stable predictability higher than 90% in all predictive periods, but their performances were not better than that of Model 1. The proposed models that yielded accurate and stable predictions can be applied to early warning systems for the influenza pandemic in Korea, with supplementary studies on improving their performance.

Prediction and Causality Examination of the Environment Service Industry and Distribution Service Industry (환경서비스업과 물류서비스업의 예측 및 인과성 검정)

  • Sun, Il-Suck;Lee, Choong-Hyo
    • Journal of Distribution Science
    • /
    • v.12 no.6
    • /
    • pp.49-57
    • /
    • 2014
  • Purpose - The world now recognizes environmental disruption as a serious issue when regarding growth-oriented strategies; therefore, environmental preservation issues become pertinent. Consequently, green distribution is continuously emphasized. However, studying the prediction and association of distribution and the environment is insufficient. Most existing studies about green distribution are about its necessity, detailed operation methods, and political suggestions; it is necessary to study the distribution service industry and environmental service industry together, for green distribution. Research design, data, and methodology - ARIMA (auto-regressive moving average model) was used to predict the environmental service and distribution service industries, and the Granger Causality Test based on VAR (vector auto regressive) was used to analyze the causal relationship. This study used 48 quarters of time-series data, from the 4th quarter in 2001 to the 3rd quarter in 2013, about each business type's production index, and used an unchangeable index. The production index about the business type is classified into the current index and the unchangeable index. The unchangeable index divides the current index into deflators to remove fluctuation. Therefore, it is easy to analyze the actual production index. This study used the unchangeable index. Results - The production index of the distribution service industry and the production index of the environmental service industry consider the autocorrelation coefficient and partial autocorrelation coefficient; therefore, ARIMA(0,0,2)(0,1,1)4 and ARIMA(3,1,0)(0,1,1)4 were established as final prediction models, resulting in the gradual improvement in every production index of both types of business. Regarding the distribution service industry's production index, it is predicted that the 4th quarter in 2014 is 114.35, and the 4th quarter in 2015 is 123.48. Moreover, regarding the environmental service industry's production index, it is predicted that the 4th quarter in 2014 is 110.95, and the 4th quarter in 2015 is 111.67. In a causal relationship analysis, the environmental service industry impacts the distribution service industry, but the distribution service industry does not impact the environmental service industry. Conclusions - This study predicted the distribution service industry and environmental service industry with the ARIMA model, and examined the causal relationship between them through the Granger causality test based on the VAR Model. Prediction reveals the seasonality and gradual increase in the two industries. Moreover, the environmental service industry impacts the distribution service industry, but the distribution service industry does not impact the environmental service industry. This study contributed academically by offering base line data needed in the establishment of a future style of management and policy directions for the two industries through the prediction of the distribution service industry and the environmental service industry, and tested a causal relationship between them, which is insufficient in existing studies. The limitations of this study are that deeper considerations of advanced studies are deficient, and the effect of causality between the two types of industries on the actual industry was not established.

A Space-Time Model with Application to Annual Temperature Anomalies;

  • Lee, Eui-Kyoo;Moon, Myung-Sang;Gunst, Richard F.
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.1
    • /
    • pp.19-30
    • /
    • 2003
  • Spatiotemporal statistical models are used for analyzing space-time data in many fields, such as environmental sciences, meteorology, geology, epidemiology, forestry, hydrology, fishery, and so on. It is well known that classical spatiotemporal process modeling requires the estimation of space-time variogram or covariance functions. In practice, the estimation of such variogram or covariance functions are computationally difficult and highly sensitive to data structures. We investigate a Bayesian hierarchical model which allows the specification of a more realistic series of conditional distributions instead of computationally difficult and less realistic joint covariance functions. The spatiotemporal model investigated in this study allows both spatial component and autoregressive temporal component. These two features overcome the inability of pure time series models to adequately predict changes in trends in individual sites.

Forecasting Chinese Yuan/USD Via Combination Techniques During COVID-19

  • ASADULLAH, Muhammad;UDDIN, Imam;QAYYUM, Arsalan;AYUBI, Sharique;SABRI, Rabia
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.8 no.5
    • /
    • pp.221-229
    • /
    • 2021
  • This study aims to forecast the exchange rate of the Chinese Yuan against the US Dollar by a combination of different models as proposed by Poon and Granger (2003) during the Covid-19 pandemic. For this purpose, we include three uni-variate time series models, i.e., ARIMA, Naïve, Exponential smoothing, and one multivariate model, i.e., NARDL. This is the first of its kind endeavor to combine univariate models along with NARDL to the best of our knowledge. Utilizing monthly data from January 2011 to December 2020, we predict the Chinese Yuan against the US dollar by two combination criteria i.e. var-cor and equal weightage. After finding out the individual accuracy, the models are then assessed through equal weightage and var-cor methods. Our results suggest that Naïve outperforms all individual & combination of time series models. Similarly, the combination of NARDL and Naïve model again outperformed all of the individual as well as combined models except the Naïve model, with the lowest MAPE value of 0764. The results suggesting that the Chinese Yuan exchange rate against the US Dollar is dependent upon the recent observations of the time series. Further evidence shows that the combination of models plays a vital role in forecasting which commensurate with the literature.

Forecasting Exchange Rates: An Empirical Application to Pakistani Rupee

  • ASADULLAH, Muhammad;BASHIR, Adnan;ALEEMI, Abdur Rahman
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.8 no.4
    • /
    • pp.339-347
    • /
    • 2021
  • This study aims to forecast the exchange rate by a combination of different models as proposed by Poon and Granger (2003). For this purpose, we include three univariate time series models, i.e., ARIMA, Naïve, Exponential smoothing, and one multivariate model, i.e., NARDL. This is the first of its kind endeavor to combine univariate models along with NARDL to the best of our knowledge. Utilizing monthly data from January 2011 to December 2020, we predict the Pakistani Rupee against the US dollar by a combination of different forecasting techniques. The observations from M1 2020 to M12 2020 are held back for in-sample forecasting. The models are then assessed through equal weightage and var-cor methods. Our results suggest that NARDL outperforms all individual time series models in terms of forecasting the exchange rate. Similarly, the combination of NARDL and Naïve model again outperformed all of the individual as well as combined models with the lowest MAPE value of 0.612 suggesting that the Pakistani Rupee exchange rate against the US Dollar is dependent upon the macro-economic fundamentals and recent observations of the time series. Further evidence shows that the combination of models plays a vital role in forecasting, as stated by Poon and Granger (2003).

시공간 베이지안 계층모형-미국 연기온 편차자료에 적용-

  • Lee, Ui-Gyu;Mun, Myeong-Sang;Gunst, Richard F.
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2002.11a
    • /
    • pp.163-168
    • /
    • 2002
  • 전형적인 시공간모형은 시공간 변이도(semivariogram) 또는 공분산 함수(covariance function)를 필요로 한다. 본 논문에서는 계산하기 어렵고 현실적이지 못한 결합 공분산함수를 통한 고전적 모형 대신, 일련의 독립적인 조건분포를 이용하는 보다 현실적인 베이지안 계층모형을 이용한다. 미국 전 지역에 산재해 있는 138개 기온 관측소로부터 얻어진 61년(1920-1980) 동안의 연기온편차 자료에 시공간 베이지안 계층모형을 적용하고 순수시계열모형에서의 적합값과 제안된 모형의 적합값을 비교분석한다.

  • PDF

Long term trends in the Korean professional baseball (한국프로야구 기록들의 장기추세)

  • Lee, Jang Taek
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.1
    • /
    • pp.1-10
    • /
    • 2015
  • This paper offers some long term perspective on what has been happening to some baseball statistics for Korean professional baseball. The data used are league summaries by year over the period 1982-2013. For the baseball statistics, statistically significant positive correlations (p < 0.01) were found for doubles (2B), runs batted in (RBI), bases on balls (BB), strike outs (SO), grounded into double play (GIDP), hit by pitch (HBP), on base percentage (OBP), OPS, earned run average (ERA), wild pitches (WP) and walks plus hits divided by innings pitched (WHIP) increased with year. There was a statistically significant decreasing trend in the correlations for triples (3B), caught stealing (CS), errors (E), completed games (CG), shutouts (SHO) and balks (BK) with year (trend p < 0.01). The ARIMA model of Box-Jenkins is applied to find a model to forecast future baseball measures. Univariate time series results suggest that simple lag-1 models fit some baseball measures quite well. In conclusion, the single most important change in Korean professional baseball is the overall incidence of completed games (CG) downward. Also the decrease of strike outs (SO) is very remarkable.

Time Series Analysis and Forecasting of Electrical Conductivity in Coastal Aquifers (연안암반대수층의 해수침투경향성 파악을 위한 전기전도도 시계열 분석과 예측)

  • Ju, Jeong-Woung;Yeo, In Wook
    • Economic and Environmental Geology
    • /
    • v.50 no.4
    • /
    • pp.267-276
    • /
    • 2017
  • Seawater intrusion into coastal fractured rock aquifer, resulting in groundwater contamination, is of serious concern in coastal areas of Jeolla Namdo, Korea, which heavily depends on groundwater resources. Time series analysis and forecasting were carried out to analyze and predict EC which is a major indicator of seawater intrusion. Two time series models of autoregressive integrated moving average (ARIMA) and seasonal autoregressive integrated moving average (SARIMA) were tested for suggesting appropriate time series model. Time series data of EC measured over one year showed a increasing trend with short periodic fluctuations, due to tidal effect and pumping, which indicated that EC time series data tended to be non-stationary. SARIMA model was found better fitted to observed EC than any other time series model. Time series analysis and modeling was found to be a useful tool to analyze EC at coastal fractured rock aquifer subject to seawater intrusion.

Deep Learning Forecast model for City-Gas Acceptance Using Extranoues variable (외재적 변수를 이용한 딥러닝 예측 기반의 도시가스 인수량 예측)

  • Kim, Ji-Hyun;Kim, Gee-Eun;Park, Sang-Jun;Park, Woon-Hak
    • Journal of the Korean Institute of Gas
    • /
    • v.23 no.5
    • /
    • pp.52-58
    • /
    • 2019
  • In this study, we have developed a forecasting model for city- gas acceptance. City-gas corporations have to report about city-gas sale volume next year to KOGAS. So it is a important thing to them. Factors influenced city-gas have differences corresponding to usage classification, however, in city-gas acceptence, it is hard to classificate. So we have considered tha outside temperature as factor that influence regardless of usage classification and the model development was carried out. ARIMA, one of the traditional time series analysis, and LSTM, a deep running technique, were used to construct forecasting models, and various Ensemble techniques were used to minimize the disadvantages of these two methods.Experiments and validation were conducted using data from JB Corp. from 2008 to 2018 for 11 years.The average of the error rate of the daily forecast was 0.48% for Ensemble LSTM, the average of the error rate of the monthly forecast was 2.46% for Ensemble LSTM, And the absolute value of the error rate is 5.24% for Ensemble LSTM.