• Title/Summary/Keyword: time series of counts data

Search Result 23, Processing Time 0.029 seconds

Effects of Overdispersion on Testing for Serial Dependence in the Time Series of Counts Data

  • Kim, Hee-Young;Park, You-Sung
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.6
    • /
    • pp.829-843
    • /
    • 2010
  • To test for the serial dependence in time series of counts data, Jung and Tremayne (2003) evaluated the size and power of several tests under the class of INARMA models based on binomial thinning operations for Poisson marginal distributions. The overdispersion phenomenon(i.e., a variance greater than the expectation) is common in the real world. Overdispersed count data can be modeled by using alternative thinning operations such as random coefficient thinning, iterated thinning, and quasi-binomial thinning. Such thinning operations can lead to time series models of counts with negative binomial or generalized Poisson marginal distributions. This paper examines whether the test statistics used by Jung and Tremayne (2003) on serial dependence in time series of counts data are affected by overdispersion.

Stochastic structures of world's death counts after World War II

  • Lee, Jae J.
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.3
    • /
    • pp.353-371
    • /
    • 2022
  • This paper analyzes death counts after World War II of several countries to identify and to compare their stochastic structures. The stochastic structures that this paper entertains are three structural time series models, a local level with a random walk model, a fixed local linear trend model and a local linear trend model. The structural time series models assume that a time series can be formulated directly with the unobserved components such as trend, slope, seasonal, cycle and daily effect. Random effect of each unobserved component is characterized by its own stochastic structure and a distribution of its irregular component. The structural time series models use the Kalman filter to estimate unknown parameters of a stochastic model, to predict future data, and to do filtering data. This paper identifies the best-fitted stochastic model for three types of death counts (Female, Male and Total) of each country. Two diagnostic procedures are used to check the validity of fitted models. Three criteria, AIC, BIC and SSPE are used to select the best-fitted valid stochastic model for each type of death counts of each country.

Exploring COVID-19 in mainland China during the lockdown of Wuhan via functional data analysis

  • Li, Xing;Zhang, Panpan;Feng, Qunqiang
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.1
    • /
    • pp.103-125
    • /
    • 2022
  • In this paper, we analyze the time series data of the case and death counts of COVID-19 that broke out in China in December, 2019. The study period is during the lockdown of Wuhan. We exploit functional data analysis methods to analyze the collected time series data. The analysis is divided into three parts. First, the functional principal component analysis is conducted to investigate the modes of variation. Second, we carry out the functional canonical correlation analysis to explore the relationship between confirmed and death cases. Finally, we utilize a clustering method based on the Expectation-Maximization (EM) algorithm to run the cluster analysis on the counts of confirmed cases, where the number of clusters is determined via a cross-validation approach. Besides, we compare the clustering results with some migration data available to the public.

Hierarchical time series forecasting with an application to traffic accident counts (계층적 시계열 분석을 이용한 지역별 교통사고 발생건수 예측)

  • Lee, Jooeun;Seong, Byeongchan
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.1
    • /
    • pp.181-193
    • /
    • 2017
  • The paper introduces bottom-up and optimal combination methods that can analyze and forecast hierarchical time series. These methods allow forecasts at lower levels to be summed consistently to upper levels without any ad-hoc adjustment. They can also potentially improve forecast performance in comparison to independent forecasts. We forecast regional traffic accident counts as time series data in order to identify efficiency gains from hierarchical forecasting. We observe that bottom-up or optimal combination methods are superior to independent methods in terms of forecast accuracy.

Volatility clustering in data breach counts

  • Shim, Hyunoo;Kim, Changki;Choi, Yang Ho
    • Communications for Statistical Applications and Methods
    • /
    • v.27 no.4
    • /
    • pp.487-500
    • /
    • 2020
  • Insurers face increasing demands for cyber liability; entailed in part by a variety of new forms of risk of data breaches. As data breach occurrences develop, our understanding of the volatility in data breach counts has also become important as well as its expected occurrences. Volatility clustering, the tendency of large changes in a random variable to cluster together in time, are frequently observed in many financial asset prices, asset returns, and it is questioned whether the volatility of data breach occurrences are also clustered in time. We now present volatility analysis based on INGARCH models, i.e., integer-valued generalized autoregressive conditional heteroskedasticity time series model for frequency counts due to data breaches. Using the INGARCH(1, 1) model with data breach samples, we show evidence of temporal volatility clustering for data breaches. In addition, we present that the firms' volatilities are correlated between some they belong to and that such a clustering effect remains even after excluding the effect of financial covariates such as the VIX and the stock return of S&P500 that have their own volatility clustering.

Temporal hierarchical forecasting with an application to traffic accident counts (시간적 계층을 이용한 교통사고 발생건수 예측)

  • Jun, Gwanyoung;Seong, Byeongchan
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.2
    • /
    • pp.229-239
    • /
    • 2018
  • This paper introduces how to adopt the concept of temporal hierarchies to forecast time series data. Similarly as in hierarchical cross-sectional data, temporal hierarchies can be constructed for any time series data by means of non-overlapping temporal aggregation. Reconciliation forecasts with temporal hierarchies result in more accurate and robust forecasts when compared with the independent base and bottom-up forecasts. As an empirical example, we forecast traffic accident counts with temporal hierarchies and observe that reconciliation forecasts are superior to the base and bottom-up forecasts in terms of forecast accuracy.

A Study on Performance Evaluation of Various Kriging Models for Estimating AADT (연평균 일교통량 산정을 위한 다양한 크리깅 방법의 성능 평가에 대한 연구)

  • Ha, Jung Ah;Oh, Sei-Chang;Heo, Tae-Young
    • Journal of Korean Society of Transportation
    • /
    • v.32 no.4
    • /
    • pp.380-388
    • /
    • 2014
  • Annual average daily traffic(AADT) serves as important basic data in the transportation sector. AADT is used as design traffic which is the basic traffic volume in transportation planning. Despite of its importance, at most locations, AADT is estimated using short term traffic counts. An accurate AADT is calculated through permanent traffic counts at limited locations. This study dealt with estimating AADT using various models considering both the spatial correlation and time series data. Kriging models which are commonly used spatial statistics methods were applied and compared with each model. Additionally the External Universal kriging model, which includes explanatory variables, was used to assure accuracy of AADT estimation. For evaluation of various kriging methods, AADT estimation error, proposed using national highway permanent traffic count data, was analyzed and their performances were compared. The result shows the accuracy enhancement of the AADT estimation.

Effect of Air Pollution on Emergency Room Visits for Asthma : a Time Series Analysis (대기오염과 천식발작의 관련성에 관한 시계열적 연구)

  • Ju, Young-Su;Cho, Soo-Hun
    • Journal of Preventive Medicine and Public Health
    • /
    • v.34 no.1
    • /
    • pp.61-72
    • /
    • 2001
  • Objectives : To evaluate the hypothesis that increasing ambient levels of ozone or particulate matter are associated with increased emergency room visits for asthma and to quantify the strength of association, if any, between these. Methods : Daily counts of emergency room visits for asthma, air quality, and weather data were collected from hospitals with over 200 beds and from monitoring Stations in Seoul, Korea from 1994 through 1997. Daily counts of emergency mom visits for asthma attack were analyzed using a general additive Poisson model, with adjustment for the effects of secular trend, seasonal variation, Sunday and holiday, temperature, and humidly, according to levels of ozone and particulate matter. Results : The association between daily counts of emergency room visits for asthma attack and ozone levels was statistically significant in summer(from June to August), and the RR by unit inclement of 100 ppb ozone was 1.30(95% CI = $1.11\sim1.52$) without lag time. With restriction of the period from April to September in 1996, the RR was 1.37(95% CI = $1.06\sim1.76$), and from June to August in 1995, the RR was 1.62(95% CI = $1.12\sim2.35$). In the data for children$(5\sim14yr)$, the RR was 2.57(95% CI = $1.31\sim5.05$) with restriction of the period from April to September in 1997. There was no Significant association between TSP levels and asthma attacks, but a slight association was seen between PM10 levels and asthma attacks in a very restricted period. Conclusion : There was a statistically significant association between ambient levels of ozone and daily counts of emergency room visits for asthma attack. Therefore, we must make efforts to effectively minimize air pollution, in order to protect public health.

  • PDF

Robust CUSUM test for time series of counts and its application to analyzing the polio incidence data

  • Kang, Jiwon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.6
    • /
    • pp.1565-1572
    • /
    • 2015
  • In this paper, we analyze the polio incidence data based on the Poisson autoregressive models, focusing particularly on change-point detection. Since the data include some strongly deviating observations, we employ the robust cumulative sum (CUSUM) test proposed by Kang and Song (2015) to perform the test for parameter change. Contrary to the result of Kang and Lee (2014), our data analysis indicates that there is no significant change in the case of the CUSUM test with strong robustness and the same result is obtained after ridding the polio data of outliers. We additionally consider the comparison of the forecasting performance. All the results demonstrate that the robust CUSUM test performs adequately in the presence of seemingly outliers.

Time Series Forecasting on Car Accidents in Korea Using Auto-Regressive Integrated Moving Average Model (자동 회귀 통합 이동 평균 모델 적용을 통한 한국의 자동차 사고에 대한 시계열 예측)

  • Shin, Hyunkyung
    • Journal of Convergence for Information Technology
    • /
    • v.9 no.12
    • /
    • pp.54-61
    • /
    • 2019
  • Recently, IITS (intelligent integrated transportation system) has been important topic in Smart City related industry. As a main objective of IITS, prevention of traffic jam (due to car accidents) has been attempted with help of advanced sensor and communication technologies. Studies show that car accident has certain correlation with some factors including characteristics of location, weather, driver's behavior, and time of day. We concentrate our study on observing auto correlativity of car accidents in terms of time of day. In this paper, we performed the ARIMA tests including ADF (augmented Dickey-Fuller) to check the three factors determining auto-regressive, stationarity, and lag order. Summary on forecasting of hourly car crash counts is presented, we show that the traffic accident data obtained in Korea can be applied to ARIMA model and present a result that traffic accidents in Korea have property of being recurrent daily basis.