• Title/Abstract/Keywords: Multivariate Time Series Data

Search results: 109 items (processing time: 0.021 seconds)

Prediction of arrhythmia using multivariate time series data

  • 이민혜;노호석
    • The Korean Journal of Applied Statistics (응용통계연구) / Vol. 32, No. 5 / pp. 671-681 / 2019
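  • As the number of arrhythmia patients has recently increased, studies that predict arrhythmia using machine learning have been actively conducted. Many existing studies predicted arrhythmia from multivariate data consisting of feature variables extracted from RR interval data at a specific point in time. In this study, we consider that the pattern in which the heart condition changes over time may also carry important information for predicting arrhythmia. We therefore examine the usefulness of predicting arrhythmia with multivariate time series data obtained by extracting multivariate vectors of feature variables at regular time intervals and stacking them. In a comparison centered on the 1-Nearest Neighbor method and learners that ensemble it, we confirmed that the method based on multivariate time series data, which exploits the time series information through an appropriate time series distance function that reflects the characteristics of the series, achieves better classification performance.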
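
The idea of 1-Nearest Neighbor classification with a time series distance can be sketched as follows. The abstract does not name the distance function that was selected, so dynamic time warping (DTW) is used here purely as an assumption; the helper names, the labels, and the toy two-feature series are all hypothetical and unrelated to the RR interval data of the paper.

```python
import numpy as np


def dtw_distance(a, b):
    """Multivariate DTW between two series of shape (T, d).

    The local cost is the Euclidean distance between the d-dimensional
    observations, so all variables are warped together.
    """
    a, b = np.asarray(a, float), np.asarray(b, float)
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = np.linalg.norm(a[i - 1] - b[j - 1])
            cost[i, j] = local + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return float(cost[n, m])


def predict_1nn(train_X, train_y, query):
    """Return the label of the training series closest to `query`."""
    dists = [dtw_distance(x, query) for x in train_X]
    return train_y[int(np.argmin(dists))]


def make_series(rising, rng, length=20):
    """Toy two-feature series (hypothetical): a rising or falling trend plus noise."""
    t = np.linspace(0.0, 1.0, length)
    trend = t if rising else -t
    return np.column_stack([trend, 0.5 * trend]) + 0.05 * rng.standard_normal((length, 2))


rng = np.random.default_rng(0)
train_X = [make_series(False, rng), make_series(False, rng),
           make_series(True, rng), make_series(True, rng)]
train_y = ["normal", "normal", "arrhythmia", "arrhythmia"]  # hypothetical labels
query = make_series(True, rng)
print(predict_1nn(train_X, train_y, query))
```

An ensemble in the spirit of the abstract could combine several such 1-NN learners, each built with a different distance function, by majority vote.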

Multivariate GARCH and Its Application to Bivariate Time Series

  • Choi, M.S.;Park, J.A.;Hwang, S.Y.
    • Journal of the Korean Data and Information Science Society / Vol. 18, No. 4 / pp. 915-925 / 2007
  • Multivariate GARCH has been useful for modeling dynamic relationships between the volatilities of the component series of a multivariate time series. Methodologies including the EWMA (exponentially weighted moving average), DVEC (diagonal VEC), BEKK, and CCC (constant conditional correlation) models are comparatively reviewed for bivariate time series. In addition, these models are applied to evaluate VaR (Value at Risk) and to construct joint prediction regions. To illustrate, bivariate stock price data consisting of Samsung Electronics and LG Electronics are analysed.
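
Of the models listed, EWMA is the simplest to illustrate. The sketch below updates a bivariate covariance matrix recursively and turns the one-step-ahead forecast into a portfolio VaR under a normality assumption. The decay factor 0.94, the sample-covariance initialisation, the equal weights, and the simulated returns are all assumptions made for illustration; none of them come from the paper.

```python
import numpy as np
from statistics import NormalDist


def ewma_covariance(returns, lam=0.94):
    """One-step-ahead EWMA covariance forecast.

    Recursion: Sigma_{t+1} = lam * Sigma_t + (1 - lam) * r_t r_t^T.
    The recursion is started from the sample covariance (an assumption).
    """
    returns = np.asarray(returns, dtype=float)
    sigma = np.cov(returns, rowvar=False)
    for r in returns:
        sigma = lam * sigma + (1.0 - lam) * np.outer(r, r)
    return sigma


def portfolio_var(sigma, weights, level=0.99):
    """Normal-theory VaR of a zero-mean portfolio, as a positive loss quantile."""
    weights = np.asarray(weights, dtype=float)
    vol = float(np.sqrt(weights @ sigma @ weights))
    return NormalDist().inv_cdf(level) * vol


# Simulated bivariate daily returns standing in for two stock series.
rng = np.random.default_rng(1)
true_cov = np.array([[1.0, 0.6], [0.6, 1.5]]) * 1e-4
returns = rng.multivariate_normal([0.0, 0.0], true_cov, size=500)

sigma_next = ewma_covariance(returns)
var_99 = portfolio_var(sigma_next, [0.5, 0.5])
print(sigma_next)
print(var_99)
```

DVEC, BEKK, and CCC replace the single decay factor with estimated parameters, so they would need a likelihood-based fit rather than this fixed recursion.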

Multivariate Time Series Simulation With Component Analysis

  • 이태삼;호세살라스;주하카바넨;노재경
    • Korea Water Resources Association: Conference Proceedings / Proceedings of the Korea Water Resources Association 2008 Conference / pp. 694-698 / 2008
  • In hydrology, it is difficult to deal with multivariate time series, for example when modeling the streamflows of an entire complex river system. Normal-distribution-based models such as MARMA (Multivariate Autoregressive Moving Average) have been a major approach for modeling multivariate time series. These models have some limitations. One is the unfavorable data transformation that forces the data to follow a normal distribution. Furthermore, a high-dimensional multivariate model requires a very large parameter matrix. As an alternative, one might decompose the multivariate data into independent components and model them individually. In 1985, Lins used Principal Component Analysis (PCA). Five scores, obtained by decomposing the original data, were taken and modeled individually. One of the five scores was modeled with an AR(2) model while the others were modeled with AR(1) models. From the time series analysis of the five component scores, he noted that "principal component time series might provide a relatively simple and meaningful alternative to conventional large MARMA models". This study is inspired by that remark to develop a multivariate simulation model. The model suggested here uses Principal Component Analysis (PCA) and Independent Component Analysis (ICA). Three modeling steps are applied for simulation. (1) PCA is used to decompose the correlated multivariate data into uncorrelated data, while ICA decomposes the data into independent components. Here, the autocorrelation structure of the decomposed data is still dominant, as it is inherited from the data in the original domain. (2) Each component is resampled by block bootstrapping or K-nearest neighbor resampling. (3) The resampled components are brought back to the original domain. With the suggested approach one might expect that a) the simulated data differ from the historical data, b) no data transformation is required (in the case of ICA), and c) a complex system can be decomposed into independent components and modeled individually. The models with PCA and ICA are compared using various statistics, such as basic statistics (mean, standard deviation, skewness, autocorrelation), reservoir-related statistics, and kernel density estimates.
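
The three steps can be sketched for the PCA variant with a block bootstrap. This is a minimal illustration under stated assumptions: the function names, the block length, and the toy autocorrelated data are hypothetical, blocks are drawn independently for each component, and the ICA and K-nearest neighbor options described in the abstract are not shown.

```python
import numpy as np


def pca_decompose(data):
    """Step 1: project centered data onto principal axes (uncorrelated scores)."""
    data = np.asarray(data, dtype=float)
    mean = data.mean(axis=0)
    _, _, vt = np.linalg.svd(data - mean, full_matrices=False)
    scores = (data - mean) @ vt.T
    return mean, vt, scores


def block_bootstrap(series, block_len, rng):
    """Step 2: resample one component by concatenating randomly chosen blocks."""
    n = len(series)
    n_blocks = -(-n // block_len)  # ceiling division
    starts = rng.integers(0, n - block_len + 1, size=n_blocks)
    blocks = [series[s:s + block_len] for s in starts]
    return np.concatenate(blocks)[:n]


def simulate(data, block_len, rng):
    """Steps 1 to 3: decompose, resample each component, back-transform."""
    mean, vt, scores = pca_decompose(data)
    sim_scores = np.column_stack(
        [block_bootstrap(scores[:, k], block_len, rng) for k in range(scores.shape[1])]
    )
    return sim_scores @ vt + mean  # Step 3: back to the original domain


# Toy data: three cross-correlated AR(1) series standing in for streamflows.
rng = np.random.default_rng(3)
mix = np.array([[1.0, 0.5, 0.2], [0.0, 1.0, 0.4], [0.0, 0.0, 1.0]])
z = np.zeros((200, 3))
for t in range(1, 200):
    z[t] = 0.7 * z[t - 1] + rng.standard_normal(3)
data = z @ mix

mean, vt, scores = pca_decompose(data)
simulated = simulate(data, block_len=10, rng=rng)
print(simulated.shape)
```

Resampling whole blocks rather than single values is what preserves the short-range autocorrelation that each component inherits from the original series.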

A Simultaneous Test for Multivariate Normality and Independence with Application to Univariate Residuals

  • Park, Cheol-Yong
    • Journal of the Korean Data and Information Science Society / Vol. 17, No. 1 / pp. 115-122 / 2006
  • A test is suggested for detecting deviations from both multivariate normality and independence. This test can be used for assessing the normality and independence of univariate time series residuals. We derive the limiting distribution of the test statistic, and a simulation study is conducted to assess the accuracy of the limiting distribution in finite samples. Finally, we apply our method to a real time series data set.

Efficient Compression Algorithm with Limited Resource for Continuous Surveillance

  • Yin, Ling;Liu, Chuanren;Lu, Xinjiang;Chen, Jiafeng;Liu, Caixing
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 10, No. 11 / pp. 5476-5496 / 2016
  • Energy efficiency of resource-constrained wireless sensor networks is critical in applications such as real-time monitoring and surveillance. To improve energy efficiency and reduce energy consumption, time series data can be compressed before transmission. However, most compression algorithms for time series data were developed only for univariate scenarios, while in practice there are often multiple sensor nodes in one application and the collected data are actually multivariate time series. In this paper, we propose to compress time series data by the Lasso (least absolute shrinkage and selection operator) approximation. We show that our approach can be naturally extended to compress multivariate time series data. Our extension is novel since it constructs an optimal projection of the original multivariate data in which the best energy efficiency can be realized. The two algorithms are named ULasso (Univariate Lasso) and MLasso (Multivariate Lasso), and we also provide practical guidance for parameter selection. Finally, an empirical evaluation is conducted with several publicly available real-world data sets from different application domains. We quantify the algorithm performance by measuring the approximation error, compression ratio, and computational complexity. The results show that ULasso and MLasso are superior or at least equivalent in compression performance to LTC and PLAMlis. In particular, MLasso can significantly compress smooth multivariate time series data without breaking the major trends and important changes of the sensor network system.

Multivariate CUSUM Chart to Monitor Correlated Multivariate Time-series Observations

  • 이규영;이미림
    • Journal of Korean Society for Quality Management (품질경영학회지) / Vol. 49, No. 4 / pp. 539-550 / 2021
  • Purpose: The purpose of this study is to propose a multivariate CUSUM control chart that can quickly detect the out-of-control state while monitoring cross- and auto-correlated multivariate time series data. Methods: We first build models to estimate the observation data and calculate the corresponding residuals. Then, a multivariate CUSUM chart is applied to monitor the residuals instead of the original raw observation data. Vector Autoregression and an Artificial Neural Net are selected for the modelling, and the Separated-MCUSUM chart is selected for the monitoring. The suggested methods are tested under a number of experimental settings and their performance is compared with that of other existing methods. Results: We find that the Artificial Neural Net is more appropriate than Vector Autoregression for the modelling and show that the combination of Separated-MCUSUM with the Artificial Neural Net outperforms the other alternatives considered in this paper. Conclusion: The suggested chart has many advantages. It can monitor complicated multivariate data with cross- and auto-correlation, and it detects the out-of-control state quickly. Unlike other CUSUM charts, which find their control limits by trial-and-error simulation, the suggested chart saves considerable time and effort by approximating its control limit mathematically. We expect that the suggested chart performs both effectively and efficiently for monitoring processes with complicated correlations and frequently changing parameters.
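
The residual-based monitoring idea can be sketched as follows: fit a time series model on in-control data, then run a multivariate CUSUM on the one-step-ahead residuals. The abstract does not define the Separated-MCUSUM chart, so the sketch substitutes Crosier's MCUSUM as a stand-in, uses a least-squares VAR(1) instead of a neural net, and picks the reference value `k` and control limit `h` arbitrarily. All of these are assumptions for illustration.

```python
import numpy as np


def fit_var1(x):
    """Least-squares VAR(1) with intercept: x_t ~ [1, x_{t-1}] @ coef."""
    design = np.column_stack([np.ones(len(x) - 1), x[:-1]])
    coef, *_ = np.linalg.lstsq(design, x[1:], rcond=None)
    return coef


def var1_residuals(x, coef):
    """One-step-ahead prediction errors of the fitted VAR(1)."""
    design = np.column_stack([np.ones(len(x) - 1), x[:-1]])
    return x[1:] - design @ coef


def crosier_mcusum(resid, cov, k, h):
    """Crosier's MCUSUM on zero-mean residuals.

    Returns the chart statistics and the index of the first alarm (or None).
    """
    inv = np.linalg.inv(cov)
    s = np.zeros(resid.shape[1])
    stats = []
    for e in resid:
        v = s + e
        c = float(np.sqrt(v @ inv @ v))
        # Shrink the accumulated vector toward zero by the reference value k.
        s = np.zeros_like(s) if c <= k else v * (1.0 - k / c)
        stats.append(max(0.0, c - k))
    stats = np.array(stats)
    alarms = np.flatnonzero(stats > h)
    return stats, (int(alarms[0]) if alarms.size else None)


# Toy process: a VAR(1) whose mean shifts from t = 300 onward.
rng = np.random.default_rng(2)
A = np.array([[0.5, 0.2], [0.1, 0.4]])
x = np.zeros((400, 2))
for t in range(1, 400):
    shift = np.array([1.5, 1.5]) if t >= 300 else 0.0
    x[t] = A @ x[t - 1] + rng.standard_normal(2) + shift

coef = fit_var1(x[:300])                       # model fitted on in-control data only
cov = np.cov(var1_residuals(x[:300], coef), rowvar=False)
stats, alarm = crosier_mcusum(var1_residuals(x, coef), cov, k=0.5, h=8.0)
print(alarm)
```

Monitoring residuals rather than raw observations matters because the CUSUM recursion assumes independent inputs; a well-fitting model removes most of the autocorrelation before the chart sees the data.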

Analysis of Multivariate Financial Time Series Using Cointegration: Case Study

  • Choi, M.S.;Park, J.A.;Hwang, S.Y.
    • Journal of the Korean Data and Information Science Society / Vol. 18, No. 1 / pp. 73-80 / 2007
  • Cointegration (together with VARMA, vector ARMA) has proven useful for analyzing multivariate non-stationary data in the field of financial time series. It provides a linear combination of non-stationary component series that turns out to be a stationary series. This linear combination is referred to as the long-term equilibrium between the component series. We consider two sets of Korean bivariate financial time series and illustrate cointegration analysis on them. Specifically, the estimated VAR (vector AR) and VECM (vector error correction model) are obtained, and the CV (cointegrating vector) is found for each data set.
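
For orientation, the relation between the cointegrating vector and the VECM can be written in a standard textbook form. The symbols below are assumptions chosen for illustration and are not taken from the paper: $y_t$ is the vector of component series, $\beta$ the cointegrating vector (or matrix), $\alpha$ the adjustment coefficients, and $\Gamma_i$ the short-run coefficient matrices.

```latex
% VECM form of a VAR(p) for an I(1) vector series y_t
\Delta y_t \;=\; \alpha \beta^{\top} y_{t-1}
  \;+\; \sum_{i=1}^{p-1} \Gamma_i \, \Delta y_{t-i} \;+\; \varepsilon_t ,
\qquad
\beta^{\top} y_t \ \text{stationary}.
```

The stationary combination $\beta^{\top} y_t$ is the long-term equilibrium mentioned in the abstract, and $\alpha$ measures how strongly each component series is pulled back toward it.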

Comparison of Forecasting Performance in Multivariate Nonstationary Seasonal Time Series Models

  • 성병찬
    • Communications for Statistical Applications and Methods / Vol. 18, No. 1 / pp. 13-21 / 2011
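  • This paper studies methods for analyzing multivariate nonstationary time series data with seasonality. To this end, we consider three multivariate time series models (a seasonal cointegration model, a nonseasonal cointegration model with seasonal dummy variables, and a vector autoregressive model using differencing) and compare their forecasting performance using real Korean macroeconomic data. The cointegration models performed better in short-term forecasting, while the vector autoregressive model using differencing performed better in long-term forecasting.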

Change points detection for nonstationary multivariate time series

  • Yeonjoo Park;Hyeongjun Im;Yaeji Lim
    • Communications for Statistical Applications and Methods / Vol. 30, No. 4 / pp. 369-388 / 2023
  • In this paper, we develop a two-step procedure that detects and estimates the position of structural changes in multivariate nonstationary time series, either in the mean parameters or in the second-order structure. We first investigate the presence of a mean structural change by monitoring the data through an aggregated cumulative sum (CUSUM) type statistic, a sequential procedure that identifies the likely position of the change point on its trend. If no mean change point is detected, the proposed method proceeds to scan for second-order structural change by modeling the multivariate nonstationary time series with a multivariate locally stationary wavelet process, which allows time-localized auto-correlation and cross-dependence. Under this framework, the estimated dynamic spectral matrices derived from the local wavelet periodogram capture the time-evolving, scale-specific auto- and cross-dependence features of the data. We then monitor the change point in a lower-dimensional approximation of the space of spectral matrices over time by applying dynamic principal component analysis. Unlike existing methods, which require prior information on the type of change (mean or covariance structure) as an input, the proposed algorithm outputs both the type of change and the estimated location of its occurrence. The performance of the proposed method is demonstrated in simulations and in the analysis of two real finance datasets.
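
The first step, locating a mean change with an aggregated CUSUM-type statistic, can be illustrated with a simple offline version. The abstract does not give the exact form of the statistic, so the sketch below assumes one common construction: a standardized CUSUM per coordinate, aggregated by summing squares across coordinates, with the change point estimated at the maximizer. The function name and toy data are hypothetical, and the wavelet-based second step is not shown.

```python
import numpy as np


def aggregated_cusum(x):
    """Estimate a single mean change point in a (T, d) series.

    Returns (k_hat, agg), where k_hat is the estimated number of
    observations before the change and agg[k - 1] is the aggregated
    statistic for a split after the first k observations.
    """
    x = np.asarray(x, dtype=float)
    n = len(x)
    # Crude scale estimate; it is inflated when a shift is present,
    # which is acceptable for locating the maximizer in this sketch.
    scale = x.std(axis=0, ddof=1)
    partial = np.cumsum(x, axis=0)[:-1]        # S_k for k = 1, ..., n - 1
    k = np.arange(1, n)[:, None]
    cusum = (partial - k / n * x.sum(axis=0)) / (scale * np.sqrt(n))
    agg = (cusum ** 2).sum(axis=1)             # aggregate across coordinates
    return int(np.argmax(agg)) + 1, agg


# Toy bivariate series whose mean shifts after the first 120 observations.
rng = np.random.default_rng(4)
series = rng.standard_normal((200, 2))
series[120:] += np.array([1.0, -1.0])

k_hat, agg = aggregated_cusum(series)
print(k_hat)
```

In practice the maximum of `agg` would be compared against a threshold before declaring a change; the threshold choice is outside the scope of this sketch.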

A Review of Time Series Analysis for Environmental and Ecological Data

  • 모형호;조기종;신기일
    • Korean Journal of Environmental Biology (환경생물) / Vol. 34, No. 4 / pp. 365-373 / 2016
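  • Much of the data used in environmental and ecological analysis is collected over time. When the number of observed time points is small, the data do not provide sufficient information, so a comprehensive analysis is carried out by taking repeated measurements or surveying multiple sites. The methods used in this case are longitudinal data analysis and mixed model analysis. However, when the number of time points is large enough to provide sufficient information, repeated data are not needed, and such data are analyzed using time series analysis techniques. In particular, now that more and more data are being obtained at many time points, time series analysis techniques should be used if one wishes to examine how the variables affect one another or to predict future trends. This study introduces univariate time series analysis, intervention time series models, transfer function models, and multivariate time series models, and reviews the domestic and international research published to date. We also introduce the error correction model, which may come to play an important role in future analyses of environmental and ecological data.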