• Title/Summary/Keyword: The time-series data

Search Result 3,672, Processing Time 0.031 seconds

BST-IGT Model: Synthetic Benchmark Generation Technique Maintaining Trend of Time Series Data

  • Kim, Kyung Min;Kwak, Jong Wook
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.2
    • /
    • pp.31-39
    • /
    • 2020
  • In this paper, we introduce a technique for generating synthetic benchmarks based on time series data. Many of the data measured on IoT devices have a time series characteristic that measures numerical changes over time. However, there is a problem that it is difficult to model the data measured over a long period as generalized time series data. To solve this problem, this paper introduces the BST-IGT model. The BST-IGT model separates the entire data into sections that can be easily time-series modeled, collects the generated data into templates, and produces new synthetic benchmarks that share or modify characteristics based on them. As a result of making a new benchmark using the proposed modeling method, we could create a benchmark with multiple aspects by mixing the composite benchmark with the statistical features of the existing data and other benchmarks.

Efficient Time-Series Similarity Measurement and Ranking Based on Anomaly Detection (이상탐지 기반의 효율적인 시계열 유사도 측정 및 순위화)

  • Ji-Hyun Choi;Hyun Ahn
    • Journal of Internet Computing and Services
    • /
    • v.25 no.2
    • /
    • pp.39-47
    • /
    • 2024
  • Time series analysis is widely employed by many organizations to solve business problems, as it extracts various information and insights from chronologically ordered data. Among its applications, measuring time series similarity is a step to identify time series with similar patterns, which is very important in time series analysis applications such as time series search and clustering. In this study, we propose an efficient method for measuring time series similarity that focuses on anomalies rather than the entire series. In this regard, we validate the proposed method by measuring and analyzing the rank correlation between the similarity measure for the set of subsets extracted by anomaly detection and the similarity measure for the whole time series. Experimental results, especially with stock time series data and an anomaly proportion of 10%, demonstrate a Spearman's rank correlation coefficient of up to 0.9. In conclusion, the proposed method can significantly reduce computation cost of measuring time series similarity, while providing reliable time series search and clustering results.

Statistical Inference for Space Time Series Model with Application to Mumps Data

  • Jeong, Ae-Ran;Kim, Sun-Woo;Lee, Sung-Duck
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.2
    • /
    • pp.475-486
    • /
    • 2006
  • Space time series data can be viewed either as a set of time series collected simultaneously at a number of spatial locations or as sets of spatial data collected at a number of time points. The major purpose of this article is to formulate a class of space time autoregressive moving average (STARMA) model, to discuss some of the their statistical properties such as model identification approaches, some procedure for estimation and the predictions. For illustration, we apply this STARMA model to the mumps data. The data set of mumps cases consists of the number of cases of mumps reported from twelve states monthly over the years 1969-1988.

  • PDF

The Evaluation of the Annual Time Series Data for the Mean Sea Level of the West Coast by Regression Model (회귀모형에 의한 서해안 평균해면의 연시계열자료의 평가)

  • 조기태;박영기;이장춘
    • Journal of Environmental Science International
    • /
    • v.9 no.1
    • /
    • pp.19-25
    • /
    • 2000
  • As the tideland reclamation is done on a large scale these days, construction work is active in the coastal areas. Facilities in the coastal areas must be built with the tide characteristics taken into consideration. Thus the tide characteristics affect the overall reclamation plan. The analysis of the tide data boils down to a harmonic analysis of the hourly changes of long-term tide data and extraction of unharmonic coefficients from the results. Since considerable amount of tide data of the West Coast are available, the existing data can be collected and can be used to obtain the temporal changes of the tide by being fitted into the tide prediction model. The goal of this thesis lies in assessing whether the mean sea level used in the field agrees with the analysis results from the long-term observation data obtained with their homogeneity guaranteed. To achieve this goal, the research was conducted as follows. First the present conditions of the observation stations, the land level standard, and the sea level standard were analyzed to set up a time series model formula for representing them. To secure the homogeneity of the time series, each component was separated. Lastly the mean sea level used in the field was assessed based on the results obtained form the analysis of the time series.

  • PDF

Recent Trends in the Application of Extreme Learning Machines for Online Time Series Data (온라인 시계열 자료를 위한 익스트림 러닝머신 적용의 최근 동향)

  • YeoChang Yoon
    • The Journal of Bigdata
    • /
    • v.8 no.2
    • /
    • pp.15-25
    • /
    • 2023
  • Extreme learning machines (ELMs) are a major analytical method in various prediction fields. ELMs can accurately predict even if the data contains noise or is nonlinear by learning the complex patterns of time series data through optimal learning. This study presents the recent trends of machine learning models that are mainly studied as tools for analyzing online time series data, along with the application characteristics using existing algorithms. In order to efficiently learn large-scale online data that is continuously and explosively generated, it is necessary to have a learning technology that can perform well even in properties that can evolve in various ways. Therefore, this study examines a comprehensive overview of the latest machine learning models applied to big data in the field of time series prediction, discusses the general characteristics of the latest models that learn online data, which is one of the major challenges of machine learning for big data, and how efficiently they can learn and use online time series data for prediction, and proposes alternatives.

VaR(Value at Risk) for Korean Financial Time Series

  • Hwang, S.Y.;Park, J.
    • Journal of the Korean Data and Information Science Society
    • /
    • v.16 no.2
    • /
    • pp.283-288
    • /
    • 2005
  • Value at Risk(VaR) has been proven useful in finance literature as a tool of risk management(cf. Jorion(2001)). This article is concerned with introducing VaR to various Korean financial time series. Five daily data sets with sample period ranging from 2000 and 2004 such as KOSPI, KOSPI 200, KOSDAQ, KOSDAQ 50 and won-dollar exchange rate are analyzed using GARCH modeling and in turn VaR is obtained for each data.

  • PDF

A Study on the Test and Visualization of Change in Structures Associated with the Occurrence of Non-Stationary of Long-Term Time Series Data Based on Unit Root Test (Unit Root Test를 기반으로 한 장기 시계열 데이터의 Non-Stationary 발생에 따른 구조 변화 검정 및 시각화 연구)

  • Yoo, Jaeseong;Choo, Jaegul
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.8 no.7
    • /
    • pp.289-302
    • /
    • 2019
  • Structural change of time series means that the distribution of observations is relatively stable in the period of constituting the entire time series data, but shows a sudden change of the distribution characteristic at a specific time point. Within a non-stationary long-term time series, it is important to determine in a timely manner whether the change in short-term trends is transient or structurally changed. This is because it is necessary to always detect the change of the time series trend and to take appropriate measures to cope with the change. In this paper, we propose a method for decision makers to easily grasp the structural changes of time series by visualizing the test results based on the unit root test. Particularly, it is possible to grasp the short-term structural changes even in the long-term time series through the method of dividing the time series and testing it.

Finding associations between genes by time-series microarray sequential patterns analysis

  • Nam, Ho-Jung;Lee, Do-Heon
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.161-164
    • /
    • 2005
  • Data mining techniques can be applied to identify patterns of interest in the gene expression data. One goal in mining gene expression data is to determine how the expression of any particular gene might affect the expression of other genes. To find relationships between different genes, association rules have been applied to gene expression data set [1]. A notable limitation of association rule mining method is that only the association in a single profile experiment can be detected. It cannot be used to find rules across different condition profiles or different time point profile experiments. However, with the appearance of time-series microarray data, it became possible to analyze the temporal relationship between genes. In this paper, we analyze the time-series microarray gene expression data to extract the sequential patterns which are similar to the association rules between genes among different time points in the yeast cell cycle. The sequential patterns found in our work can catch the associations between different genes which express or repress at diverse time points. We have applied sequential pattern mining method to time-series microarray gene expression data and discovered a number of sequential patterns from two groups of genes (test, control) and more sequential patterns have been discovered from test group (same CO term group) than from the control group (different GO term group). This result can be a support for the potential of sequential patterns which is capable of catching the biologically meaningful association between genes.

  • PDF

Test of Homogeneity for Panel Bilinear Time Series Model (패널 중선형 시계열 모형의 동질성 검정)

  • Lee, ShinHyung;Kim, SunWoo;Lee, SungDuck
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.3
    • /
    • pp.521-529
    • /
    • 2013
  • The acceptance of the test of the homogeneity for panel time series models allows for the pooling of the series to achieve parsimony. In this paper, we introduce a panel bilinear time series model as well as derive the stationary condition and the limiting distribution of the test statistic of the homogeneity test for the model. For the applications study, we use Korea Mumps data from January 2001 to December 2008. Finally, we perform test of homogeneity for the panel data with 8 independent bilinear time series.

Time-series Analysis and Prediction of Future Trends of Groundwater Level in Water Curtain Cultivation Areas Using the ARIMA Model (ARIMA 모델을 이용한 수막재배지역 지하수위 시계열 분석 및 미래추세 예측)

  • Baek, Mi Kyung;Kim, Sang Min
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.65 no.2
    • /
    • pp.1-11
    • /
    • 2023
  • This study analyzed the impact of greenhouse cultivation area and groundwater level changes due to the water curtain cultivation in the greenhouse complexes. The groundwater observation data in the Miryang study area were used and classified into greenhouse and field cultivation areas to compare the groundwater impact of water curtain cultivation in the greenhouse complex. We identified the characteristics of the groundwater time series data by the terrain of the study area and selected the optimal model through time series analysis. We analyzed the time series data for each terrain's two representative groundwater observation wells. The Seasonal ARIMA model was chosen as the optimal model for riverside well, and for plain and mountain well, the ARIMA model and Seasonal ARIMA model were selected as the optimal model. A suitable prediction model is not limited to one model due to a change in a groundwater level fluctuation pattern caused by a surrounding environment change but may change over time. Therefore, it is necessary to periodically check and revise the optimal model rather than continuously applying one selected ARIMA model. Groundwater forecasting results through time series analysis can be used for sustainable groundwater resource management.