• Title/Summary/Keyword: Time Series Data Analysis

Search Result 1,835, Processing Time 0.033 seconds

Exploratory Data Analysis for Korean Stock Data with Recurrence Plots (재현그림을 통한 우리나라 주식 자료에 대한 탐색적 자료분석)

  • Jang, Dae-Heung
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.5
    • /
    • pp.807-819
    • /
    • 2013
  • A recurrence plot can be used as a graphical exploratory data analysis tool before confirmatory time series analysis. With the recurrence plot, we can obtain the structural pattern of the time series and recognize the structural change points in a time series at a glance. Korean stock data shows the usefulness of the recurrence plot as a graphical exploratory data analysis tool for time series data.

Bayes Inference for the Spatial Bilinear Time Series Model with Application to Epidemic Data

  • Lee, Sung-Duck;Kim, Duk-Ki
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.4
    • /
    • pp.641-650
    • /
    • 2012
  • Spatial time series data can be viewed as a set of time series simultaneously collected at a number of spatial locations. This paper studies Bayesian inferences in a spatial time bilinear model with a Gibbs sampling algorithm to overcome problems in the numerical analysis techniques of a spatial time series model. For illustration, the data set of mumps cases reported from the Korea Center for Disease Control and Prevention monthly over the years 2001~2009 are selected for analysis.

A Machine Learning Model for Predicting Silica Concentrations through Time Series Analysis of Mining Data (광업 데이터의 시계열 분석을 통해 실리카 농도를 예측하기 위한 머신러닝 모델)

  • Lee, Seung Hoon;Yoon, Yeon Ah;Jung, Jin Hyeong;Sim, Hyun su;Chang, Tai-Woo;Kim, Yong Soo
    • Journal of Korean Society for Quality Management
    • /
    • v.48 no.3
    • /
    • pp.511-520
    • /
    • 2020
  • Purpose: The purpose of this study was to devise an accurate machine learning model for predicting silica concentrations following the addition of impurities, through time series analysis of mining data. Methods: The mining data were preprocessed and subjected to time series analysis using the machine learning model. Through correlation analysis, valid variables were selected and meaningless variables were excluded. To reflect changes over time, dependent variables at baseline were treated as independent variables at later time points. The relationship between independent variables and the dependent variable after n point was subjected to Pearson correlation analysis. Results: The correlation (R2) was strongest after 3 hours, which was adopted as a dependent variable. According to root mean square error (RMSE) data, the proposed method was superior to the other machine learning methods. The XGboost algorithm showed the best predictive performance. Conclusion: This study is important given the current lack of machine learning studies pertaining to the domestic mining industry. In addition, using time series analysis in mining data will show further improvement. Before establishing a predictive model for the proposed method, predictions should be made using data with time series characteristics. After doing this work, it should also improve prediction accuracy in other domains.

The Evaluation of the Annual Time Series Data for the Mean Sea Level of the West Coast by Regression Model (회귀모형에 의한 서해안 평균해면의 연시계열자료의 평가)

  • 조기태;박영기;이장춘
    • Journal of Environmental Science International
    • /
    • v.9 no.1
    • /
    • pp.19-25
    • /
    • 2000
  • As the tideland reclamation is done on a large scale these days, construction work is active in the coastal areas. Facilities in the coastal areas must be built with the tide characteristics taken into consideration. Thus the tide characteristics affect the overall reclamation plan. The analysis of the tide data boils down to a harmonic analysis of the hourly changes of long-term tide data and extraction of unharmonic coefficients from the results. Since considerable amount of tide data of the West Coast are available, the existing data can be collected and can be used to obtain the temporal changes of the tide by being fitted into the tide prediction model. The goal of this thesis lies in assessing whether the mean sea level used in the field agrees with the analysis results from the long-term observation data obtained with their homogeneity guaranteed. To achieve this goal, the research was conducted as follows. First the present conditions of the observation stations, the land level standard, and the sea level standard were analyzed to set up a time series model formula for representing them. To secure the homogeneity of the time series, each component was separated. Lastly the mean sea level used in the field was assessed based on the results obtained form the analysis of the time series.

  • PDF

A model of predicting performance of Olympic female weightlifters using time series analysis

  • Won, Jin-hee;Cho, In-ho
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.3
    • /
    • pp.216-222
    • /
    • 2020
  • The purpose of this study was to predict the performance of female weightlifters using time series analysis. Based on this purpose, a time series analysis was used to calculate the performance prediction model for women(58kg) among the domestic women weightlifters who participated in the Olympics. As a result of creating time series data based on 10 years of record and then evaluating the sequential charts of each athlete group, the female athletes' records did not show any seasonality or difference. In addition, after examining the independence of the data through the creation of a time series model, it was shown that the models produced conformed to the criteria for compliance and that there was no difference in the data, but there was a trend. Accordingly, Holt linear trend analysis of the exponential smoothing model was applied. As a result of deriving the prediction model of the athletes through this process, it was found that the women (58kg) who participated in the Olympics continued to improve within the range of 166.11kg to 184.1kg.

Temporal Fusion Transformers and Deep Learning Methods for Multi-Horizon Time Series Forecasting (Temporal Fusion Transformers와 심층 학습 방법을 사용한 다층 수평 시계열 데이터 분석)

  • Kim, InKyung;Kim, DaeHee;Lee, Jaekoo
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.2
    • /
    • pp.81-86
    • /
    • 2022
  • Given that time series are used in various fields, such as finance, IoT, and manufacturing, data analytical methods for accurate time-series forecasting can serve to increase operational efficiency. Among time-series analysis methods, multi-horizon forecasting provides a better understanding of data because it can extract meaningful statistics and other characteristics of the entire time-series. Furthermore, time-series data with exogenous information can be accurately predicted by using multi-horizon forecasting methods. However, traditional deep learning-based models for time-series do not account for the heterogeneity of inputs. We proposed an improved time-series predicting method, called the temporal fusion transformer method, which combines multi-horizon forecasting with interpretable insights into temporal dynamics. Various real-world data such as stock prices, fine dust concentrates and electricity consumption were considered in experiments. Experimental results showed that our temporal fusion transformer method has better time-series forecasting performance than existing models.

Comparison of long-term forecasting performance of export growth rate using time series analysis models and machine learning analysis (시계열 분석 모형 및 머신 러닝 분석을 이용한 수출 증가율 장기예측 성능 비교)

  • Seong-Hwi Nam
    • Korea Trade Review
    • /
    • v.46 no.6
    • /
    • pp.191-209
    • /
    • 2021
  • In this paper, various time series analysis models and machine learning models are presented for long-term prediction of export growth rate, and the prediction performance is compared and reviewed by RMSE and MAE. Export growth rate is one of the major economic indicators to evaluate the economic status. And It is also used to predict economic forecast. The export growth rate may have a negative (-) value as well as a positive (+) value. Therefore, Instead of using the ReLU function, which is often used for time series prediction of deep learning models, the PReLU function, which can have a negative (-) value as an output value, was used as the activation function of deep learning models. The time series prediction performance of each model for three types of data was compared and reviewed. The forecast data of long-term prediction of export growth rate was deduced by three forecast methods such as a fixed forecast method, a recursive forecast method and a rolling forecast method. As a result of the forecast, the traditional time series analysis model, ARDL, showed excellent performance, but as the time period of learning data increases, the performance of machine learning models including LSTM was relatively improved.

Fused Fuzzy Logic System for Corrupted Time Series Data Analysis (훼손된 시계열 데이터 분석을 위한 퍼지 시스템 융합 연구)

  • Kim, Dong Won
    • Journal of Internet of Things and Convergence
    • /
    • v.4 no.1
    • /
    • pp.1-5
    • /
    • 2018
  • This paper is concerned with the modeling and identification of time series data corrupted by noise. As modeling techniques, nonsingleton fuzzy logic system (NFLS) is employed for the modeling of corrupted time series. Main characteristic of the NFLS is a fuzzy system whose inputs are modeled as fuzzy number. So the NFLS is especially useful in cases where the available training data or the input data to the fuzzy logic system are corrupted by noise. Simulation results of the Mackey-Glass time series data will be demonstrated to show the performance of the modeling methods. As a result, NFLS does a much better job of modeling noisy time series data than does a traditional Mamdani FLS.

A Biclustering Method for Time Series Analysis

  • Lee, Jeong-Hwa;Lee, Young-Rok;Jun, Chi-Hyuck
    • Industrial Engineering and Management Systems
    • /
    • v.9 no.2
    • /
    • pp.131-140
    • /
    • 2010
  • Biclustering is a method of finding meaningful subsets of objects and attributes simultaneously, which may not be detected by traditional clustering methods. It is popularly used for the analysis of microarray data representing the expression levels of genes by conditions. Usually, biclustering algorithms do not consider a sequential relation between attributes. For time series data, however, bicluster solutions should keep the time sequence. This paper proposes a new biclustering algorithm for time series data by modifying the plaid model. The proposed algorithm introduces a parameter controlling an interval between two selected time points. Also, the pruning step preventing an over-fitting problem is modified so as to eliminate only starting or ending points. Results from artificial data sets show that the proposed method is more suitable for the extraction of biclusters from time series data sets. Moreover, by using the proposed method, we find some interesting observations from real-world time-course microarray data sets and apartment price data sets in metropolitan areas.

Correlation analysis and time series analysis of Ground-water inflow rate into tunnel of Seoul subway system

  • 김성준;이강근;염병우
    • Proceedings of the Korean Society of Soil and Groundwater Environment Conference
    • /
    • 2003.09a
    • /
    • pp.254-257
    • /
    • 2003
  • Statistical analysis is performed to estimate the correlations between geological or geographical factor and groundwater inflow rates in the Seoul subway system. Correlation analysis shows that among several geological and geographical factors fractures and streams have most strong effects on inflow rate into tunnels. In particular, subway line 5∼8 are affected more by these factors than subway line 1∼4. Time series analysis is carried out to forecast groundwater inflow rate. Time series analysis is a useful empirical method for simulation and forecasts in case that physical model can not be applied to. The time series of groundwater inflow rates is calculated using the observation data. Transfer function-noise model is applied with the precipitation data as input variables. For time series analysis, statistical methods are performed to identify proper model and autoregressive-moving average models are applied to evaluation of inflow rate. Each model is identified to satisfy the lowest value of information criteria. Results show that the values by result equations are well fitted with the actual inflow rate values. The selected models could give a good explanation of inflow rates variation into subway tunnels.

  • PDF