• Title/Abstract/Keywords: time series data analysis

Search results: 1,829 items (processing time: 0.034 s)

Exploratory Data Analysis for Korean Stock Data with Recurrence Plots

  • 장대흥
    • 응용통계연구 / Vol. 26, No. 5 / pp.807-819 / 2013
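  • Recurrence plots can serve as a graphical exploratory data analysis method before confirmatory time series analysis. A recurrence plot reveals the structural patterns of a time series, and these patterns make it possible to identify structural change points in the data at a glance. Using Korean stock data, we show that recurrence plots are useful as a graphical exploratory data analysis method for time series data.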
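As a rough illustration of the idea in this abstract, the sketch below builds a thresholded recurrence matrix for a short series. It is a simplified form that compares raw values without the time-delay embedding often used with recurrence plots. The function names, the threshold `eps`, and the sample values are assumptions made for illustration, not taken from the paper.

```python
def recurrence_matrix(series, eps):
    """Return R where R[i][j] = 1 if |series[i] - series[j]| <= eps, else 0."""
    n = len(series)
    return [[1 if abs(series[i] - series[j]) <= eps else 0 for j in range(n)]
            for i in range(n)]


def render(matrix):
    """Draw the matrix as text: '#' marks a recurrence, '.' marks none."""
    return "\n".join("".join("#" if v else "." for v in row) for row in matrix)


# Hypothetical series with a level shift after the fourth observation.
prices = [10.0, 10.2, 9.9, 10.1, 15.0, 15.3, 14.9, 15.1]
R = recurrence_matrix(prices, eps=0.5)
print(render(R))
```

In this toy series the two regimes appear as two separate blocks along the diagonal, which is the kind of visual cue for a structural change point that the abstract describes.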

Bayes Inference for the Spatial Bilinear Time Series Model with Application to Epidemic Data

  • Lee, Sung-Duck;Kim, Duk-Ki
    • 응용통계연구 / Vol. 25, No. 4 / pp.641-650 / 2012
  • Spatial time series data can be viewed as a set of time series collected simultaneously at a number of spatial locations. This paper studies Bayesian inference for a spatial bilinear time series model, using a Gibbs sampling algorithm to overcome problems with numerical analysis techniques for spatial time series models. For illustration, monthly mumps cases reported by the Korea Center for Disease Control and Prevention over 2001-2009 are analyzed.

A Machine Learning Model for Predicting Silica Concentrations through Time Series Analysis of Mining Data

  • 이승훈;윤연아;정진형;심현수;장태우;김용수
    • 품질경영학회지 / Vol. 48, No. 3 / pp.511-520 / 2020
  • Purpose: The purpose of this study was to devise an accurate machine learning model for predicting silica concentrations following the addition of impurities, through time series analysis of mining data. Methods: The mining data were preprocessed and analyzed as a time series using machine learning models. Through correlation analysis, valid variables were selected and uninformative variables were excluded. To reflect changes over time, dependent variables at baseline were treated as independent variables at later time points. The relationship between the independent variables and the dependent variable n time points later was examined by Pearson correlation analysis. Results: The correlation (R²) was strongest at 3 hours, so the value 3 hours later was adopted as the dependent variable. In terms of root mean square error (RMSE), the proposed method was superior to the other machine learning methods, and the XGBoost algorithm showed the best predictive performance. Conclusion: This study is important given the current lack of machine learning studies pertaining to the domestic mining industry. Applying time series analysis to mining data is expected to yield further improvement. Before a predictive model is built with the proposed method, predictions should be made using data that have time series characteristics. This step should also improve prediction accuracy in other domains.
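The lag-selection step mentioned in this abstract (correlating a variable with the target n time points later and keeping the strongest lag) can be sketched as follows. This is a minimal illustration using only the standard library. The function names and the toy data are hypothetical and do not come from the paper, which reports results on real mining data.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def lagged_correlation(feature, target, lag):
    """Correlate feature[t] with target[t + lag]."""
    if lag == 0:
        return pearson(feature, target)
    return pearson(feature[:-lag], target[lag:])


def best_lag(feature, target, max_lag):
    """Return the lag in 0..max_lag with the largest absolute correlation."""
    return max(range(max_lag + 1),
               key=lambda k: abs(lagged_correlation(feature, target, k)))


# Toy data (invented): the target follows the feature with a delay of 3 steps.
feature = [1, 4, 2, 8, 5, 7, 3, 9, 6, 10, 2, 5]
target = [0, 0, 0] + [2 * v + 1 for v in feature[:-3]]
print(best_lag(feature, target, max_lag=5))
```

Once a lag is chosen, the target shifted by that lag can be used as the dependent variable for any downstream regressor.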

The Evaluation of the Annual Time Series Data for the Mean Sea Level of the West Coast by Regression Model

  • 조기태;박영기;이장춘
    • 한국환경과학회지 / Vol. 9, No. 1 / pp.19-25 / 2000
  • Large-scale tideland reclamation has made construction work active in coastal areas. Coastal facilities must be built with tidal characteristics taken into account, so these characteristics affect the overall reclamation plan. Tide data analysis comes down to a harmonic analysis of the hourly changes in long-term tide data and the extraction of non-harmonic constants from the results. Because a considerable amount of tide data is available for the West Coast, existing data can be collected and fitted to a tide prediction model to obtain the temporal changes of the tide. The goal of this study is to assess whether the mean sea level used in the field agrees with the results of analyzing long-term observation data whose homogeneity has been secured. To this end, the research proceeded as follows. First, the present conditions of the observation stations, the land level standard, and the sea level standard were analyzed to set up a time series model formula representing them. Next, each component was separated to secure the homogeneity of the time series. Lastly, the mean sea level used in the field was assessed based on the results of the time series analysis.

A model of predicting performance of Olympic female weightlifters using time series analysis

  • Won, Jin-hee;Cho, In-ho
    • International Journal of Advanced Culture Technology / Vol. 8, No. 3 / pp.216-222 / 2020
  • The purpose of this study was to predict the performance of female weightlifters using time series analysis. To this end, a performance prediction model was built for the women's 58 kg class among the domestic female weightlifters who participated in the Olympics. Time series data were created from 10 years of records, and the sequence charts of each athlete group were evaluated; the female athletes' records did not show any seasonality or difference. In addition, after the independence of the data was examined by fitting a time series model, the fitted models met the fit criteria, and the data showed no difference but did show a trend. Accordingly, Holt's linear trend method from the exponential smoothing family was applied. The resulting prediction model indicated that the women (58 kg) who participated in the Olympics continued to improve within the range of 166.11 kg to 184.1 kg.
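For readers unfamiliar with Holt's linear trend method named in this abstract, the sketch below shows the standard level and trend update equations. The smoothing parameters, the initialization (first value as level, first difference as trend), and the sample records are assumptions chosen for illustration; the paper's fitted parameters and athlete data are not reproduced here.

```python
def holt_linear(series, alpha, beta, horizon):
    """Forecast `horizon` steps ahead with Holt's linear trend method.

    alpha smooths the level and beta smooths the trend (both in [0, 1]).
    """
    level = series[0]
    trend = series[1] - series[0]
    for y in series[1:]:
        prev_level = level
        # New level: blend the observation with the previous one-step forecast.
        level = alpha * y + (1 - alpha) * (level + trend)
        # New trend: blend the latest level change with the previous trend.
        trend = beta * (level - prev_level) + (1 - beta) * trend
    return [level + h * trend for h in range(1, horizon + 1)]


# Hypothetical yearly best totals in kg (invented numbers).
records = [166.0, 168.5, 170.0, 173.0, 175.5, 177.0]
print(holt_linear(records, alpha=0.5, beta=0.3, horizon=3))
```

With a trend but no seasonality, as the abstract reports, this two-equation form is sufficient and no seasonal component is needed.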

Temporal Fusion Transformers and Deep Learning Methods for Multi-Horizon Time Series Forecasting

  • 김인경;김대희;이재구
    • 정보처리학회논문지:소프트웨어 및 데이터공학 / Vol. 11, No. 2 / pp.81-86 / 2022
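  • Time series data are collected and used in many real-world settings such as stocks, IoT, and factory automation, and accurate time series forecasting can raise operational efficiency in these fields, so it has traditionally been an important research topic. Multi-horizon forecasting, a representative time series analysis method that can extract improved features from the overall time series, has recently improved forecasts by also exploiting the heterogeneity inherent in time series data that include auxiliary information. However, most deep learning based time series models have not reflected this heterogeneity. We therefore applied the well-known temporal fusion transformers method to multi-horizon forecasting that accounts for heterogeneity, using real data closely tied to everyday life. As a result, we confirmed that the method, applied to real-world time series such as stock prices, fine dust, and electricity consumption, achieves higher accuracy than existing forecasting models.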

Comparison of long-term forecasting performance of export growth rate using time series analysis models and machine learning analysis

  • 남성휘
    • 무역학회지 / Vol. 46, No. 6 / pp.191-209 / 2021
  • In this paper, various time series analysis models and machine learning models are presented for long-term prediction of the export growth rate, and their prediction performance is compared using RMSE and MAE. The export growth rate is one of the major economic indicators for evaluating economic status, and it is also used in economic forecasting. The export growth rate can take negative (-) as well as positive (+) values. Therefore, instead of the ReLU function, which is often used for time series prediction with deep learning models, the PReLU function, which can output negative (-) values, was used as the activation function. The time series prediction performance of each model was compared on three types of data. Long-term forecasts of the export growth rate were produced by three forecast methods: a fixed forecast method, a recursive forecast method, and a rolling forecast method. The traditional time series analysis model, ARDL, showed excellent performance, but as the time span of the training data increased, the performance of machine learning models including LSTM improved in relative terms.
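The reason this abstract gives for preferring PReLU over ReLU can be seen in a few lines. The sketch below uses the standard definitions of both activations. The slope `a = 0.25` is a common initial value and is an assumption here; in a real network it is a learned parameter, and the paper does not state the value it obtained. The sample growth rates are invented.

```python
def relu(x):
    """ReLU: negative inputs are clipped to zero."""
    return max(0.0, x)


def prelu(x, a=0.25):
    """PReLU: negative inputs are scaled by the slope a instead of clipped."""
    return x if x > 0 else a * x


# Invented growth rates in percent, including negative values.
growth = [-8.0, -2.0, 0.0, 3.5]
print([relu(g) for g in growth])
print([prelu(g) for g in growth])
```

Because ReLU maps every negative input to zero, a ReLU output cannot express a negative growth rate, while PReLU keeps the sign.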

Fused Fuzzy Logic System for Corrupted Time Series Data Analysis

  • 김동원
    • 사물인터넷융복합논문지 / Vol. 4, No. 1 / pp.1-5 / 2018
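  • This paper addresses the modeling of time series data corrupted by noise. A non-singleton fuzzy system is used as the modeling technique. The key feature of a non-singleton fuzzy system is that the inputs of the unknown nonlinear system are modeled as fuzzy values. It is therefore very useful when the training data or input data fed to the fuzzy system have been distorted by noise or the external environment. For performance comparison, the well-known Mackey-Glass benchmark data are used. By comparing and analyzing the modeling results on these data, this paper shows that the non-singleton fuzzy system is more robust and efficient against noise.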
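The Mackey-Glass benchmark mentioned in this abstract comes from the delay differential equation dx/dt = beta * x(t - tau) / (1 + x(t - tau)^n) - gamma * x(t). The sketch below generates a series with a unit-step Euler discretization and then adds Gaussian noise to mimic a corrupted input. The parameter values (tau = 17, beta = 0.2, gamma = 0.1, n = 10) are the ones commonly used for this benchmark; the initial value, noise level, seed, and function names are assumptions, and the paper's exact setup is not given in the abstract.

```python
import random


def mackey_glass(length, tau=17, beta=0.2, gamma=0.1, n=10, x0=1.2):
    """Generate a Mackey-Glass series with an Euler step of 1 time unit."""
    history = [x0] * (tau + 1)  # constant initial history x(t) = x0 for t <= 0
    for _ in range(length):
        x_t = history[-1]          # current value x(t)
        x_tau = history[-1 - tau]  # delayed value x(t - tau)
        history.append(x_t + beta * x_tau / (1 + x_tau ** n) - gamma * x_t)
    return history[tau + 1:]


def corrupt(series, sigma, seed=0):
    """Add zero-mean Gaussian noise with standard deviation sigma."""
    rng = random.Random(seed)
    return [v + rng.gauss(0.0, sigma) for v in series]


clean = mackey_glass(200)
noisy = corrupt(clean, sigma=0.05)
print(clean[:3], noisy[:3])
```

A model can then be trained on `noisy` and scored against `clean` to compare robustness to noise across methods.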

A Biclustering Method for Time Series Analysis

  • Lee, Jeong-Hwa;Lee, Young-Rok;Jun, Chi-Hyuck
    • Industrial Engineering and Management Systems / Vol. 9, No. 2 / pp.131-140 / 2010
  • Biclustering is a method of finding meaningful subsets of objects and attributes simultaneously, which may not be detected by traditional clustering methods. It is widely used for the analysis of microarray data representing the expression levels of genes by conditions. Usually, biclustering algorithms do not consider a sequential relation between attributes. For time series data, however, bicluster solutions should preserve the time sequence. This paper proposes a new biclustering algorithm for time series data by modifying the plaid model. The proposed algorithm introduces a parameter controlling the interval between two selected time points. The pruning step that prevents over-fitting is also modified so that it eliminates only starting or ending points. Results from artificial data sets show that the proposed method is more suitable for extracting biclusters from time series data sets. Moreover, using the proposed method, we find some interesting observations in real-world time-course microarray data sets and apartment price data sets from metropolitan areas.

Correlation analysis and time series analysis of Ground-water inflow rate into tunnel of Seoul subway system

  • 김성준;이강근;염병우
    • 한국지하수토양환경학회: Conference Proceedings / 한국지하수토양환경학회 2003 Fall Conference / pp.254-257 / 2003
  • Statistical analysis is performed to estimate the correlations between geological or geographical factors and groundwater inflow rates in the Seoul subway system. Correlation analysis shows that, among several geological and geographical factors, fractures and streams have the strongest effects on the inflow rate into tunnels. In particular, subway lines 5-8 are affected more by these factors than subway lines 1-4. Time series analysis is carried out to forecast the groundwater inflow rate; it is a useful empirical method for simulation and forecasting when a physical model cannot be applied. The time series of groundwater inflow rates is calculated from the observation data. A transfer function-noise model is applied with precipitation data as the input variable. Statistical methods are used to identify a proper model, and autoregressive-moving average models are applied to evaluate the inflow rate. Each model is selected to minimize the information criterion. Results show that the values from the fitted equations agree well with the actual inflow rates. The selected models explain the variation of inflow rates into subway tunnels well.
