• Title/Summary/Keyword: Time Series Prediction Model

Search Result 583, Processing Time 0.024 seconds

Estimation Method of Predicted Time Series Data Based on Absolute Maximum Value (최대 절대값 기반 시계열 데이터 예측 모델 평가 기법)

  • Shin, Ki-Hoon;Kim, Chul;Nam, Sang-Hun;Park, Sung-Jae;Yoo, Sung-Soo
    • Journal of Energy Engineering
    • /
    • v.27 no.4
    • /
    • pp.103-110
    • /
    • 2018
  • In this paper, we introduce evaluation method of time series prediction model with new approach of Mean Absolute Percentage Error(hereafter MAPE) and Symmetric Mean Absolute Percentage Error(hereafter sMAPE). There are some problems using MAPE and sMAPE. First MAPE can't evaluate Zero observation of dataset. Moreover, when the observed value is very close to zero it evaluate heavier than other methods. Finally it evaluate different measure even same error between observations and predicted values. And sMAPE does different evaluations are made depending on whether the same error value is over-predicted or under-predicted. And it has different measurement according to the each sign, even if error is the same distance. These problems were solved by Maximum Mean Absolute Percentage Error(hereafter mMAPE). we used the absolute maximum of observed value as denominator instead of the observed value in MAPE, when the value is less than 1, removed denominator then solved the problem that the zero value is not defined. and were able to prevent heavier measurement problem. Also, if the absolute maximum of observed value is greater than 1, the evaluation values of mMAPE were compared with those of the other evaluations. With Beijing PM2.5 temperature data and our simulation data, we compared the evaluation values of mMAPE with other evaluations. And we proved that mMAPE can solve the problems that we mentioned.

Development of Deep-Learning-Based Models for Predicting Groundwater Levels in the Middle-Jeju Watershed, Jeju Island (딥러닝 기법을 이용한 제주도 중제주수역 지하수위 예측 모델개발)

  • Park, Jaesung;Jeong, Jiho;Jeong, Jina;Kim, Ki-Hong;Shin, Jaehyeon;Lee, Dongyeop;Jeong, Saebom
    • The Journal of Engineering Geology
    • /
    • v.32 no.4
    • /
    • pp.697-723
    • /
    • 2022
  • Data-driven models to predict groundwater levels 30 days in advance were developed for 12 groundwater monitoring stations in the middle-Jeju watershed, Jeju Island. Stacked long short-term memory (stacked-LSTM), a deep learning technique suitable for time series forecasting, was used for model development. Daily time series data from 2001 to 2022 for precipitation, groundwater usage amount, and groundwater level were considered. Various models were proposed that used different combinations of the input data types and varying lengths of previous time series data for each input variable. A general procedure for deep-learning-based model development is suggested based on consideration of the comparative validation results of the tested models. A model using precipitation, groundwater usage amount, and previous groundwater level data as input variables outperformed any model neglecting one or more of these data categories. Using extended sequences of these past data improved the predictions, possibly owing to the long delay time between precipitation and groundwater recharge, which results from the deep groundwater level in Jeju Island. However, limiting the range of considered groundwater usage data that significantly affected the groundwater level fluctuation (rather than using all the groundwater usage data) improved the performance of the predictive model. The developed models can predict the future groundwater level based on the current amount of precipitation and groundwater use. Therefore, the models provide information on the soundness of the aquifer system, which will help to prepare management plans to maintain appropriate groundwater quantities.

Drought Analysis and Assessment by Using Land Surface Model on South Korea (지표수문해석모형을 활용한 국내 가뭄해석 적용성 평가)

  • Son, Kyung-Hwan;Bae, Deg-Hyo;Chung, Jun-Seok
    • Journal of Korea Water Resources Association
    • /
    • v.44 no.8
    • /
    • pp.667-681
    • /
    • 2011
  • The objective of this study is to evaluate the applicability of a Land Surface Model (LSM) for drought analysis in Korea. For evaluating the applicability of the model, the model was calibrated on several upper dam site watersheds and the hydrological components (runoff and soil moisture) were simulated over the whole South Korea at grid basis. After converting daily series of runoff and soil moisture data to accumulated time series (3, 6, 12 months), drought indices such as SRI and SSI are calculated through frequency analysis and standardization of accumulated probability. For evaluating the drought indices, past drought events are investigated and drought indices including SPI and PDSI are used for comparative analysis. Temporal and spatial analysis of the drought indices in addition to hydrologic component analysis are performed to evaluate the reproducibility of drought severity as well as relieving of drought. It can be concluded that the proposed indices obtained from the LSM model show good performance to reflect the historical drought events for both spatially and temporally. From this point of view, the LSM can be useful for drought management. It leads to the conclusion that these indices are applicable to domestic drought and water management.

Forecasting of Iron Ore Prices using Machine Learning (머신러닝을 이용한 철광석 가격 예측에 대한 연구)

  • Lee, Woo Chang;Kim, Yang Sok;Kim, Jung Min;Lee, Choong Kwon
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.25 no.2
    • /
    • pp.57-72
    • /
    • 2020
  • The price of iron ore has continued to fluctuate with high demand and supply from many countries and companies. In this business environment, forecasting the price of iron ore has become important. This study developed the machine learning model forecasting the price of iron ore a one month after the trading events. The forecasting model used distributed lag model and deep learning models such as MLP (Multi-layer perceptron), RNN (Recurrent neural network) and LSTM (Long short-term memory). According to the results of comparing individual models through metrics, LSTM showed the lowest predictive error. Also, as a result of comparing the models using the ensemble technique, the distributed lag and LSTM ensemble model showed the lowest prediction.

Prediction of Optimal Production Level for Maximizing Total Profit in Miryang Sesame Leaf Cultivation (밀양 깻잎 농업의 총소득 극대화를 위한 적정 생산 규모 전망)

  • Cho, Jae-Hwan;Chung, Wonho
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.22 no.1
    • /
    • pp.313-320
    • /
    • 2021
  • This study develops a demand and supply model and price model for Miryang sesame leaf cultivation and predicts the optimal production level to maximize total profit for Miryang sesame leaf farms. We used time series data from 1996 to 2017, which are related to Miryang sesame leaf cultivation. For the analysis, we estimated the demand function and average cost function, calculated the optimal production level and price, and derived the optimal profit. In addition, we predicted the optimal production level, price, total revenue, total cost, and profit until the year 2030 through scenario analysis. The results show that the optimal production level until the year 2030 is between 10 and 12.5 thousand tons, while the production volume was 7 thousand tons in 2017, and total profit for Miryang sesame leaf farms is estimated at 13.3 to 21.3 billion Korean won in 2030. The producer group needs to maintain the optimal production level to maximize total profit for farmers, as suggested in this study.

Near Infrared Spectroscopy for Diagnosis: Influence of Mammary Gland Inflammation on Cow´s Milk Composition Measurement

  • Roumiana Tsenkova;Stefka Atanassova;Kiyohiko Toyoda
    • Near Infrared Analysis
    • /
    • v.2 no.1
    • /
    • pp.59-66
    • /
    • 2001
  • Nowadays, medical diagnostics is efficiently supported by clinical chemistry and near infrared spectroscopy is becoming a new dimension, which has shown high potential to provide valuable information for diagnosis. The investigation was carried out to study the influence of mammary gland inflammation, called mastitis, on cow´s milk spectra and milk composition measured by near infrared spectroscopy (NIRS). Milk somatic cell counts (SCC) in milk were used as a measure of mammary gland inflammation. Naturally occurred variations with milk composition within lactation and in the process of milking were included in the experimental design of this study. Time series of unhomogenized, raw milk spectral data were collected from 3 cow along morning and evening milking, for 5 consecutive months, within their second lactation. In the time of the trial, the investigated cows had periods with mammary gland inflammation. Transmittance spectra of 258 milk samples were obtained by NIRSystem 6500 spectrophotometer in 1100-2400 nm region. Calibration equations for the examined milk components were developed by PLS regression using 3 different sets of samples: samples with low somatic cell count (SCC), samples with high SCC and combined data set. The NIR calibration and prediction of individual cow´s milk fat, protein, and lactose were highly influenced by the presence of mil samples from animals with mammary gland inflammation in the data set. The best accuracy of prediction (i.e. the lower SEP and the higher correlation coefficient) for fat, protein and lactose was obtained for equations, developed when using only “healthy” samples, with low SCC. The standard error of prediction increased and correlation coefficient decreased significantly when equations for low SCC milk were used to predict examined components in “mastitis” samples with high SCC, and vice versa. Combined data set that included samples from healthy and mastitis animals could be used to build up regression models for screening. Further use of separate model for healthy samples improved milk composition measurement. Regression vectors for NIR mild protein measurement obtained for “healthy” and “mastitic” group were compared and revealed differences in 1390-1450 nm, 1500-1740 nm and 1900-2200 nm regions and thus illustrated post-secretory breakdown of milk proteins by hydrolytic enzymes that occurred with mastitis. For the first time it has been found that monitoring the spectral differences in water bands at 1440 nm and 1912 nm could provide valuable information for inflammation diagnosis.

A Study on the Rainfall-Runoff Analysis of Using Satellite Image (위성영상정보를 이용한 강우유출 해석에 관한 연구)

  • Park, Young-Kee;Lee, Jeung-Seok;Park, Jeong-Gyu
    • Journal of Environmental Science International
    • /
    • v.19 no.1
    • /
    • pp.115-124
    • /
    • 2010
  • Urban watershed can be found in the visible changes in technology, the most realistic satellite images is to use the data. Satellite image data on the indicators for progress on the nature of the change of land use is consistent and repetitive information, regular observation makes possible the detailed analysis of space-time. These remote sensing techniques and the type of course and, by using the time series history, the past, the dynamic model and the randomized prediction methodology for the conversion process if the city and river basin cooperation of the space changes effectively will be able to extrapolate. For each of the main changes in river flow, depending on the area of urbanization as determined according to reproduce the duration of the relationship between the urbanization of the area and runoff can be represented as a linear polynomial expression was, if a linear expression in the two fast slew rate of 0.858 to 0.861 showed up, and fast slew rate of 0.934 to 0.974 for the polynomial are reported. Change of land use changes in the watershed of the flow is one of the most affecting elements. Therefore, changes in land use of the correct classification of rivers is a more accurate calculation of the amount of the floodgate. In particular, using the Landsat images through the image of the land use category, land use past data and calculated using the Markov Chain model and predict the future land use plan in the water control project will be used for large likely.

A study on stock price prediction system based on text mining method using LSTM and stock market news (LSTM과 증시 뉴스를 활용한 텍스트 마이닝 기법 기반 주가 예측시스템 연구)

  • Hong, Sunghyuck
    • Journal of Digital Convergence
    • /
    • v.18 no.7
    • /
    • pp.223-228
    • /
    • 2020
  • The stock price reflects people's psychology, and factors affecting the entire stock market include economic growth rate, economic rate, interest rate, trade balance, exchange rate, and currency. The domestic stock market is heavily influenced by the stock index of the United States and neighboring countries on the previous day, and the representative stock indexes are the Dow index, NASDAQ, and S & P500. Recently, research on stock price analysis using stock news has been actively conducted, and research is underway to predict the future based on past time series data through artificial intelligence-based analysis. However, even if the stock market is hit for a short period of time by the forecasting system, the market will no longer move according to the short-term strategy, and it will have to change anew. Therefore, this model monitored Samsung Electronics' stock data and news information through text mining, and presented a predictable model by showing the analyzed results.

Dynamic Glide Path using Retirement Target Date and Forecast Volatility (은퇴 시점과 예측 변동성을 고려한 동적 Glide Path)

  • Kim, Sun Woong
    • Journal of Convergence for Information Technology
    • /
    • v.11 no.2
    • /
    • pp.82-89
    • /
    • 2021
  • The objective of this study is to propose a new Glide Path that dynamically adjusts the risky asset inclusion ratio of the Target Date Fund by simultaneously considering the market's forecast volatility as well as the time of investor retirement, and to compare the investment performance with the traditional Target Date Fund. Forecasts of market volatility utilize historical volatility, time series model GARCH volatility, and the volatility index VKOSPI. The investment performance of the new dynamic Glide Path, which considers stock market volatility has been shown to be excellent during the analysis period from 2003 to 2020. In all three volatility prediction models, Sharpe Ratio, an investment performance indicator, is improved with higher returns and lower risks than traditional static Glide Path, which considers only retirement date. The empirical results of this study present the potential for the utilization of the suggested Glide Path in the Target Date Fund management industry as well as retirees.

A Study on the prediction of Advertising Expenditure (계량적 통계분석을 통한 매체별 광고비 예측 연구)

  • Han, Sangpil;Yu, Seung Yeob
    • Journal of Digital Convergence
    • /
    • v.12 no.9
    • /
    • pp.111-121
    • /
    • 2014
  • This study is designed to predict the total ad expenditure of Korea, and six media ad expenditures in 5 years based on the past 20 years ad expenditure date. We use annual data published by Cheil Worldwide advertising data analysis. Time series, SUR method, exponential smoothing method and regression analysis were used for exploring the data. The results showed that the total advertising expenditure in 2018 is predicted to 10,873 billion wons. On the basis of the findings, implications are discussed for academicians as well as practitioners.