• Title/Summary/Keyword: Stock Price Forecasting

Search Result 88, Processing Time 0.03 seconds

Forecasting realized volatility using data normalization and recurrent neural network

  • Yoonjoo Lee;Dong Wan Shin;Ji Eun Choi
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.1
    • /
    • pp.105-127
    • /
    • 2024
  • We propose recurrent neural network (RNN) methods for forecasting realized volatility (RV). The data are RVs of ten major stock price indices, four from the US, and six from the EU. Forecasts are made for relative ratio of adjacent RVs instead of the RV itself in order to avoid the out-of-scale issue. Forecasts of RV ratios distribution are first constructed from which those of RVs are computed which are shown to be better than forecasts constructed directly from RV. The apparent asymmetry of RV ratio is addressed by the Piecewise Min-max (PM) normalization. The serial dependence of the ratio data renders us to consider two architectures, long short-term memory (LSTM) and gated recurrent unit (GRU). The hyperparameters of LSTM and GRU are tuned by the nested cross validation. The RNN forecast with the PM normalization and ratio transformation is shown to outperform other forecasts by other RNN models and by benchmarking models of the AR model, the support vector machine (SVM), the deep neural network (DNN), and the convolutional neural network (CNN).

An Accurate Cryptocurrency Price Forecasting using Reverse Walk-Forward Validation (역순 워크 포워드 검증을 이용한 암호화폐 가격 예측)

  • Ahn, Hyun;Jang, Baekcheol
    • Journal of Internet Computing and Services
    • /
    • v.23 no.4
    • /
    • pp.45-55
    • /
    • 2022
  • The size of the cryptocurrency market is growing. For example, market capitalization of bitcoin exceeded 500 trillion won. Accordingly, many studies have been conducted to predict the price of cryptocurrency, and most of them have similar methodology of predicting stock prices. However, unlike stock price predictions, machine learning become best model in cryptocurrency price predictions, conceptually cryptocurrency has no passive income from ownership, and statistically, cryptocurrency has at least three times higher liquidity than stocks. Thats why we argue that a methodology different from stock price prediction should be applied to cryptocurrency price prediction studies. We propose Reverse Walk-forward Validation (RWFV), which modifies Walk-forward Validation (WFV). Unlike WFV, RWFV measures accuracy for Validation by pinning the Validation dataset directly in front of the Test dataset in time series, and gradually increasing the size of the Training dataset in front of it in time series. Train data were cut according to the size of the Train dataset with the highest accuracy among all measured Validation accuracy, and then combined with Validation data to measure the accuracy of the Test data. Logistic regression analysis and Support Vector Machine (SVM) were used as the analysis model, and various algorithms and parameters such as L1, L2, rbf, and poly were applied for the reliability of our proposed RWFV. As a result, it was confirmed that all analysis models showed improved accuracy compared to existing studies, and on average, the accuracy increased by 1.23%p. This is a significant improvement in accuracy, given that most of the accuracy of cryptocurrency price prediction remains between 50% and 60% through previous studies.

The Effect of Data Size on the k-NN Predictability: Application to Samsung Electronics Stock Market Prediction (데이터 크기에 따른 k-NN의 예측력 연구: 삼성전자주가를 사례로)

  • Chun, Se-Hak
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.239-251
    • /
    • 2019
  • Statistical methods such as moving averages, Kalman filtering, exponential smoothing, regression analysis, and ARIMA (autoregressive integrated moving average) have been used for stock market predictions. However, these statistical methods have not produced superior performances. In recent years, machine learning techniques have been widely used in stock market predictions, including artificial neural network, SVM, and genetic algorithm. In particular, a case-based reasoning method, known as k-nearest neighbor is also widely used for stock price prediction. Case based reasoning retrieves several similar cases from previous cases when a new problem occurs, and combines the class labels of similar cases to create a classification for the new problem. However, case based reasoning has some problems. First, case based reasoning has a tendency to search for a fixed number of neighbors in the observation space and always selects the same number of neighbors rather than the best similar neighbors for the target case. So, case based reasoning may have to take into account more cases even when there are fewer cases applicable depending on the subject. Second, case based reasoning may select neighbors that are far away from the target case. Thus, case based reasoning does not guarantee an optimal pseudo-neighborhood for various target cases, and the predictability can be degraded due to a deviation from the desired similar neighbor. This paper examines how the size of learning data affects stock price predictability through k-nearest neighbor and compares the predictability of k-nearest neighbor with the random walk model according to the size of the learning data and the number of neighbors. In this study, Samsung electronics stock prices were predicted by dividing the learning dataset into two types. For the prediction of next day's closing price, we used four variables: opening value, daily high, daily low, and daily close. In the first experiment, data from January 1, 2000 to December 31, 2017 were used for the learning process. In the second experiment, data from January 1, 2015 to December 31, 2017 were used for the learning process. The test data is from January 1, 2018 to August 31, 2018 for both experiments. We compared the performance of k-NN with the random walk model using the two learning dataset. The mean absolute percentage error (MAPE) was 1.3497 for the random walk model and 1.3570 for the k-NN for the first experiment when the learning data was small. However, the mean absolute percentage error (MAPE) for the random walk model was 1.3497 and the k-NN was 1.2928 for the second experiment when the learning data was large. These results show that the prediction power when more learning data are used is higher than when less learning data are used. Also, this paper shows that k-NN generally produces a better predictive power than random walk model for larger learning datasets and does not when the learning dataset is relatively small. Future studies need to consider macroeconomic variables related to stock price forecasting including opening price, low price, high price, and closing price. Also, to produce better results, it is recommended that the k-nearest neighbor needs to find nearest neighbors using the second step filtering method considering fundamental economic variables as well as a sufficient amount of learning data.

Development of Outbound Tourism Forecasting Models in Korea

  • Yoon, Ji-Hwan;Lee, Jung Seung;Yoon, Kyung Seon
    • Journal of Information Technology Applications and Management
    • /
    • v.21 no.1
    • /
    • pp.177-184
    • /
    • 2014
  • This research analyzes the effects of factors on the demands for outbound to the countries such as Japan, China, the United States of America, Thailand, Philippines, Hong Kong, Singapore and Australia, the countries preferred by many Koreans. The factors for this research are (1) economic variables such as Korea Composite Stock Price Index (KOSPI), which could have influences on outbound tourism and exchange rate and (2) unpredictable events such as diseases, financial crisis and terrors. Regression analysis was used to identify relationship based on the monthly data from January 2001 to December 2010. The results of the analysis show that both exchange rate and KOSPI have impacts on the demands for outbound travel. In the case of travels to the United States of America and Philippines, Korean tourists usually have particular purposes such as studying, visiting relatives, playing golf or honeymoon, thus they are less influenced by the exchange rate. Moreover, Korean tourists tend not to visit particular locations for some time when shock reaction happens. As the demands for outbound travels are different from country to country accompanied by economic variables and shock variables, differentiated measure to should be considered to come close to the target numbers of tourists by switching as well as creating the demands. For further study we plan to build outbound tourism forecasting models using Artificial Neural Networks.

Deep learning forecasting for financial realized volatilities with aid of implied volatilities and internet search volumes (금융 실현변동성을 위한 내재변동성과 인터넷 검색량을 활용한 딥러닝)

  • Shin, Jiwon;Shin, Dong Wan
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.1
    • /
    • pp.93-104
    • /
    • 2022
  • In forecasting realized volatility of the major US stock price indexes (S&P 500, Russell 2000, DJIA, Nasdaq 100), internet search volume reflecting investor's interests and implied volatility are used to improve forecast via a deep learning method of the LSTM. The LSTM method combined with search volume index produces better forecasts than existing standard methods of the vector autoregressive (VAR) and the vector error correction (VEC) models. It also beats the recently proposed vector error correction heterogeneous autoregressive (VECHAR) model which takes advantage of the cointegration relation between realized volatility and implied volatility.

Volatility spillover between the Korean KOSPI and the Hong Kong HSI stock markets

  • Baek, Eun-Ah;Oh, Man-Suk
    • Communications for Statistical Applications and Methods
    • /
    • v.23 no.3
    • /
    • pp.203-213
    • /
    • 2016
  • We investigate volatility spillover aspects of realized volatilities (RVs) for the log returns of the Korea Composite Stock Price Index (KOSPI) and the Hang Seng Index (HSI) from 2009-2013. For all RVs, significant long memories and asymmetries are identified. For a model selection, we consider three commonly used time series models as well as three models that incorporate long memory and asymmetry. Taking into account of goodness-of-fit and forecasting ability, Leverage heteroskedastic autoregressive realized volatility (LHAR) model is selected for the given data. The LHAR model finds significant decompositions of the spillover effect from the HSI to the KOSPI into moderate negative daily spillover, positive weekly spillover and positive monthly spillover, and from the KOSPI to the HSI into substantial negative weekly spillover and positive monthly spillover. An interesting result from the analysis is that the daily volatility spillover from the HSI to the KOSPI is significant versus the insignificant daily volatility spillover of the KOSPI to HSI. The daily volatility in Hong Kong affects next day volatility in Korea but the daily volatility in Korea does not affect next day volatility in Hong Kong.

Classification Algorithm-based Prediction Performance of Order Imbalance Information on Short-Term Stock Price (분류 알고리즘 기반 주문 불균형 정보의 단기 주가 예측 성과)

  • Kim, S.W.
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.4
    • /
    • pp.157-177
    • /
    • 2022
  • Investors are trading stocks by keeping a close watch on the order information submitted by domestic and foreign investors in real time through Limit Order Book information, so-called price current provided by securities firms. Will order information released in the Limit Order Book be useful in stock price prediction? This study analyzes whether it is significant as a predictor of future stock price up or down when order imbalances appear as investors' buying and selling orders are concentrated to one side during intra-day trading time. Using classification algorithms, this study improved the prediction accuracy of the order imbalance information on the short-term price up and down trend, that is the closing price up and down of the day. Day trading strategies are proposed using the predicted price trends of the classification algorithms and the trading performances are analyzed through empirical analysis. The 5-minute KOSPI200 Index Futures data were analyzed for 4,564 days from January 19, 2004 to June 30, 2022. The results of the empirical analysis are as follows. First, order imbalance information has a significant impact on the current stock prices. Second, the order imbalance information observed in the early morning has a significant forecasting power on the price trends from the early morning to the market closing time. Third, the Support Vector Machines algorithm showed the highest prediction accuracy on the day's closing price trends using the order imbalance information at 54.1%. Fourth, the order imbalance information measured at an early time of day had higher prediction accuracy than the order imbalance information measured at a later time of day. Fifth, the trading performances of the day trading strategies using the prediction results of the classification algorithms on the price up and down trends were higher than that of the benchmark trading strategy. Sixth, except for the K-Nearest Neighbor algorithm, all investment performances using the classification algorithms showed average higher total profits than that of the benchmark strategy. Seventh, the trading performances using the predictive results of the Logical Regression, Random Forest, Support Vector Machines, and XGBoost algorithms showed higher results than the benchmark strategy in the Sharpe Ratio, which evaluates both profitability and risk. This study has an academic difference from existing studies in that it documented the economic value of the total buy & sell order volume information among the Limit Order Book information. The empirical results of this study are also valuable to the market participants from a trading perspective. In future studies, it is necessary to improve the performance of the trading strategy using more accurate price prediction results by expanding to deep learning models which are actively being studied for predicting stock prices recently.

Stock Price Direction Prediction Using Convolutional Neural Network: Emphasis on Correlation Feature Selection (합성곱 신경망을 이용한 주가방향 예측: 상관관계 속성선택 방법을 중심으로)

  • Kyun Sun Eo;Kun Chang Lee
    • Information Systems Review
    • /
    • v.22 no.4
    • /
    • pp.21-39
    • /
    • 2020
  • Recently, deep learning has shown high performance in various applications such as pattern analysis and image classification. Especially known as a difficult task in the field of machine learning research, stock market forecasting is an area where the effectiveness of deep learning techniques is being verified by many researchers. This study proposed a deep learning Convolutional Neural Network (CNN) model to predict the direction of stock prices. We then used the feature selection method to improve the performance of the model. We compared the performance of machine learning classifiers against CNN. The classifiers used in this study are as follows: Logistic Regression, Decision Tree, Neural Network, Support Vector Machine, Adaboost, Bagging, and Random Forest. The results of this study confirmed that the CNN showed higher performancecompared with other classifiers in the case of feature selection. The results show that the CNN model effectively predicted the stock price direction by analyzing the embedded values of the financial data

A Case of Establishing Robo-advisor Strategy through Parameter Optimization (금융 지표와 파라미터 최적화를 통한 로보어드바이저 전략 도출 사례)

  • Kang, Mincheal;Lim, Gyoo Gun
    • Journal of Information Technology Services
    • /
    • v.19 no.2
    • /
    • pp.109-124
    • /
    • 2020
  • Facing the 4th Industrial Revolution era, researches on artificial intelligence have become active and attempts have been made to apply machine learning in various fields. In the field of finance, Robo Advisor service, which analyze the market, make investment decisions and allocate assets instead of people, are rapidly expanding. The stock price prediction using the machine learning that has been carried out to date is mainly based on the prediction of the market index such as KOSPI, and utilizes technical data that is fundamental index or price derivative index using financial statement. However, most researches have proceeded without any explicit verification of the prediction rate of the learning data. In this study, we conducted an experiment to determine the degree of market prediction ability of basic indicators, technical indicators, and system risk indicators (AR) used in stock price prediction. First, we set the core parameters for each financial indicator and define the objective function reflecting the return and volatility. Then, an experiment was performed to extract the sample from the distribution of each parameter by the Markov chain Monte Carlo (MCMC) method and to find the optimum value to maximize the objective function. Since Robo Advisor is a commodity that trades financial instruments such as stocks and funds, it can not be utilized only by forecasting the market index. The sample for this experiment is data of 17 years of 1,500 stocks that have been listed in Korea for more than 5 years after listing. As a result of the experiment, it was possible to establish a meaningful trading strategy that exceeds the market return. This study can be utilized as a basis for the development of Robo Advisor products in that it includes a large proportion of listed stocks in Korea, rather than an experiment on a single index, and verifies market predictability of various financial indicators.

Analysis of Global Food Market and Food-Energy Price Links: Based on System Dynamics Approach

  • Kim, Gyu-Rim
    • Korean System Dynamics Review
    • /
    • v.10 no.3
    • /
    • pp.105-124
    • /
    • 2009
  • The situation of the global food markets has been being rapidly restructured and entering on a new phase by new dynamic and driving forces. The factors such as economic growth and income increase, high energy price, globalization, urbanization, and global climate change are transforming patterns of food consumption, production, and markets. The prices and markets of world food and energy are getting increasingly linked each other. Food and fuel are the global dilemma issues associated with the risk of diverting farmland or of consuming cereals for biofuel production in detriment of the cereals supply to the global food markets. An estimated 100 million tons of grain per year are being redirected from food to fuel. Therefore, the objectives of this study are as follows: Firstly, the study examines situations of the world food and energy resources, analyzes the trends of prices of the crude oil and biofuel, and formulates the food-energy links mechanism. Secondly, the study builds a simulation model, based on system dynamics approach, for not only analyzing the global cereals market and energy market but also forecasting the global production, consumption, and stock of those markets by 2030 in the future. The model of this study consists of four sectors, i.e., world population dynamics sector, global food market dynamics sector, global energy market dynamics sector, scenario sector of world economic growth and oil price.

  • PDF