• Title/Summary/Keyword: Stock trend prediction

Search Result 20, Processing Time 0.026 seconds

Classification Algorithm-based Prediction Performance of Order Imbalance Information on Short-Term Stock Price (분류 알고리즘 기반 주문 불균형 정보의 단기 주가 예측 성과)

  • Kim, S.W.
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.4
    • /
    • pp.157-177
    • /
    • 2022
  • Investors are trading stocks by keeping a close watch on the order information submitted by domestic and foreign investors in real time through Limit Order Book information, so-called price current provided by securities firms. Will order information released in the Limit Order Book be useful in stock price prediction? This study analyzes whether it is significant as a predictor of future stock price up or down when order imbalances appear as investors' buying and selling orders are concentrated to one side during intra-day trading time. Using classification algorithms, this study improved the prediction accuracy of the order imbalance information on the short-term price up and down trend, that is the closing price up and down of the day. Day trading strategies are proposed using the predicted price trends of the classification algorithms and the trading performances are analyzed through empirical analysis. The 5-minute KOSPI200 Index Futures data were analyzed for 4,564 days from January 19, 2004 to June 30, 2022. The results of the empirical analysis are as follows. First, order imbalance information has a significant impact on the current stock prices. Second, the order imbalance information observed in the early morning has a significant forecasting power on the price trends from the early morning to the market closing time. Third, the Support Vector Machines algorithm showed the highest prediction accuracy on the day's closing price trends using the order imbalance information at 54.1%. Fourth, the order imbalance information measured at an early time of day had higher prediction accuracy than the order imbalance information measured at a later time of day. Fifth, the trading performances of the day trading strategies using the prediction results of the classification algorithms on the price up and down trends were higher than that of the benchmark trading strategy. Sixth, except for the K-Nearest Neighbor algorithm, all investment performances using the classification algorithms showed average higher total profits than that of the benchmark strategy. Seventh, the trading performances using the predictive results of the Logical Regression, Random Forest, Support Vector Machines, and XGBoost algorithms showed higher results than the benchmark strategy in the Sharpe Ratio, which evaluates both profitability and risk. This study has an academic difference from existing studies in that it documented the economic value of the total buy & sell order volume information among the Limit Order Book information. The empirical results of this study are also valuable to the market participants from a trading perspective. In future studies, it is necessary to improve the performance of the trading strategy using more accurate price prediction results by expanding to deep learning models which are actively being studied for predicting stock prices recently.

A Time Series Graph based Convolutional Neural Network Model for Effective Input Variable Pattern Learning : Application to the Prediction of Stock Market (효과적인 입력변수 패턴 학습을 위한 시계열 그래프 기반 합성곱 신경망 모형: 주식시장 예측에의 응용)

  • Lee, Mo-Se;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.167-181
    • /
    • 2018
  • Over the past decade, deep learning has been in spotlight among various machine learning algorithms. In particular, CNN(Convolutional Neural Network), which is known as the effective solution for recognizing and classifying images or voices, has been popularly applied to classification and prediction problems. In this study, we investigate the way to apply CNN in business problem solving. Specifically, this study propose to apply CNN to stock market prediction, one of the most challenging tasks in the machine learning research. As mentioned, CNN has strength in interpreting images. Thus, the model proposed in this study adopts CNN as the binary classifier that predicts stock market direction (upward or downward) by using time series graphs as its inputs. That is, our proposal is to build a machine learning algorithm that mimics an experts called 'technical analysts' who examine the graph of past price movement, and predict future financial price movements. Our proposed model named 'CNN-FG(Convolutional Neural Network using Fluctuation Graph)' consists of five steps. In the first step, it divides the dataset into the intervals of 5 days. And then, it creates time series graphs for the divided dataset in step 2. The size of the image in which the graph is drawn is $40(pixels){\times}40(pixels)$, and the graph of each independent variable was drawn using different colors. In step 3, the model converts the images into the matrices. Each image is converted into the combination of three matrices in order to express the value of the color using R(red), G(green), and B(blue) scale. In the next step, it splits the dataset of the graph images into training and validation datasets. We used 80% of the total dataset as the training dataset, and the remaining 20% as the validation dataset. And then, CNN classifiers are trained using the images of training dataset in the final step. Regarding the parameters of CNN-FG, we adopted two convolution filters ($5{\times}5{\times}6$ and $5{\times}5{\times}9$) in the convolution layer. In the pooling layer, $2{\times}2$ max pooling filter was used. The numbers of the nodes in two hidden layers were set to, respectively, 900 and 32, and the number of the nodes in the output layer was set to 2(one is for the prediction of upward trend, and the other one is for downward trend). Activation functions for the convolution layer and the hidden layer were set to ReLU(Rectified Linear Unit), and one for the output layer set to Softmax function. To validate our model - CNN-FG, we applied it to the prediction of KOSPI200 for 2,026 days in eight years (from 2009 to 2016). To match the proportions of the two groups in the independent variable (i.e. tomorrow's stock market movement), we selected 1,950 samples by applying random sampling. Finally, we built the training dataset using 80% of the total dataset (1,560 samples), and the validation dataset using 20% (390 samples). The dependent variables of the experimental dataset included twelve technical indicators popularly been used in the previous studies. They include Stochastic %K, Stochastic %D, Momentum, ROC(rate of change), LW %R(Larry William's %R), A/D oscillator(accumulation/distribution oscillator), OSCP(price oscillator), CCI(commodity channel index), and so on. To confirm the superiority of CNN-FG, we compared its prediction accuracy with the ones of other classification models. Experimental results showed that CNN-FG outperforms LOGIT(logistic regression), ANN(artificial neural network), and SVM(support vector machine) with the statistical significance. These empirical results imply that converting time series business data into graphs and building CNN-based classification models using these graphs can be effective from the perspective of prediction accuracy. Thus, this paper sheds a light on how to apply deep learning techniques to the domain of business problem solving.

A Two-Phase Stock Trading System based on Pattern Matching and Automatic Rule Induction (패턴 매칭과 자동 규칙 생성에 기반한 2단계 주식 트레이딩 시스템)

  • Lee, Jong-Woo;Kim, Yu-Seop;Kim, Sung-Dong;Lee, Jae-Won;Chae, Jin-Seok
    • The KIPS Transactions:PartB
    • /
    • v.10B no.3
    • /
    • pp.257-264
    • /
    • 2003
  • In the context of a dynamic trading environment, the ultimate goal of the financial forecasting system is to optimize a specific trading objective. This paper proposes a two-phase (extraction and filtering) stock trading system that aims at maximizing the rates of returns. Extraction of stocks is performed by searching specific time-series patterns described by a combination of values of technical indicators. In the filtering phase, several rules are applied to the extracted sets of stocks to select stocks to be actually traded. The filtering rules are automatically induced from past data. From a large database of daily stock prices, the values of technical indicators are calculated. They are used to make the extraction patterns, and the distributions of the discretization intervals of the values are calculated for both positive and negative data sets. We assumed that the values in the intervals of distinctive distribution may contribute to the prediction of future trend of stocks, so the rules for filtering stocks are automatically induced from the data in those intervals. We show the rates of returns when using our trading system outperform the market average. These results mean rule induction method using distributional differences is useful.

Optimization of Stock Trading System based on Multi-Agent Q-Learning Framework (다중 에이전트 Q-학습 구조에 기반한 주식 매매 시스템의 최적화)

  • Kim, Yu-Seop;Lee, Jae-Won;Lee, Jong-Woo
    • The KIPS Transactions:PartB
    • /
    • v.11B no.2
    • /
    • pp.207-212
    • /
    • 2004
  • This paper presents a reinforcement learning framework for stock trading systems. Trading system parameters are optimized by Q-learning algorithm and neural networks are adopted for value approximation. In this framework, cooperative multiple agents are used to efficiently integrate global trend prediction and local trading strategy for obtaining better trading performance. Agents Communicate With Others Sharing training episodes and learned policies, while keeping the overall scheme of conventional Q-learning. Experimental results on KOSPI 200 show that a trading system based on the proposed framework outperforms the market average and makes appreciable profits. Furthermore, in view of risk management, the system is superior to a system trained by supervised learning.

Fuzzy Support Vector Machine for Pattern Classification of Time Series Data of KOSPI200 Index (시계열 자료 코스피200의 패턴분류를 위한 퍼지 서포트 벡타 기계)

  • Lee, S.Y.;Sohn, S.Y.;Kim, C.E.;Lee, Y.B.
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.14 no.1
    • /
    • pp.52-56
    • /
    • 2004
  • The Information of classification and estimate about KOSPI200 index`s up and down in the stock market becomes an important standard of decision-making in designing portofolio in futures and option market. Because the coming trend of time series patterns, an economic indicator, is very subordinate to the most recent economic pattern, it is necessary to study the recent patterns most preferentially. This paper compares classification and estimated performance of SVM(Support Vector Machine) and Fuzzy SVM model that are getting into the spotlight in time series analyses, neural net models and various fields. Specially, it proves that Fuzzy SVM is superior by presenting the most suitable dimension to fuzzy membership function that has time series attribute in accordance with learning Data Base.

A Study on Stock Trading Method based on Volatility Breakout Strategy using a Deep Neural Network (심층 신경망을 이용한 변동성 돌파 전략 기반 주식 매매 방법에 관한 연구)

  • Yi, Eunu;Lee, Won-Boo
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.3
    • /
    • pp.81-93
    • /
    • 2022
  • The stock investing is one of the most popular investment techniques. However, since it is not easy to obtain a return through actual investment, various strategies have been devised and tried in the past to obtain an effective and stable return. Among them, the volatility breakout strategy identifies a strong uptrend that exceeds a certain level on a daily basis as a breakout signal, follows the uptrend, and quickly earns daily returns. It is one of the popular investment strategies that are widely used to realize profits. However, it is difficult to predict stock prices by understanding the price trend pattern of stocks. In this paper, we propose a method of buying and selling stocks by predicting the return in trading based on the volatility breakout strategy using a bi-directional long short-term memory deep neural network that can realize a return in a short period of time. As a result of the experiment assuming actual trading on the test data with the learned model, it can be seen that the results outperform both the return and stability compared to the existing closing price prediction model using the long-short-term memory deep neural network model.

A Stock trend Prediction based on Explainable Artificial Intelligence (설명 가능 인공지능 기법을 활용한 주가 전망 예측)

  • Kim, Ji Hyun;Lee, Yeon Su;Jung, Su Min;Jo, Seol A;Ahn, Jeong Eun;Kim, Hyun Hee
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.11a
    • /
    • pp.797-800
    • /
    • 2021
  • 인공지능을 활용한 주가 예측 모형을 실제 금융 서비스에 도입한 사례가 많아지고 있다. 주식 데이터는 일반적인 시계열 데이터와 다르게 예측을 어렵게 하는 복합적인 요소가 존재하며 주식은 리스크가 큰 자산 상품 중 하나이다. 주가 예측 모형의 활용 가능성을 높이기 위해선 성능을 향상시키는 것과 함께 모델을 해석 가능한 형태로 제시해 신뢰성을 향상시킬 필요성이 있다. 본 논문은 주가 전망 결정 방법에 따른 예측 결과를 비교하고, 설명 가능성을 부여해 모형 개선했다는 것에 의의가 있다. 연구 결과, 주가 전망을 장기적으로 결정할수록 정확도가 증가하고, XAI 기법을 통해 모형의 개선 근거를 제시할 수 있음을 알 수 있었다. 본 연구를 통해 인공지능 모형의 신뢰성을 확보하고, 합리적인 투자 결정에 도움을 줄 수 있을 것으로 기대한다.

Forest Resources of the Korea Based on National Forest Inventory Data

  • Kim, Dong-Hyuk;Nor, Dae-Kyun;Jeong, Jin-Hyun;Kim, Sung-Ho;Chung, Dong-Jun
    • Journal of Forest and Environmental Science
    • /
    • v.24 no.3
    • /
    • pp.159-164
    • /
    • 2008
  • Forest inventory is a commercial term meaning the preparation of detailed descriptive list of articles with number, quantity and value of each item included. Forest inventory deals with the measurement of trees and stands, the estimation of their volume, growth prediction, biomass, carbon stocks and the description tree characteristics, as well as the land upon which they are growing. National Forest Inventory Center (NFIC) in Korea conducts national forest inventory every 5 years to obtain accurate baseline data for national forest policy. The permanent sample plot data used in were collected by NFI. The objective of this study was to develop methods for quantifying forest resources at national scale based on $5^{th}$ National Forest Inventory (NFI) data in Korea. Forest land area decreased from 6.44 to 6.38 million ha between 1997 and 2007, continuing a slight downward trend in area beginning in the late 1990s. However forest resources of the Korea have continued improving in general condition and quality, as measured by increased average size and volume of trees. Growing-stock volume of the Korea increased from 17 to 123.79 cubic meter per ha between 1976 and 2007. The biomass in Korea was estimated to be 153.81 tons per hectare and carbon stocks in Korea was estimated to be 84.36 tons per hectare by NFI data. This information is important for government officials, public administration, the private business sector, and the researcher. Forest Inventory should be implemented in a way to be able to monitor and assess the forests continuously.

  • PDF

A Study on Trend Using Time Series Data (시계열 데이터 활용에 관한 동향 연구)

  • Shin-Hyeong Choi
    • Advanced Industrial SCIence
    • /
    • v.3 no.1
    • /
    • pp.17-22
    • /
    • 2024
  • History, which began with the emergence of mankind, has a means of recording. Today, we can check the past through data. Generated data may only be generated and stored at a certain moment, but it is not only continuously generated over a certain time interval from the past to the present, but also occurs in the future, so making predictions using it is an important task. In order to find out trends in the use of time series data among numerous data, this paper analyzes the concept of time series data, analyzes Recurrent Neural Network and Long-Short Term Memory, which are mainly used for time series data analysis in the machine learning field, and analyzes the use of these models. Through case studies, it was confirmed that it is being used in various fields such as medical diagnosis, stock price analysis, and climate prediction, and is showing high predictive results. Based on this, we will explore ways to utilize it in the future.

Analysis of Intrinsic Patterns of Time Series Based on Chaos Theory: Focusing on Roulette and KOSPI200 Index Future (카오스 이론 기반 시계열의 내재적 패턴분석: 룰렛과 KOSPI200 지수선물 데이터 대상)

  • Lee, HeeChul;Kim, HongGon;Kim, Hee-Woong
    • Knowledge Management Research
    • /
    • v.22 no.4
    • /
    • pp.119-133
    • /
    • 2021
  • As a large amount of data is produced in each industry, a number of time series pattern prediction studies are being conducted to make quick business decisions. However, there is a limit to predicting specific patterns in nonlinear time series data due to the uncertainty inherent in the data, and there are difficulties in making strategic decisions in corporate management. In addition, in recent decades, various studies have been conducted on data such as demand/supply and financial markets that are suitable for industrial purposes to predict time series data of irregular random walk models, but predict specific rules and achieve sustainable corporate objectives There are difficulties. In this study, the prediction results were compared and analyzed using the Chaos analysis method for roulette data and financial market data, and meaningful results were derived. And, this study confirmed that chaos analysis is useful for finding a new method in analyzing time series data. By comparing and analyzing the characteristics of roulette games with the time series of Korean stock index future, it was derived that predictive power can be improved if the trend is confirmed, and it is meaningful in determining whether nonlinear time series data with high uncertainty have a specific pattern.