• Title/Summary/Keyword: 주식 예측

Search Result 352, Processing Time 0.03 seconds

A Time Series Graph based Convolutional Neural Network Model for Effective Input Variable Pattern Learning : Application to the Prediction of Stock Market (효과적인 입력변수 패턴 학습을 위한 시계열 그래프 기반 합성곱 신경망 모형: 주식시장 예측에의 응용)

  • Lee, Mo-Se;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.167-181
    • /
    • 2018
  • Over the past decade, deep learning has been in spotlight among various machine learning algorithms. In particular, CNN(Convolutional Neural Network), which is known as the effective solution for recognizing and classifying images or voices, has been popularly applied to classification and prediction problems. In this study, we investigate the way to apply CNN in business problem solving. Specifically, this study propose to apply CNN to stock market prediction, one of the most challenging tasks in the machine learning research. As mentioned, CNN has strength in interpreting images. Thus, the model proposed in this study adopts CNN as the binary classifier that predicts stock market direction (upward or downward) by using time series graphs as its inputs. That is, our proposal is to build a machine learning algorithm that mimics an experts called 'technical analysts' who examine the graph of past price movement, and predict future financial price movements. Our proposed model named 'CNN-FG(Convolutional Neural Network using Fluctuation Graph)' consists of five steps. In the first step, it divides the dataset into the intervals of 5 days. And then, it creates time series graphs for the divided dataset in step 2. The size of the image in which the graph is drawn is $40(pixels){\times}40(pixels)$, and the graph of each independent variable was drawn using different colors. In step 3, the model converts the images into the matrices. Each image is converted into the combination of three matrices in order to express the value of the color using R(red), G(green), and B(blue) scale. In the next step, it splits the dataset of the graph images into training and validation datasets. We used 80% of the total dataset as the training dataset, and the remaining 20% as the validation dataset. And then, CNN classifiers are trained using the images of training dataset in the final step. Regarding the parameters of CNN-FG, we adopted two convolution filters ($5{\times}5{\times}6$ and $5{\times}5{\times}9$) in the convolution layer. In the pooling layer, $2{\times}2$ max pooling filter was used. The numbers of the nodes in two hidden layers were set to, respectively, 900 and 32, and the number of the nodes in the output layer was set to 2(one is for the prediction of upward trend, and the other one is for downward trend). Activation functions for the convolution layer and the hidden layer were set to ReLU(Rectified Linear Unit), and one for the output layer set to Softmax function. To validate our model - CNN-FG, we applied it to the prediction of KOSPI200 for 2,026 days in eight years (from 2009 to 2016). To match the proportions of the two groups in the independent variable (i.e. tomorrow's stock market movement), we selected 1,950 samples by applying random sampling. Finally, we built the training dataset using 80% of the total dataset (1,560 samples), and the validation dataset using 20% (390 samples). The dependent variables of the experimental dataset included twelve technical indicators popularly been used in the previous studies. They include Stochastic %K, Stochastic %D, Momentum, ROC(rate of change), LW %R(Larry William's %R), A/D oscillator(accumulation/distribution oscillator), OSCP(price oscillator), CCI(commodity channel index), and so on. To confirm the superiority of CNN-FG, we compared its prediction accuracy with the ones of other classification models. Experimental results showed that CNN-FG outperforms LOGIT(logistic regression), ANN(artificial neural network), and SVM(support vector machine) with the statistical significance. These empirical results imply that converting time series business data into graphs and building CNN-based classification models using these graphs can be effective from the perspective of prediction accuracy. Thus, this paper sheds a light on how to apply deep learning techniques to the domain of business problem solving.

A Study on the Prediction Model of Stock Price Index Trend based on GA-MSVM that Simultaneously Optimizes Feature and Instance Selection (입력변수 및 학습사례 선정을 동시에 최적화하는 GA-MSVM 기반 주가지수 추세 예측 모형에 관한 연구)

  • Lee, Jong-sik;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.4
    • /
    • pp.147-168
    • /
    • 2017
  • There have been many studies on accurate stock market forecasting in academia for a long time, and now there are also various forecasting models using various techniques. Recently, many attempts have been made to predict the stock index using various machine learning methods including Deep Learning. Although the fundamental analysis and the technical analysis method are used for the analysis of the traditional stock investment transaction, the technical analysis method is more useful for the application of the short-term transaction prediction or statistical and mathematical techniques. Most of the studies that have been conducted using these technical indicators have studied the model of predicting stock prices by binary classification - rising or falling - of stock market fluctuations in the future market (usually next trading day). However, it is also true that this binary classification has many unfavorable aspects in predicting trends, identifying trading signals, or signaling portfolio rebalancing. In this study, we try to predict the stock index by expanding the stock index trend (upward trend, boxed, downward trend) to the multiple classification system in the existing binary index method. In order to solve this multi-classification problem, a technique such as Multinomial Logistic Regression Analysis (MLOGIT), Multiple Discriminant Analysis (MDA) or Artificial Neural Networks (ANN) we propose an optimization model using Genetic Algorithm as a wrapper for improving the performance of this model using Multi-classification Support Vector Machines (MSVM), which has proved to be superior in prediction performance. In particular, the proposed model named GA-MSVM is designed to maximize model performance by optimizing not only the kernel function parameters of MSVM, but also the optimal selection of input variables (feature selection) as well as instance selection. In order to verify the performance of the proposed model, we applied the proposed method to the real data. The results show that the proposed method is more effective than the conventional multivariate SVM, which has been known to show the best prediction performance up to now, as well as existing artificial intelligence / data mining techniques such as MDA, MLOGIT, CBR, and it is confirmed that the prediction performance is better than this. Especially, it has been confirmed that the 'instance selection' plays a very important role in predicting the stock index trend, and it is confirmed that the improvement effect of the model is more important than other factors. To verify the usefulness of GA-MSVM, we applied it to Korea's real KOSPI200 stock index trend forecast. Our research is primarily aimed at predicting trend segments to capture signal acquisition or short-term trend transition points. The experimental data set includes technical indicators such as the price and volatility index (2004 ~ 2017) and macroeconomic data (interest rate, exchange rate, S&P 500, etc.) of KOSPI200 stock index in Korea. Using a variety of statistical methods including one-way ANOVA and stepwise MDA, 15 indicators were selected as candidate independent variables. The dependent variable, trend classification, was classified into three states: 1 (upward trend), 0 (boxed), and -1 (downward trend). 70% of the total data for each class was used for training and the remaining 30% was used for verifying. To verify the performance of the proposed model, several comparative model experiments such as MDA, MLOGIT, CBR, ANN and MSVM were conducted. MSVM has adopted the One-Against-One (OAO) approach, which is known as the most accurate approach among the various MSVM approaches. Although there are some limitations, the final experimental results demonstrate that the proposed model, GA-MSVM, performs at a significantly higher level than all comparative models.

금융실명제 실시가 비기대이익의 분산과 이익반응계수에 미치는 영향에 관한 실증적 연구

  • Kim, Myeong-Gyun;Kim, Byeong-Ho;Choi, In
    • The Korean Journal of Financial Management
    • /
    • v.12 no.2
    • /
    • pp.163-184
    • /
    • 1995
  • 본 논문은 금융실명제가 기업에서 발표하는 회계학적 이익정보에 대한 주식가격의 변화에 미치는 영향을 분석하였다. 이는 금융실명제실시 이후에는 기업에서 창출해 내는 기업이익이 진정한 이익에 보다 더 접근을 할 것이라 예상과 채무분석가의 기업이익에 대한 예측치는 진정한 이익에 대한 예측치이므로 금융실명제 실시 이후에는 예측오차가 감소할 것이다는 일반적 예상을 검증하기 위한 것이다. 본 논문은 먼저 1992년과 1993년 12월 결산기업에 대하여 비기대이익을 계산하여 두 해에서의 차이를 분석하였고, 계산된 비기대이익과 기업이익 공시시점에서의 비정상수익율과의 관계를 회귀분석을 통하여 분석하였다. 채무분석가의 예측치로서 대우경제연구소에서 1992년과 1993년 12월에 각각 발표한 각 상장기업의 이익에 대한1992년 및 1993년의 예상치를 각각 년도의 예상기업 이익으로 사용하고 실제로 1993년과 1994년 초에 공시되는 기업이익과의 차이를 조사하였다. 비정상수익율의 계산은 시장위험조정모형과 시장조정모형을 사용하였고 일별수익율에 의하여 측정하였다. 사건 시점은 주주총회 일을 중심으로하여 여러 사건 기간을 택하여 분석을 하였다. 실증적 분석 결과를 보면, 전체표본을 대상으로한 재무분석가의 추정치에 의하여 계산된 비기대이익의 분산이 금융실명제 실시 이후가 실시 이전에 비하여 더 크게 나타났다. 이러한 결과는 금융실명제의 실시로 인하여 재무분석가의 예측이 오히려 더 부정확하게 나타난 것이라 할 수 있다. 이러한 결과는 실명제 실시에 따라서 기업이익예측에 대한 불확실성이 더 증가를 하여 기업이익 공시시점에서의 비기대이익의 측정에서의 오차가 오히려 증가하였다는 것을 알 수 있다. 그러나 전체표본을 소그룹으로 나누어서, 1부에 속한 기업들과 대형 주기업들을 대상으로한 분석에서는 이 두 소그룹에 속한 기업들이 각각 금융실명제실시 이후가 금융실명제 실시 이전보다 비기대이익의 분산이 작게 나타났다. 이러한 결과는 1부에 속한 기업들과 대형주기업들에서 는 금융실명제실시로 채무분석가들의 이익 예측치가 더 정확성을 지니게 된 것으로 해석된다. 이익반응계수의 추정에서 예상했던 바와는 반대로 금융실명제 실시 이후에 계수의 크기가 오히려 감소하였다. 소그룹으로 나누어서 분석한 결과도 마찬가지였다. 금융실명제 실시가 기업회계이익에 미친 영향은 비기대이익의 측정을 통하여 일부 가설과 일치하는 결과를 얻었고, 이익반응계수의 측정에서는 가설과 일치하는 결과를 얻지 못하였다.

  • PDF

A Study of Economic Indicator Prediction Model using Dimensions Decrease Techniques and HMM (차원감소기법과 은닉마아코프모델을 이용한 경기지표 예측 모델 연구)

  • Jeon, Jin-Ho;Kim, Min-Soo
    • Journal of Digital Convergence
    • /
    • v.11 no.10
    • /
    • pp.305-311
    • /
    • 2013
  • The size of the market as the economy continues to evolve, in order to make the right decisions to accurately predict the economic problems the market has emerged as an important issues. To express the modern economic system, the largest of the various economic indicators, pillars stock indicators analysis and decision-making with a proper understanding of the problem for the application of the model is suitable for time-series data concealment HMM. Based on this time series model and the calculation of the time and cost savings dimension decrease techniques for the estimation and prediction of the model was applied to the problem was to verify the validity. As a result, the model predictions in both the short term rather than long-term predictions of the model estimates the optimal predictive value similar pattern very similar to both the actual data and was able to confirm that.

The Study of the Financial Index Prediction Using the Equalized Multi-layer Arithmetic Neural Network (균등다층연산 신경망을 이용한 금융지표지수 예측에 관한 연구)

  • 김성곤;김환용
    • Journal of the Korea Society of Computer and Information
    • /
    • v.8 no.3
    • /
    • pp.113-123
    • /
    • 2003
  • Many researches on the application of neural networks for making financial index prediction have proven their advantages over statistical and other methods. In this paper, a neural network model is proposed for the Buying, Holding or Selling timing prediction in stocks by the price index of stocks by inputting the closing price and volume of dealing in stocks and the technical indexes(MACD, Psychological Line). This model has an equalized multi-layer arithmetic function as well as the time series prediction function of backpropagation neural network algorithm. In the case that the numbers of learning data are unbalanced among the three categories (Buying, Holding or Selling), the neural network with conventional method has the problem that it tries to improve only the prediction accuracy of the most dominant category. Therefore, this paper, after describing the structure, working and learning algorithm of the neural network, shows the equalized multi-layer arithmetic method controlling the numbers of learning data by using information about the importance of each category for improving prediction accuracy of other category. Experimental results show that the financial index prediction using the equalized multi-layer arithmetic neural network has much higher correctness rate than the other conventional models.

  • PDF

An Empirical Study on Prediction of the Art Price using Multivariate Long Short Term Memory Recurrent Neural Network Deep Learning Model (다변수 LSTM 순환신경망 딥러닝 모형을 이용한 미술품 가격 예측에 관한 실증연구)

  • Lee, Jiin;Song, Jeongseok
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.6
    • /
    • pp.552-560
    • /
    • 2021
  • With the recent development of the art distribution system, interest in art investment is increasing rather than seeing art as an object of aesthetic utility. Unlike stocks and bonds, the price of artworks has a heterogeneous characteristic that is determined by reflecting both objective and subjective factors, so the uncertainty in price prediction is high. In this study, we used LSTM Recurrent Neural Network deep learning model to predict the auction winning price by inputting the artist, physical and sales charateristics of the Korean artist. According to the result, the RMSE value, which explains the difference between the predicted and actual price by model, was 0.064. Painter Lee Dae Won had the highest predictive power, and Lee Joong Seop had the lowest. The results suggest the art market becomes more active as investment goods and demand for auction winning price increases.

Prediction of Stock Returns from News Article's Recommended Stocks Using XGBoost and LightGBM Models

  • Yoo-jin Hwang;Seung-yeon Son;Zoon-ky Lee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.2
    • /
    • pp.51-59
    • /
    • 2024
  • This study examines the relationship between the release of the news and the individual stock returns. Investors utilize a variety of information sources to maximize stock returns when establishing investment strategies. News companies publish their articles based on stock recommendation reports of analysts, enhancing the reliability of the information. Defining release of a stock-recommendation news article as an event, we examine its economic impacts and propose a binary classification model that predicts the stock return 10 days after the event. XGBoost and LightGBM models are applied for the study with accuracy of 75%, 71% respectively. In addition, after categorizing the recommended stocks based on the listed market(KOSPI/KOSDAQ) and market capitalization(Big/Small), this study verifies difference in the accuracy of models across four sub-datasets. Finally, by conducting SHAP(Shapley Additive exPlanations) analysis, we identify the key variables in each model, reinforcing the interpretability of models.

원유선물시장(原油先物市場)과 현물시장(現物市場)의 동태적통합(動態的統合) 및 효율성(效率性)

  • Park, Ju-Ho
    • Environmental and Resource Economics Review
    • /
    • v.6 no.2
    • /
    • pp.171-191
    • /
    • 1997
  • 83년 7월부터 NYMEX 선물시장에서 거래되기 시작한 원유선물은 90년대 들어 주식 채권 외환 등의 금융시장과 관련하여 크게 성장하고 있으며, 원유선물가격이 현물시장에서의 가격형성에 큰 영향을 미치고 있다. 따라서, 원유선물가격이 미래의 현물가격에 대한 최적의 예측치라고 하는 합리적기대모형(合理的期待模型)에 의거하여 원유선물 가격과 현물가격의 변화추이 및 그들 사이의 장(長) 단기(短期) 균형관계(均衡關係)(동태적통합(動態的統合))와 효율성(效率性)등을 일별(日別) NYMEX 선물유가(근월도래선물(近月到來先物)의 종가(終價))와 WTI 현물유가의 자료를 이용하여 계량분석하였다. 원유선물가격과 현물가격은 단위근(單位根)을 갖는 불안정(不安定)한 시계열이지만, 선물유가와 현물유가사이에는 공적분관계(共積分關係)(공통확률적추세(共通確率的趨勢))가 있어 장기적(長期的) 균형관계(均衡關係)가 존재하며, 또한 공시계열상관관계(共時系列相關關係)(공통안정적순환(共通安定的循環))가 있어 단기적(短期的) 균형관계(均衡關係)도 존재하는 것으로 보여진다. 그리고 선물유가는 미래의 현물유가에 대한 예측력이 있는 것으로 보여진다. 따라서, 원유선물가격이 미래의 현물가격에 대한 최적의 예측치라고 히는 합리적기대모형(合理的期待模型)과 일치하는 것으로 나타났다. 원유선물가격이 현물가격과 장(長) 단기적(短期的)으로 동태적(動態的)인 균형관계를 보이고 있으므로 정부의 합리적인 수입선다변화정책과 유가자유화에 따른 석유업계의 효율적인 운영방안의 하나로 원유선물시장의 활용이 더욱 더 필요할 것으로 생각된다.

  • PDF

Stochastic Shocks and Structural Breaks of Securities Markets (충격(衝擊)의 확률적 장기영향과 자본시장의 구조변화(構造變化))

  • Rhee, Il-King
    • The Korean Journal of Financial Management
    • /
    • v.17 no.1
    • /
    • pp.91-110
    • /
    • 2000
  • 충격이 경제에 가해질 때 이 충격이 경제 내에 일시적으로 존속하는 경우도 있고 이 충격이 영구히 존속하는 경우도 있다. 이 양극단 사이의 과정도 존재할 수 있다. 이것을 표상한 것이 stopbreak 과정이다. 충격의 효과가 영구적 효과와 일시적 효과 사이에서 파동하는 시계열을 모형화한 것이 이 과정인 것이다. 이 과정에서는 일정한 기간에는 영구적인 평균이동이 발생하여 구조변화가 발생한다. 다른 기간에 발생하는 충격은 그 효과가 급속히 소멸한다. 밀접한 관계를 맺고 있는 두 주가의 비율은 한 주가의 변동이 제시하는 것을 분석하고 이것을 이용하여 다른 주가를 예측할 수 있는 정보를 제공한다. 한 주가의 변동이 발생하면 이 두 주가의 비율은 변동한다. 그러나 한 주가의 변동의 정보성이 인정되어 이 정보가 다른 주가에 반영되어 조정되면 두 주가의 비율은 변동이전의 수준으로 회귀할 것이다. 변동이 영구적이면 두 주가비율은 동일한 수준을 유지할 것이다. 반면 다른 주가에 영향을 미치지 못하는 정보이면 두 주가의 비율은 변동된 상태에서 지속될 것이다. 일정기간은 영구적 구조변화가 발생하고 그 이외의 기간에는 구조 변화가 발생하지 않고 있는 것이다. 따라서 stopbreak 과정을 사용하여 정확한 예측을 수행할 수 있다. 주가지수들이 stopbreak 과정에 의하여 생성되고 있음이 발견되었다. 즉 주가지수들은 확률적 영구구조변화가 발생하고 있는 시계열들이다. 종합주가지수/제조업지수 역시 확률적 영구구조변화를 가지는 stopbreak 과정에 의하여 생성되고 있음이 밝혀졌다. 이 과정을 실제에 적용하여 주가의 움직임을 파악하면 예측이 가능하다. 특히 연관성이 깊은 두 주식의 주가비율을 사용할 때 효과적이라 할 수 있다.

  • PDF

A Study on the Predicted Model of the Relationship Between Financial Information and Market Beta (재무정보와 베타예측모델에 관한 연구)

  • 신창섭
    • The Journal of Information Technology
    • /
    • v.1 no.2
    • /
    • pp.25-37
    • /
    • 1998
  • The paper discusses several means for estimating appropriating discount rates to value non-traded assets. That Is, this study discusses the relationship between market equity beta and observable finance information. The relationship can in principle be used to determine betas for non-traded entity for which conventional market model or pure-play techniques are impractical. In addition, the paper shows on model researched by Patterson in 1993. Patterson's research investigates the cross-sectional relationship market beta and accounting beta in Canadian capital market.

  • PDF