This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
The change trend of the stock price has always been identified as a very important problem in the economic field [1]. Stock prices are affected by various internal and external factors, such as domestic and foreign economic environment, international situation, industry prospect, financial data of listed companies, and stock market operation. Thus, the forecasting method also has different emphasis [2, 3].
The traditional analysis method is based on economics and finance, which mainly uses the fundamental analysis method and the technical analysis method. On the one hand, the fundamental analysis method pays more attention to the intrinsic value of stocks and qualitatively analyzes the external factors that affect the stock, such as interest rate, exchange rate, inflation, industrial policy, finance of listed companies, international relations, and other economic and political factors. On the other hand, the technical analysis method mainly focuses on the direction of stock price, trading volume, and investors’ psychological expectation, which primarily focuses on analyzing the stock index trajectory of individual stocks or the whole market by using K-line chart and other tools. At present, traditional fundamental analysis and technical analysis are still the most commonly employed methods for many organizations and individual investors [4, 5].
The accuracy of the traditional fundamental analysis method is difficult to be convincing. The reason is not only that the influencing factors are in a long-term cycle, but also the forecasting results are more dependent on the professional quality of analysts. As a financial time series, stock data have the characteristics of random walk [6]. Based on statistics and probability theory, some scholars use time series linear forecasting model to predict the short-term stock price with a large number of long-term data, such as vector autoregression (VAR) [7], Bayesian vector autoregression (BVAR) model [8], autoregressive integrated moving average mode (ARIMA) [9], and generalized autoregressive conditional heteroskedasticity model (GARCH) [10]. However, the accuracy of using time series model alone is questioned due to the uncertainty and high noise characteristics of financial time series and the relationship between independent variables and dependent variables is prone to dynamic changes over time, which limits its further application and expansion [11].
It has certain limitations to predict stock price trend with single simply using the linear time series forecasting model or neural network model. At present, combining the advantages of various methods and using various best algorithms to improve the hybrid method is the development trend of financial time series deep learning [12]. Therefore, in order to make the best of the time series characteristics of data series, deeply mine the data features, and improve the accuracy of stock price forecasting, this paper proposes a stock price forecasting method based on CNN-LSTM for the stock closing price of the next day forecasting. Combining the advantages of convolutional neural networks (CNN) that can extract effective features from the data, and long short-term memory (LSTM) which can not only find the interdependence of data in time series data, but also automatically detect the best mode suitable for relevant data, this method can effectively improve the accuracy of stock price forecasting. The CNN-LSTM model uses CNN to extract the features of the input time data and uses LSTM to predict the stock closing price on the next day. In order to verify the effectiveness of the model, this paper uses the daily transaction data of 7127 trading days from July 1, 1991, to August 31, 2020, in which the first 6627 trading days data are the training set and the last 500 trading days data are the test set.
2. Related Work
At present, the financial market is a noisy, nonparametric dynamic system, and there are two main kinds of forecasting methods for stock price: traditional analysis method and machine learning method [13]. The traditional econometric methods or equations with parameters are not suitable for analyzing complex, high-dimensional, and noisy financial series data. In recent years, neural network has become a hot research direction in the field of stock forecasting because it can extract data features from a large number of high-frequency raw data without relying on prior knowledge. In 1988, White used neural network to predict IBM stock, but the experimental results were not good [14]. In 2003, Zhang used neural network and autoregressive integrated moving average model (ARIMA) to forecast stocks, respectively. The experimental results show that neural network has obvious advantages in nonlinear data forecasting, but the accuracy still needs to be improved [15]. In 2005, Sun et al. proposed a time series forecasting method based on neural network. This method combines the optimal partition algorithm (OPA) and radial basis function (RBF) neural network [16]. In 2014, Adhikari et al. proposed a method combining random walk (RW) and artificial neural network (ANN) to predict four financial time series data, and the results showed that the forecasting accuracy had a certain improvement [17]. In 2018, Zhang et al. proposed the network structure of stock price forecasting based on LM-BP neural network, which improved the traditional BP neural network training algorithm’s shortcomings of slow training speed and low precision [18]. In 2018, the experimental results of Hu et al. show that convolutional neural network can predict time series, and deep learning is more suitable for solving the problem of time series. However, because CNN is more commonly used to solve image recognition and feature extraction, the forecasting accuracy of CNN alone is relatively low [19]. In 2020, Kamalov used MLP, CNN, and LSTM to forecast the stock price of four major US public companies. Experimental results showed that these three methods showed better results compared to similar studies that forecast the direction of price change [20]. In 2020, Xue et al. established a high-precision short-term forecasting model of financial market time series based on LSTM deep neural network and compared with the BP neural network, the traditional RNN, and the improved LSTM deep neural network. The results showed that the LSTM deep neural network has high forecasting accuracy and can effectively predict the time series of the stock market [21].
The main contributions of this paper are as follows:
(1) By analyzing the correlation and time series of stock price data, a new deep learning method (CNN-LSTM) is proposed to predict the stock price. In this method, CNN is used to extract the time feature of data, and LSTM is used for data forecasting. It can make full use of the time sequence of stock price data to obtain more reliable forecasting.
(2) By comparing the evaluation indexes of CNN-LSTM with multilayer perceptron (MLP), CNN, RNN, LSTM, and CNN-RNN, it is proved that CNN-LSTM has high forecasting accuracy and is more suitable for stock price forecasting.
3. CNN-LSTM
3.1. CNN-LSTM Model
CNN has the characteristic of paying attention to the most obvious features in the line of sight, so it is widely used in feature engineering. LSTM has the characteristic of expanding according to the sequence of time, and it is widely used in time series. According to the characteristics of CNN and LSTM, a stock forecasting model based on CNN-LSTM is established. The model structure diagram is shown in Figure 1, and the main structure is CNN and LSTM, including input layer, one-dimensional convolution layer, pooling layer, LSTM hidden layer, and full connection layer.
[figure omitted; refer to PDF]
The LSTM calculation process is as follows:
(1) The output value of the last moment and the input value of the current time are input into the forget gate, and the output value of the forget gate is obtained after calculation, as shown in the following formula:
where the value range of
(2) The output value of the last time and the input value of the current time are inputted into the input gate, and the output value and candidate cell state of the input gate are obtained after calculation, as shown in the following formulas:
where the value range of
(3) Update the current cell state as follows:
where the value range of
(4) The output
where the value range of
(5) The output value of LSTM is obtained by calculating the output of the output gate and the state of the cell, as shown in the following formula
3.4. CNN-LSTM Training and Prediction Process
The CNN-LSTM process of training and prediction is shown in Figure 3.
[figure omitted; refer to PDF]
The main steps are as follows:
(1) Input data: input the data required for CNN-LSTM training.
(2) Data standardization: as there is a large gap in the input data, in order to train the model better, the z-score standardization method is adopted to standardize the input data, as shown in the following formula:
where
(3) Initialize network: initialize the weights and biases of each layer of the CNN-LSTM.
(4) CNN layer calculation: the input data are successively passed through the convolution layer and pooling layer in the CNN layer, the feature extraction of the input data is carried out, and the output value is obtained.
(5) LSTM layer calculation: the output data of the CNN layer are calculated through the LSTM layer, and the output value is obtained.
(6) Output layer calculation: the output value of the LSTM layer is input into the full connection layer to get the output value.
(7) Calculation error: the output value calculated by the output layer is compared with the real value of this group of data, and the corresponding error is obtained.
(8) To judge whether the end condition is satisfied: the conditions for the end are to complete a predetermined number of cycles, the weight is lower than a certain threshold, and the error rate of the forecasting is lower than a certain threshold. If one of the conditions for the end is met, the training will be completed, update the entire CNN-LSTM network, and go to step 10; otherwise, go to step 9.
(9) Error backpropagation: propagate the calculated error in the opposite direction, update the weight and bias of each layer, and go to step 4 to continue to train the network.
(10) Save the model: save the trained model for forecasting.
(11) Input data: input the input data required for the forecasting.
(12) Data standardization: the input data are standardized according to formula (8).
(13) Forecasting: input the standardized data into the trained model of CNN-LSTM, and then get the corresponding output value.
(14) Data standardized restore: the output value obtained through the model of CNN-LSTM is the standardized value, and the standardized value is restored to the original value. As shown in the following formula (9).
where
(15) Output result: output the restored results to complete the forecasting process.
4. Experiments
In order to prove the effectiveness of CNN-LSTM, we compared this method with MLP, CNN, RNN, LSTM, and CNN-RNN using the same training set and test set data under the same operating environment. All the experiments are carried out under the running environment of Intel i7-4700H 2.6 GHz, 12 GBs of RAM, 500 GBs of hard disk and Windows 10. According to the influence factors, including the opening price, highest price, lowest price, closing price, volume, turnover, ups and downs, and change, the next day’s closing price is predicted.
4.1. Data
In this experiment, the Shanghai Composite Index (000001) is selected as the experimental data. The daily trading data of 7127 trading days from July 1, 1991, to August 31, 2020, are obtained from the wind database. Each piece of data contains eight items, namely, opening price, highest price, lowest price, closing price, volume, turnover, ups and downs, and change. Some of the data are shown in Table 1. Take the data of the first 6627 trading days as training set and the data of the last 500 trading days as test set.
Table 1
Partial sample data.
Date | Opening price | Highest price | Lowest price | Closing price | Volume (share) | Turnover (RMB) | Ups and downs | Change (%) |
1991/7/1 | 136.64 | 138.62 | 136.56 | 136.85 | 2294000 | 12469884 | −0.71 | −0.5161 |
1991/7/2 | 135.91 | 135.96 | 135.69 | 135.96 | 283800 | 3794100 | −0.89 | −0.6503 |
1991/7/3 | 135.28 | 135.96 | 134.98 | 135.27 | 271500 | 1818504 | −0.69 | −0.5075 |
1991/7/4 | 136.63 | 136.63 | 134.19 | 136.63 | 1339400 | 8095138 | 1.36 | 1.0054 |
1991/7/5 | 136.01 | 137.68 | 135.9 | 135.96 | 1454000 | 9394861 | −0.67 | −0.4904 |
4.2. Model Implementation
In order to evaluate the forecasting effect of CNN-LSTM, the mean absolute error (MAE), root mean square error (RMSE), and R-square (R2) are used as the evaluation criteria of the methods.
The MAE calculation formula is as follows:
The RMSE calculation formula is as follows:
The R2 calculation formula is as follows:
The closer the value of MAE and RMSE to 0, the smaller the error between the predicted value and the real value, the higher the forecasting accuracy. The closer R2 is to 1, the better the fitting degree of the model is.
4.3. Implementation of CNN-LSTM
The parameter setting of the CNN-LSTM for this experiment is shown in Table 2.
Table 2
Parameter setting of CNN-LSTM.
Parameters | Value |
Convolution layer filters | 32 |
Convolution layer kernel_size | 1 |
Convolution layer activation function | tanh |
Convolution layer padding | Same |
Pooling layer pool_size | 1 |
Pooling layer padding | Same |
Pooling layer activation function | Relu |
Number of hidden units in LSTM layer | 64 |
LSTM layer activation function | tanh |
Time_step | 10 |
Batch_size | 64 |
Learning rate | 0.001 |
Optimizer | Adam |
Loss function | mean_absolute_error |
Epochs | 100 |
According to the parameter setting of CNN-LSTM network, we can know that the specific model is constructed as follows: the input training set data is a three-dimensional data vector (None, 10, 8), in which 10 is the size of the time_step and 8 is the 8 features of the input dimension. First, the data enter the one-dimensional convolution layer to further extract features and obtain a three-dimensional output vector (None, 10, 32), in which 32 is the size of the convolution layer filters. Next, the vector enters the pooling layer, and a three-dimensional output vector (None, 10, 32) is also obtained. And then, the output vector enters the LSTM layer for training, and the output data (None, 64) after training enter another layer of full connection layer to get the output value; 64 is the number of hidden units in the LSTM layer. The specific CNN-LSTM model structure is shown in Figure 4.
[figure omitted; refer to PDF]
In Figures 5–10, among the six forecasting methods, the broken line fitting degree of real value and predicted value is CNN-LSTM, CNN-RNN, LSTM, CNN, RNN, and MLP. CNN-LSTM has the highest degree of broken line fitting which almost coincides with each other, and MLP has the lowest degree of broken line fitting.
According to the predicted value and real value of each method, the evaluation index of each method can be calculated, and the comparison results of the six methods are shown in Table 3 and Figures 11–13.
Table 3
Comparison of nine methods evaluation indexes.
Method | MAE | RMSE | R2 |
MLP | 37.584 | 49.799 | 0.9442 |
CNN | 30.138 | 42.967 | 0.9585 |
RNN | 29.916 | 42.957 | 0.9593 |
LSTM | 28.712 | 41.003 | 0.9622 |
CNN-RNN | 28.285 | 40.538 | 0.9630 |
CNN-LSTM | 27.564 | 39.688 | 0.9646 |
From Table 3 and Figures 10–12, the MAE and RMSE of MLP are the largest and R2 is the smallest, while the MAE and RMSE of CNN-LSTM are the smallest, R2 is the largest, and the closest is 1.
By comparing LSTM with RNN, the MAE and RMSE of LSTM decrease, R2 increases by 0.3%, MAE decreases from 29.916 to 28.712 by 4.0%, and RMSE decreases from 42.957 to 41.003 by 4.5%, so LSTM was better than RNN. However, the error measurement indexes MAE and RMSE of CNN-LSTM are the smallest, and the maximum R2 is close to 1. Compared with LSTM, after CNN layer, MAE and RMSE of CNN-LSTM proposed in this paper are lower than those without CNN layer; R2 has a certain improvement; MAE decreases by 4.0%, from 28.712 to 27.564; RMSE decreases by 3.2%, from 41.003 to 39.688; and R2 increases by 0.2%. It shows that the forecasting performance of LSTM can be effectively improved by extracting data features through CNN.
The results show that the performance of CNN-LSTM is the best among the six methods. In terms of forecasting accuracy, MAE is 27.564 and RMSE is 39.688, which is the smallest among the six forecasting models and has high forecasting accuracy, in terms of forecasting performance, and the R2 of CNN-LSTM is 0.9646, which is improved by 2.2%, 0.6%, 0.5%, and 0.2%, respectively, compared with the other four methods. Therefore, the CNN-LSTM proposed in this paper is superior to the other four comparative models in terms of fitting degree and error value. It can well predict the closing price of the next day and provide a reference for investors’ investment.
6. Conclusions
According to the chronological characteristics of stock price data, this paper proposes a CNN-LSTM to predict the stock closing price of the next day. The method uses opening price, highest price, lowest price, closing price, volume, turnover, ups and downs, and change of the stock data as the input, making full use of the time sequence characteristics of the stock data. CNN is used to extract the features of the input data. LSTM is used to learn the extracted feature data and predict the closing price of the stock the next day. This paper takes the relevant data of the Shanghai Composite Index as an example to verify the experimental results. The experimental results show that the CNN-LSTM has the highest forecasting accuracy and the best performance compared with the MLP, CNN, RNN, LSTM, and CNN-RNN. MAE and RMSE are the smallest of all methods, and R2 is close to 1. CNN-LSTM is suitable for the forecasting of stock prices and can provide a relevant reference for investors to maximize investment returns. CNN-LSTM also provides the proposal of practical experience for people’s research on financial time series data. However, the model still has some shortcomings. For example, it only considers the impact of stock price data on closing prices and fails to integrate emotional factors such as news and national policy into the forecast. Our future research work is mainly to increase the sentiment analysis of stock-related news and national policies, so as to ensure the accuracy of stock forecast.
Acknowledgments
This work was funded by the Soft Science Project of Hebei Province, Grant 205576142D, and Humanities and Social Science Research Project of Hebei Education Department, Grant SD201010.
[1] R. Vanaga, B. Sloka, "Financial and capital market commission financing: aspects and challenges," Journal of Logistics, Informatics and Service Science, vol. 7 no. 1, pp. 17-30, 2020.
[2] L. Zhang, H. Kim, "The influence of financial service characteristics on use intention through customer satisfaction with mobile fintech," Journal of System and Management Sciences, vol. 10 no. 2, pp. 82-94, 2020.
[3] L. Badea, V. Ionescu, A.-A. Guzun, "What is the causal relationship between stoxx europe 600 sectors? But between large firms and small firms?," Economic Computation And Economic Cybernetics Studies And Research, vol. 53 no. 3,DOI: 10.24818/18423264/53.3.19.01, 2019.
[4] J. Sousa, J. Montevechi, R. Miranda, "Economic lot-size using machine learning, parallelism, metaheuristic and simulation," Journal of Logistics, Informatics and Service Science, vol. 18 no. 2, pp. 205-216, 2019.
[5] A. Coser, M. M. Maer-Matei, C. Albu, "Predictive models for loan default risk assessment," Economic Computation And Economic Cybernetics Studies And Research, vol. 53 no. 2, pp. 149-165, DOI: 10.24818/18423264/53.2.19.09, 2019.
[6] R. Qiao, "Stock prediction model based on neural network," Operations Research and Management Science, vol. 28 no. 10, pp. 132-140, 2019.
[7] C. Jung, R. Boyd, "Forecasting UK stock prices," Applied Financial Economics, vol. 6 no. 3, pp. 279-286, DOI: 10.1080/096031096334303, 1996.
[8] W. Bleesser, P. Liicoff, "Predicting stock returns with bayesian vector autoregressive," Data Analysis, Machine Learning and Applications, vol. 1, pp. 499-506, DOI: 10.1007/978-3-540-78246-9_59, 2005.
[9] A. Adebiyi, A. Adewumi, C. Ayo, "Stock price prediction using the ARIMA model," .
[10] C. Zhang, X. Cheng, M. Wang, "An empirical research in the stock market of Shanghai by GARCH model," Operations Research and Management Science, vol. 4, pp. 144-146, 2005.
[11] Q. Yang, C. Wang, "A study on forecast of global stock indices based on deep LSTM neural network," Statistical Research, vol. 36 no. 6, pp. 65-77, 2019.
[12] K.-S. Moon, H. Kim, "Performance of deep learning in prediction of stock market volatility," Economic Computation And Economic Cybernetics Studies And Research, vol. 53 no. 2, pp. 77-92, DOI: 10.24818/18423264/53.2.19.05, 2019.
[13] J. Li, S. Pan, L. Huang, X. Zhu, "A machine learning based method for customer behavior prediction," Tehnicki Vjesnik-Technical Gazette, vol. 26 no. 6, pp. 1670-1676, 2019.
[14] H. White, "Economic prediction using neural networks: the case of IBM daily stock returns," Earth Surface Processes & Landforms, vol. 8 no. 5, pp. 409-422, 1988.
[15] G. P. Zhang, "Time series forecasting using a hybrid ARIMA and neural network model," Neurocomputing, vol. 50 no. 1, pp. 159-175, DOI: 10.1016/s0925-2312(01)00702-0, 2003.
[16] Y. Sun, Y. Liang, W. Zhang, "Optimal partition algorithm of the RBF neural network and its application to financial time series forecasting," Neural Computing and Applications, vol. 14, pp. 1441-1449, DOI: 10.1007/s00521-004-0439-7, 2005.
[17] R. Adhikari, R. Agrawal, "A combination of artificial neural network and random walk models for financial time series forecasting," Neural Computing and Applications, vol. 24, pp. 305-315, DOI: 10.1007/s00521-013-1386-y, 2014.
[18] L. Zhang, F. Wang, B. Xu, W. Chi, Q. Wang, T. Sun, "Prediction of stock prices based on LM-BP neural network and the estimation of overfitting point by RDCI," Neural Computing and Applications, vol. 30 no. 5, pp. 1425-1444, DOI: 10.1007/s00521-017-3296-x, 2018.
[19] Y. Hu, "Stock market timing model based on convolutional neural network – a case study of Shanghai composite index," Finance & Economy, vol. 4, pp. 71-74, 2018.
[20] E. Alibasic, B. Fazo, I. Petrovic, "A new approach to calculating electrical energy losses on power lines with a new improved three-mode method," Tehnicki Vjesnik-Technical Gazette, vol. 26 no. 2, pp. 405-411, DOI: 10.17559/TV-20170206144804, 2019.
[21] Y. Xue, C. Wang, C. Miao, "Research on financial assets transaction prediction model based on LSTM neural network," Neural Computing and Applications, vol. 1,DOI: 10.1007/s00521-020-04992-7, 2020.
[22] Y. Lecun, L. Bottou, Y. Bengio, P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86 no. 11, pp. 2278-2324, DOI: 10.1109/5.726791, 1998.
[23] B. S. Kim, T. G. Kim, "Cooperation of simulation and data model for performance analysis of complex systems," International Journal of Simulation Modelling, vol. 18 no. 4, pp. 608-619, DOI: 10.2507/ijsimm18(4)491, 2019.
[24] L. Qin, N. Yu, D. Zhao, "Applying the convolutional neural network deep learning technology to behavioural recognition in intelligent video," Tehnicki Vjesnik-Technical Gazette, vol. 25 no. 2, pp. 528-535, 2018.
[25] S. Hochreiter, J. Schmidhuber, "Long short-term memory," MIT Press, vol. 9 no. 8, pp. 1735-1780, 1997.
[26] V.-D. Ta, C.-M. Liu, D. A. Tadesse, "Portfolio optimization-based stock prediction using long-short term memory network in quantitative trading," Applied Sciences, vol. 10 no. 2, pp. 437-457, DOI: 10.3390/app10020437, 2020.
[27] O. Zarrad, M. A. Hajjaji, M. N. Mansouri, "Hardware implementation of hybrid wind-solar energy system for pumping water based on artificial neural network controller," Studies in Informatics and Control, vol. 28 no. 1, pp. 35-44, DOI: 10.24846/v28i1y201904, 2019.
[28] T. Saric, G. Simunovic, D. Vukelic, K. Simunovic, R. Lujic, "Estimation of CNC grinding process parameters using different neural networks," Tehnicki Vjesnik-Technical Gazette, vol. 25 no. 6, pp. 1770-1775, DOI: 10.17559/TV-20180419095119, 2018.
[29] N. Gupta, A. Jalal, "Integration of textual cues for fine-grained image captioning using deep CNN and LSTM," Neural Computing and Applications, vol. 12,DOI: 10.1007/s00521-019-04515-z, 2019.
[30] A. Yadav, C. K. Jha, A. Sharan, "Optimizing LSTM for time series prediction in Indian stock market," Procedia Computer Science, vol. 167, pp. 2091-2100, DOI: 10.1016/j.procs.2020.03.257, 2020.
[31] H. Y. Kim, C. H. Won, "Forecasting the volatility of stock price index: a hybrid model integrating LSTM with multiple GARCH-type models," Expert Systems with Applications, vol. 103, pp. 25-37, DOI: 10.1016/j.eswa.2018.03.002, 2018.
[32] N. C. Petersen, R. Christoffer, F. Rodrigues, F. C. Pereira, "Multi-output bus travel time prediction with convolutional LSTM neural network," Expert Systems with Applications, vol. 120, pp. 426-435, DOI: 10.1016/j.eswa.2018.11.028, 2019.
[33] Z. Jin, Y. Yang, Y. Liu, "Stock closing price prediction based on sentiment analysis and LSTM," Neural Computing and Applications, vol. 32 no. 13, pp. 9713-9729, DOI: 10.1007/s00521-019-04504-2, 2020.
[34] B. Svetlana, T. Ioannis, "An ensemble of LSTM neural networks for high-frequency stock market classification," Journal of Forecasting, vol. 38 no. 6, pp. 600-619, 2019.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2020 Wenjie Lu et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
Stock price data have the characteristics of time series. At the same time, based on machine learning long short-term memory (LSTM) which has the advantages of analyzing relationships among time series data through its memory function, we propose a forecasting method of stock price based on CNN-LSTM. In the meanwhile, we use MLP, CNN, RNN, LSTM, CNN-RNN, and other forecasting models to predict the stock price one by one. Moreover, the forecasting results of these models are analyzed and compared. The data utilized in this research concern the daily stock prices from July 1, 1991, to August 31, 2020, including 7127 trading days. In terms of historical data, we choose eight features, including opening price, highest price, lowest price, closing price, volume, turnover, ups and downs, and change. Firstly, we adopt CNN to efficiently extract features from the data, which are the items of the previous 10 days. And then, we adopt LSTM to predict the stock price with the extracted feature data. According to the experimental results, the CNN-LSTM can provide a reliable stock price forecasting with the highest prediction accuracy. This forecasting method not only provides a new research idea for stock price forecasting but also provides practical experience for scholars to study financial time series data.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details

1 Business School, Jiangsu Second Normal University, Nanjing 210000, China; School of Economics and Management, Hebei University of Science and Technology, Shijiazhuang 050018, China
2 School of Information Science and Engineering, Hebei University of Science and Technology, Shijiazhuang 050018, China
3 Business School, Jiangsu Second Normal University, Nanjing 210000, China