Stock market prediction using machine learning classifiers and social media, news
Article
Khan, W., Ghazanfar, M., Azam, M. A., Karami, A., Alyoubi, K. H. and Alfakeeh, A. S. 2020. Stock market prediction using machine learning classifiers and social media, news. Journal of Ambient Intelligence and Humanized Computing. 13, pp. 3433-3456. https://doi.org/10.1007/s12652-020-01839-w
Authors | Khan, W., Ghazanfar, M., Azam, M. A., Karami, A., Alyoubi, K. H. and Alfakeeh, A. S. |
---|---|
Abstract | Accurate stock market prediction is of great interest to investors; however, stock markets are driven by volatile factors such as microblogs and news that make it hard to predict stock market index based on merely the historical data. The enormous stock market volatility emphasizes the need to effectively assess the role of external factors in stock prediction. Stock markets can be predicted using machine learning algorithms on information contained in social media and financial news, as this data can change investors’ behavior. In this paper, we use algorithms on social media and financial news data to discover the impact of this data on stock market prediction accuracy for ten subsequent days. For improving performance and quality of predictions, feature selection and spam tweets reduction are performed on the data sets. Moreover, we perform experiments to find such stock markets that are difficult to predict and those that are more influenced by social media and financial news. We compare results of different algorithms to find a consistent classifier. Finally, for achieving maximum prediction accuracy, deep learning is used and some classifiers are ensembled. Our experimental results show that highest prediction accuracies of 80.53% and 75.16% are achieved using social media and financial news, respectively. We also show that New York and Red Hat stock markets are hard to predict, New York and IBM stocks are more influenced by social media, while London and Microsoft stocks by financial news. Random forest classifier is found to be consistent and highest accuracy of 83.22% is achieved by its ensemble. |
Keywords | Deep learning; Feature selection; Hybrid algorithm; Natural language processing; Predictive modeling; Sentiment analysis; Stock market prediction |
Journal | Journal of Ambient Intelligence and Humanized Computing |
Journal citation | 13, pp. 3433-3456 |
ISSN | 1868-5145 |
Year | 2020 |
Publisher | Springer |
Accepted author manuscript | License File Access Level Anyone |
Digital Object Identifier (DOI) | https://doi.org/10.1007/s12652-020-01839-w |
Publication dates | |
Online | 14 Mar 2020 |
Jul 2022 | |
Publication process dates | |
Accepted | 25 Feb 2020 |
Deposited | 09 Oct 2023 |
Copyright holder | © 2020, The Authors |
Additional information | This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/s12652-020-01839-w |
https://repository.uel.ac.uk/item/8w3qx
Download files
Accepted author manuscript
Paper-AIHC-Stock Prediction using Social Media, News-Revised.pdf | ||
License: Springer Nature Terms of Use for accepted manuscripts of subscription articles, books and chapters | ||
File access level: Anyone |
490
total views2713
total downloads44
views this month148
downloads this month