Full text

Turn on search term navigation

© Karlo Puh and Marina Bagić Babac. This work is published under http://creativecommons.org/licences/by/4.0/legalcode (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Purpose

Predicting the stock market's prices has always been an interesting topic since its closely related to making money. Recently, the advances in natural language processing (NLP) have opened new perspectives for solving this task. The purpose of this paper is to show a state-of-the-art natural language approach to using language in predicting the stock market.

Design/methodology/approach

In this paper, the conventional statistical models for time-series prediction are implemented as a benchmark. Then, for methodological comparison, various state-of-the-art natural language models ranging from the baseline convolutional and recurrent neural network models to the most advanced transformer-based models are developed, implemented and tested.

Findings

Experimental results show that there is a correlation between the textual information in the news headlines and stock price prediction. The model based on the GRU (gated recurrent unit) cell with one linear layer, which takes pairs of the historical prices and the sentiment score calculated using transformer-based models, achieved the best result.

Originality/value

This study provides an insight into how to use NLP to improve stock price prediction and shows that there is a correlation between news headlines and stock price prediction.

Details

Title
Predicting stock market using natural language processing
Author
Puh, Karlo 1 ; Babac, Marina Bagić 1   VIAFID ORCID Logo 

 Faculty of Electrical Engineering and Computing, University of Zagreb, Zagreb, Croatia 
Pages
41-61
Publication year
2023
Publication date
2023
Publisher
Emerald Group Publishing Limited
ISSN
1935519X
e-ISSN
19355181
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2811439750
Copyright
© Karlo Puh and Marina Bagić Babac. This work is published under http://creativecommons.org/licences/by/4.0/legalcode (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.