Content area

Abstract

We propose a two-stage statistical learning framework to investigate how financial news headlines posted over weekends affect stock returns. In the first stage, Natural Language Processing (NLP) techniques are used to extract sentiment features from news headlines, including FinBERT sentiment scores and Impact Probabilities derived from Logistic Regression models (Binomial, Multinomial, and Bayesian). These Impact Probabilities estimate the likelihood that a given headline influences the stock’s opening price on the following trading day. In the second stage, we predict over-weekend log returns using various sets of covariates: sentiment-based features, traditional financial indicators (e.g., trading volumes, past returns), and headline counts. We evaluate multiple statistical learning algorithms—including Linear Regression, Polynomial Regression, Random Forests, and Support Vector Machines—using cross-validation and two performance metrics. Our framework is demonstrated using financial news from MarketWatch and stock data for Apple Inc. (AAPL) from 2014 to 2023. The results show that incorporating sentiment features, particularly Impact Probabilities, improves predictive accuracy. This approach offers a robust way to quantify and model the influence of qualitative financial information on stock performance, especially in contexts where markets are closed but news continues to develop.

Details

1009240
Business indexing term
Company / organization
Title
Sentiment-Driven Statistical Modelling of Stock Returns over Weekends
Publication title
Volume
13
Issue
8
First page
201
Number of pages
37
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
20793197
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-08-21
Milestone dates
2025-07-16 (Received); 2025-08-16 (Accepted)
Publication history
 
 
   First posting date
21 Aug 2025
ProQuest document ID
3244001119
Document URL
https://www.proquest.com/scholarly-journals/sentiment-driven-statistical-modelling-stock/docview/3244001119/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-12-02
Database
2 databases
  • Coronavirus Research Database
  • ProQuest One Academic