Content area

Abstract

Background

Alterations in connected speech (CS), such as when describing a picture, have been identified in Alzheimer's disease (AD) and could act as markers of subjective (SCI) and mild cognitive impairment (MCI), and offer opportunities for therapeutic intervention. Machine learning has shown promise in classifying individuals along the AD spectrum using CS features. However, subtle sex differences in language may influence symptom presentation and classification accuracy. We investigated the impact of sex on CS and classification performance across the AD spectrum.

Methods

We analysed Cookie Theft scene descriptions from 751 participants in the CCNA COMPASS‐ND cohort. Forty lexical, semantic and syntactic CS features were extracted using a Python‐based pipeline, and used to train ten logistic regression models. Classification of AD, vascular‐AD (v‐AD), MCI, vascular‐MCI (v‐MCI), and SCI versus cognitively unimpaired (CU) participants was performed separately for men and women using 5‐fold cross‐validation. Age and education were regressed from features within each fold. Mean area under the curve (AUC) was calculated and sex differences in classification performance assessed using Bonferroni‐corrected t‐tests. Features were then ranked based on standardised model coefficients.

Results

Women were classified with higher AUC than men in SCI, MCI, and v‐AD, though only MCI remained significant after correction (p = 0.02). SCI and MCI classifications performed above chance for women, but below chance for men (Figure 1). We therefore focused on important features for v‐AD classifications, which performed above chance for both sexes, using feature rankings. Compared to CU men, men with v‐AD produced fluent speech that lacked detail, with more words indicating lexical access difficulties (e.g “remember”), yet syntactically complex speech (more subordinate phrases and left branching children), which may indicate compensation for lexical difficulties. Compared to CU women, women with v‐AD produced non‐fluent speech with more filled pauses (e.g. “um”), that was repetitive and relied on more common words and phrases, yet also syntactically complex (more subordinate phrases and coordinating conjunctions).

Conclusions

Sex‐stratified classification models revealed differences in performance, with implications for research and clinical applications. Linguistic markers may be more sensitive for women along the AD spectrum, highlighting the importance of sex‐stratified analyses.

Details

1009240
Business indexing term
Title
Investigating sex differences in connected speech across the Alzheimer's disease spectrum using machine learning
Author
Clarke, Natasha 1 ; Bedetti, Christophe 2 ; Metayer, Pierre‐Briac 2 ; Brambati, Simona Maria 1 

 Université de Montréal, Montréal, QC, Canada,, Centre de Recherche de l'Institut Universitaire de Gériatrie de Montréal, Montréal, QC, Canada, 
 Université de Montréal, Montréal, QC, Canada, 
Publication title
Volume
21
Supplement
S3
Number of pages
3
Publication year
2025
Publication date
Dec 1, 2025
Section
CLINICAL MANIFESTATIONS
Publisher
John Wiley & Sons, Inc.
Place of publication
Chicago
Country of publication
United States
ISSN
1552-5260
e-ISSN
1552-5279
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-12-24
Milestone dates
2025-12-24 (publishedOnlineFinalForm)
Publication history
 
 
   First posting date
24 Dec 2025
ProQuest document ID
3286455720
Document URL
https://www.proquest.com/scholarly-journals/investigating-sex-differences-connected-speech/docview/3286455720/se-2?accountid=208611
Copyright
© 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2026-01-06
Database
ProQuest One Academic