Abstract

Good data quality is vital for personalising plans in rehabilitation. Machine learning (ML) improves prognostics but integrating it with Multiple Imputation (MImp) for dealing missingness is an unexplored field. This work aims to provide post-stroke ambulation prognosis, integrating MImp with ML, and identify the prognostic influential factors. Stroke survivors in intensive rehabilitation were enrolled. Data on demographics, events, clinical, physiotherapy, and psycho-social assessment were collected. An independent ambulation at discharge, using the Functional Ambulation Category scale, was the outcome. After handling missingness using MImp, ML models were optimised, cross-validated, and tested. Interpretability techniques analysed predictor contributions. Pre-MImp, the dataset included 54.1% women, 79.2% ischaemic patients, median age 80.0 (interquartile range: 15.0). Post-MImp, 368 non-ambulatory patients on 10 imputed datasets were used for training, 80 for testing. The random forest (the validation best-performing algorithm) obtained 75.5% aggregated balanced accuracy on the test set. The main predictors included modified Barthel index, Fugl-Meyer assessment/motricity index, short physical performance battery, age, Charlson comorbidity index/cumulative illness rating scale, and trunk control test. This is among the first studies applying ML, together with MImp, to predict ambulation recovery in post-stroke rehabilitation. This pipeline reliably exploits the potential of incomplete datasets for healthcare prognosis, identifying relevant predictors.

Details

Title
Multiple imputation integrated to machine learning: predicting post-stroke recovery of ambulation after intensive inpatient rehabilitation
Author
Finocchi, Alice 1 ; Campagnini, Silvia 1 ; Mannini, Andrea 1 ; Doronzio, Stefano 2 ; Baccini, Marco 1 ; Hakiki, Bahia 2 ; Bardi, Donata 1 ; Grippo, Antonello 3 ; Macchi, Claudio 2 ; Navarro Solano, Jorge 4 ; Baccini, Michela 5 ; Cecchi, Francesca 2 

 IRCCS Fondazione Don Carlo Gnocchi onlus, Firenze, Italy (GRID:grid.418563.d) (ISNI:0000 0001 1090 9021) 
 IRCCS Fondazione Don Carlo Gnocchi onlus, Firenze, Italy (GRID:grid.418563.d) (ISNI:0000 0001 1090 9021); University of Florence, Department of Experimental and Clinical Medicine, Firenze, Italy (GRID:grid.8404.8) (ISNI:0000 0004 1757 2304) 
 IRCCS Fondazione Don Carlo Gnocchi onlus, Firenze, Italy (GRID:grid.418563.d) (ISNI:0000 0001 1090 9021); Azienda Ospedaliera Universitaria Careggi (AOUC), Firenze, Italy (GRID:grid.24704.35) (ISNI:0000 0004 1759 9494) 
 IRCCS Fondazione Don Carlo Gnocchi onlus, Milano, Italy (GRID:grid.418563.d) (ISNI:0000 0001 1090 9021) 
 University of Florence, Department of Statistics, Computer Science, Applications, Firenze, Italy (GRID:grid.8404.8) (ISNI:0000 0004 1757 2304) 
Pages
25188
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3120217514
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.