Abstract/Details

Beyond Predictions: A Study of Clinical Model Development, Missing Data Impact Analysis, and Healthcare Provider Perspectives

Edakalavan, Smitha.   University of Pittsburgh ProQuest Dissertations & Theses,  2025. 32168147.

Abstract (summary)

Machine learning (ML) models are increasingly used in healthcare to support clinical decision making, yet their implementation in the real world remains limited by various challenges such as data quality issues, lack of interpretability, integration difficulties with existing clinical workflows, regulatory barriers and generalizability. This research aims to explore how ML models behave under conditions common in clinical practice – specifically missing data and the need for transparent, patient specific explanations – and how clinicians interpret and respond to these model outputs. To explore these issues, we applied our framework (sensitivity analysis framework, imputation and explainability framework) to the prediction of venous thromboembolism (VTE), a preventable but serious complication in surgical patients.

We developed ML models to predict VTE during post-surgical hospitalization and after discharge using structured electronic health records (EHR) data from diverse surgical populations. In addition to optimizing the predictive performance, we also investigate how the missing clinical features at the time of prediction can affect the model’s output. We simulate a range of plausible values for missing features and observe the variation in risk scores. This allows us to quantify how sensitive or uncertain a prediction is. By identifying features that drive this uncertainty, we offer a new layer of transparency into the model’s reliability at the patient level.

To understand how the clinicians interpret these model outputs, we conducted a study in which clinicians review patient cases with and without access to model explanations and uncertainty estimates. Using statistical assessments, we measure how their risk perception, confidence, and decision-making change in response to additional information.

This work tests the hypothesis that ML models that combine accurate predictions with tailored explanations and visibility into the effects of missing data can improve clinician understanding and change clinician behavior. While VTE serves as the motivating application, the broader contribution is in establishing a general framework for evaluating ML model behavior under conditions common in real-world healthcare and for studying how such models are perceived by clinical end-users.

Indexing (details)


Business indexing term
Subject
Artificial intelligence;
Bioinformatics;
Information science
Classification
0800: Artificial intelligence
0723: Information science
0715: Bioinformatics
Identifier / keyword
Machine learning models; Electronic health records; Venous thromboembolism; Risk perception
Title
Beyond Predictions: A Study of Clinical Model Development, Missing Data Impact Analysis, and Healthcare Provider Perspectives
Author
Edakalavan, Smitha
Number of pages
125
Publication year
2025
Degree date
2025
School code
0178
Source
DAI-A 87/9(E), Dissertation Abstracts International
ISBN
9798277405529
Advisor
Ceschin, Rafael
Committee member
Cooper, Gregory; Wang, Yanshan; Visweswaran, Shyam
University/institution
University of Pittsburgh
Department
Biomedical Informatics
University location
United States -- Pennsylvania
Degree
Ph.D.
Source type
Dissertation or Thesis
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
32168147
ProQuest document ID
3308462882
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Document URL
https://www.proquest.com/docview/3308462882/$N