Beyond Predictions: A Study of Clinical Model Development, Missing Data Impact Analysis, and Healthcare Provider Perspectives

Edakalavan, Smitha. University of Pittsburgh ProQuest Dissertations & Theses, 2025. 32168147.

Abstract (summary)

Machine learning (ML) models are increasingly used in healthcare to support clinical decision-making, yet their real-world implementation remains limited by challenges such as data quality issues, lack of interpretability, difficulty integrating with existing clinical workflows, regulatory barriers, and limited generalizability. This research explores how ML models behave under conditions common in clinical practice, specifically missing data and the need for transparent, patient-specific explanations, and how clinicians interpret and respond to the resulting model outputs. To explore these issues, we applied our framework (a sensitivity analysis framework together with an imputation and explainability framework) to the prediction of venous thromboembolism (VTE), a preventable but serious complication in surgical patients.

We developed ML models to predict VTE during post-surgical hospitalization and after discharge using structured electronic health record (EHR) data from diverse surgical populations. In addition to optimizing predictive performance, we investigated how clinical features that are missing at the time of prediction affect the model's output. We simulated a range of plausible values for each missing feature and observed the resulting variation in risk scores, which allowed us to quantify how sensitive or uncertain a given prediction is. By identifying the features that drive this uncertainty, we offer a new layer of transparency into the model's reliability at the patient level.
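The missing-feature sensitivity analysis described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the logistic model, its coefficients, the feature names (`age`, `bmi`, `d_dimer`), and the grids of plausible values are hypothetical and are not taken from the dissertation. The sketch fills each missing feature with every plausible value, records the spread of the resulting risk scores, and reports which missing feature contributes most to that spread.

```python
import math
from itertools import product

# Hypothetical logistic risk model; coefficients are illustrative only.
COEFFS = {"age": 0.03, "bmi": 0.05, "d_dimer": 0.8}
INTERCEPT = -6.0

# Hypothetical plausible values to simulate when a feature is missing.
PLAUSIBLE = {
    "age": [40, 60, 80],
    "bmi": [20, 27, 35],
    "d_dimer": [0.2, 0.5, 2.0],
}


def risk(features):
    """Return the predicted probability for a fully observed feature dict."""
    z = INTERCEPT + sum(COEFFS[name] * features[name] for name in COEFFS)
    return 1.0 / (1.0 + math.exp(-z))


def sensitivity(patient):
    """Quantify how much missing features could move a patient's risk score.

    `patient` maps feature names to values, with None marking a missing value.
    Returns (lowest score, highest score, per-feature score range).
    """
    missing = [name for name in COEFFS if patient.get(name) is None]
    if not missing:
        score = risk(patient)
        return score, score, {}

    # Overall spread: try every combination of plausible values.
    scores = []
    for combo in product(*(PLAUSIBLE[name] for name in missing)):
        filled = {**patient, **dict(zip(missing, combo))}
        scores.append(risk(filled))

    # Per-feature spread: vary one missing feature while holding the other
    # missing features at the middle of their plausible values.
    per_feature = {}
    for name in missing:
        base = dict(patient)
        for other in missing:
            grid = PLAUSIBLE[other]
            base[other] = grid[len(grid) // 2]
        varied = [risk({**base, name: value}) for value in PLAUSIBLE[name]]
        per_feature[name] = max(varied) - min(varied)

    return min(scores), max(scores), per_feature


if __name__ == "__main__":
    low, high, drivers = sensitivity({"age": 60, "bmi": None, "d_dimer": None})
    print(f"risk could range from {low:.3f} to {high:.3f}")
    for name, spread in sorted(drivers.items(), key=lambda kv: -kv[1]):
        print(f"  {name}: range {spread:.3f}")
```

A wide interval between the lowest and highest score signals a prediction that depends heavily on information not yet available, and the per-feature ranges indicate which measurement would most reduce that uncertainty if it were obtained.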

To understand how clinicians interpret these model outputs, we conducted a study in which clinicians reviewed patient cases with and without access to model explanations and uncertainty estimates. Using statistical assessments, we measured how their risk perception, confidence, and decision-making changed in response to the additional information.

This work tests the hypothesis that ML models that combine accurate predictions with tailored explanations and visibility into the effects of missing data can improve clinician understanding and change clinician behavior. While VTE serves as the motivating application, the broader contribution is in establishing a general framework for evaluating ML model behavior under conditions common in real-world healthcare and for studying how such models are perceived by clinical end-users.

Indexing (details)

Subject

Artificial intelligence; Bioinformatics; Information science

Classification

0800: Artificial intelligence
0723: Information science
0715: Bioinformatics

Identifier / keyword

Machine learning models; Electronic health records; Venous thromboembolism; Risk perception

Title

Beyond Predictions: A Study of Clinical Model Development, Missing Data Impact Analysis, and Healthcare Provider Perspectives

Author

Edakalavan, Smitha

Number of pages

125

Publication year

2025

Degree date

2025

School code

0178

Source

DAI-A 87/9(E), Dissertation Abstracts International

ISBN

9798277405529

Advisor

Ceschin, Rafael

Committee member

Cooper, Gregory; Wang, Yanshan; Visweswaran, Shyam

University/institution

University of Pittsburgh

Department

Biomedical Informatics

University location

United States -- Pennsylvania

Degree

Ph.D.

Source type

Dissertation or Thesis

Language

English

Document type

Dissertation/Thesis

Dissertation/thesis number

32168147

ProQuest document ID

3308462882

Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.

Document URL

https://www.proquest.com/docview/3308462882/$N
