Full Text

Turn on search term navigation

© 2025. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Background:Insufficient patient accrual is a major challenge in clinical trials and can result in underpowered studies, as well as exposing study participants to toxicity and additional costs, with limited scientific benefit. Real-world data can provide external controls, but insufficient accrual affects all arms of a study, not just controls. Studies that used generative models to simulate more patients were limited in the accrual scenarios considered, replicability criteria, number of generative models, and number of clinical trials evaluated.

Objective:This study aimed to perform a comprehensive evaluation on the extent generative models can be used to simulate additional patients to compensate for insufficient accrual in clinical trials.

Methods:We performed a retrospective analysis using 10 datasets from 9 fully accrued, completed, and published cancer trials. For each trial, we removed the latest recruited patients (from 10% to 50%), trained a generative model on the remaining patients, and simulated additional patients to replace the removed ones using the generative model to augment the available data. We then replicated the published analysis on this augmented dataset to determine if the findings remained the same. Four different generative models were evaluated: sequential synthesis with decision trees, Bayesian network, generative adversarial network, and a variational autoencoder. These generative models were compared to sampling with replacement (ie, bootstrap) as a simple alternative. Replication of the published analyses used 4 metrics: decision agreement, estimate agreement, standardized difference, and CI overlap.

Results:Sequential synthesis performed well on the 4 replication metrics for the removal of up to 40% of the last recruited patients (decision agreement: 88% to 100% across datasets, estimate agreement: 100%, cannot reject standardized difference null hypothesis: 100%, and CI overlap: 0.8-0.92). Sampling with replacement was the next most effective approach, with decision agreement varying from 78% to 89% across all datasets. There was no evidence of a monotonic relationship in the estimated effect size with recruitment order across these studies. This suggests that patients recruited earlier in a trial were not systematically different than those recruited later, at least partially explaining why generative models trained on early data can effectively simulate patients recruited later in a trial. The fidelity of the generated data relative to the training data on the Hellinger distance was high in all cases.

Conclusions:For an oncology study with insufficient accrual with as few as 60% of target recruitment, sequential synthesis can enable the simulation of the full dataset had the study continued accruing patients and can be an alternative to drawing conclusions from an underpowered study. These results provide evidence demonstrating the potential for generative models to rescue poorly accruing clinical trials, but additional studies are needed to confirm these findings and to generalize them for other diseases.

Details

Title
Augmenting Insufficiently Accruing Oncology Clinical Trials Using Generative Models: Validation Study
Author
Samer El Kababji  VIAFID ORCID Logo  ; Mitsakakis, Nicholas  VIAFID ORCID Logo  ; Jonker, Elizabeth  VIAFID ORCID Logo  ; Beltran-Bless, Ana-Alicia  VIAFID ORCID Logo  ; Pond, Gregory  VIAFID ORCID Logo  ; Vandermeer, Lisa  VIAFID ORCID Logo  ; Radhakrishnan, Dhenuka  VIAFID ORCID Logo  ; Mosquera, Lucy  VIAFID ORCID Logo  ; Paterson, Alexander  VIAFID ORCID Logo  ; Shepherd, Lois  VIAFID ORCID Logo  ; Chen, Bingshu  VIAFID ORCID Logo  ; Barlow, William  VIAFID ORCID Logo  ; Gralow, Julie  VIAFID ORCID Logo  ; Savard, Marie-France  VIAFID ORCID Logo  ; Fesl, Christian  VIAFID ORCID Logo  ; Hlauschek, Dominik  VIAFID ORCID Logo  ; Balic, Marija  VIAFID ORCID Logo  ; Rinnerthaler, Gabriel  VIAFID ORCID Logo  ; Greil, Richard  VIAFID ORCID Logo  ; Gnant, Michael  VIAFID ORCID Logo  ; Clemons, Mark  VIAFID ORCID Logo  ; Khaled El Emam  VIAFID ORCID Logo 
First page
e66821
Section
Artificial Intelligence
Publication year
2025
Publication date
2025
Publisher
Gunther Eysenbach MD MPH, Associate Professor
e-ISSN
1438-8871
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3222368032
Copyright
© 2025. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.