Abstract

Radiomic features have a wide range of clinical applications, but variability due to image acquisition factors can affect their performance. The harmonization tool ComBat is a promising solution but is limited by inability to harmonize multimodal distributions, unknown imaging parameters, and multiple imaging parameters. In this study, we propose two methods for addressing these limitations. We propose a sequential method that allows for harmonization of radiomic features by multiple imaging parameters (Nested ComBat). We also employ a Gaussian Mixture Model (GMM)-based method (GMM ComBat) where scans are split into groupings based on the shape of the distribution used for harmonization as a batch effect and subsequent harmonization by a known imaging parameter. These two methods were evaluated on features extracted with CapTK and PyRadiomics from two public lung computed tomography datasets. We found that Nested ComBat exhibited similar performance to standard ComBat in reducing the percentage of features with statistically significant differences in distribution attributable to imaging parameters. GMM ComBat improved harmonization performance over standard ComBat (− 11%, − 10% for Lung3/CAPTK, Lung3/PyRadiomics harmonizing by kernel resolution). Features harmonized with a variant of the Nested method and the GMM split method demonstrated similar c-statistics and Kaplan–Meier curves when used in survival analyses.

Details

Title
Generalized ComBat harmonization methods for radiomic features with multi-modal distributions and multiple batch effects
Author
Horng, Hannah 1 ; Singh, Apurva 2 ; Yousefi Bardia 2 ; Cohen, Eric A 2 ; Haghighi Babak 2 ; Katz, Sharyn 2 ; Noël, Peter B 3 ; Shinohara, Russell T 4 ; Kontos Despina 2 

 University of Pennsylvania, Department of Bioengineering, Philadelphia, USA (GRID:grid.25879.31) (ISNI:0000 0004 1936 8972) 
 University of Pennsylvania, Department of Radiology, Center for Biomedical Image Computing and Analysis (CBICA), Philadelphia, USA (GRID:grid.25879.31) (ISNI:0000 0004 1936 8972) 
 University of Pennsylvania, Laboratory for Advanced Computed Tomography Imaging, Department of Radiology, Philadelphia, USA (GRID:grid.25879.31) (ISNI:0000 0004 1936 8972) 
 University of Pennsylvania, Penn Statistics in Imaging and Visualization Endeavor (PennSIVE), Department of Biostatistics, Epidemiology and Informatics, Philadelphia, USA (GRID:grid.25879.31) (ISNI:0000 0004 1936 8972) 
Publication year
2022
Publication date
2022
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2640593994
Copyright
© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.