Abstract

Fully automated machine learning (AutoML) for predictive modeling is becoming a reality, giving rise to a whole new field. We present the basic ideas and principles of Just Add Data Bio (JADBio), an AutoML platform applicable to the low-sample, high-dimensional omics data that arise in translational medicine and bioinformatics applications. In addition to predictive and diagnostic models ready for clinical use, JADBio focuses on knowledge discovery by performing feature selection and identifying the corresponding biosignatures, i.e., minimal-size subsets of biomarkers that are jointly predictive of the outcome or phenotype of interest. It also returns a palette of useful information for interpretation, clinical use of the models, and decision making. JADBio is qualitatively and quantitatively compared against Hyper-Parameter Optimization Machine Learning libraries. Results show that in typical omics dataset analysis, JADBio manages to identify signatures comprising of just a handful of features while maintaining competitive predictive performance and accurate out-of-sample performance estimation.

Details

Title
Just Add Data: automated predictive modeling for knowledge discovery and feature selection
Author
Tsamardinos, Ioannis 1   VIAFID ORCID Logo  ; Charonyktakis, Paulos 2 ; Papoutsoglou, Georgios 3   VIAFID ORCID Logo  ; Borboudakis, Giorgos 2   VIAFID ORCID Logo  ; Lakiotaki, Kleanthi 4 ; Zenklusen, Jean Claude 5 ; Juhl, Hartmut 6 ; Chatzaki, Ekaterini 7   VIAFID ORCID Logo  ; Lagani, Vincenzo 8   VIAFID ORCID Logo 

 JADBio Gnosis DA S.A., Science and Technology Park of Crete, Heraklion, Greece (GRID:grid.511969.3); University of Crete, Department of Computer Science, Heraklion, Greece (GRID:grid.8127.c) (ISNI:0000 0004 0576 3437); Institute of Applied and Computational Mathematics, Foundation for Research and Technology, Hellas, Heraklion, Greece (GRID:grid.511961.b) 
 JADBio Gnosis DA S.A., Science and Technology Park of Crete, Heraklion, Greece (GRID:grid.511969.3) 
 JADBio Gnosis DA S.A., Science and Technology Park of Crete, Heraklion, Greece (GRID:grid.511969.3); University of Crete, Department of Computer Science, Heraklion, Greece (GRID:grid.8127.c) (ISNI:0000 0004 0576 3437) 
 University of Crete, Department of Computer Science, Heraklion, Greece (GRID:grid.8127.c) (ISNI:0000 0004 0576 3437) 
 National Institutes of Health, National Cancer Institute, Bethesda, USA (GRID:grid.94365.3d) (ISNI:0000 0001 2297 5165) 
 Chief Executive Officer, Indivumed Group, Hamburg, Germany (GRID:grid.94365.3d) 
 Democritus University of Thrace, Laboratory of Pharmacology, Medical School, Alexandroupolis, Greece (GRID:grid.12284.3d) (ISNI:0000 0001 2170 8022); Hellenic Mediterranean University Research Centre, Institute of Agri-food and Life Sciences, Crete, Greece (GRID:grid.419879.a) (ISNI:0000 0004 0393 8299) 
 JADBio Gnosis DA S.A., Science and Technology Park of Crete, Heraklion, Greece (GRID:grid.511969.3); Ilia State University, Institute of Chemical Biology, Tbilisi, Georgia (GRID:grid.428923.6) (ISNI:0000 0000 9489 2441) 
Publication year
2022
Publication date
2022
Publisher
Nature Publishing Group
ISSN
2397768X
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2677227112
Copyright
© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.