Abstract

The increased prevalence of childhood obesity is expected to translate in the near future into a concomitant soaring of multiple cardio-metabolic diseases. Obesity has a complex, multifactorial etiology, that includes multiple and multidomain potential risk factors: genetics, dietary and physical activity habits, socio-economic environment, lifestyle, etc. In addition, all these factors are expected to exert their influence through a specific and especially convoluted way during childhood, given the fast growth along this period. Machine Learning methods are the appropriate tools to model this complexity, given their ability to cope with high-dimensional, non-linear data. Here, we have analyzed by Machine Learning a sample of 221 children (6–9 years) from Madrid, Spain. Both Random Forest and Gradient Boosting Machine models have been derived to predict the body mass index from a wide set of 190 multidomain variables (including age, sex, genetic polymorphisms, lifestyle, socio-economic, diet, exercise, and gestation ones). A consensus relative importance of the predictors has been estimated through variable importance measures, implemented robustly through an iterative process that included permutation and multiple imputation. We expect this analysis will help to shed light on the most important variables associated to childhood obesity, in order to choose better treatments for its prevention.

Details

Title
Ranking of a wide multidomain set of predictor variables of children obesity by machine learning variable importance techniques
Author
Marcos-Pasero, Helena 1 ; Colmenarejo Gonzalo 2 ; Aguilar-Aguilar, Elena 3 ; Ramírez de Molina Ana 4 ; Reglero Guillermo 5 ; Loria-Kohen Viviana 6 

 GENYAL Platform IMDEA-Food Institute, CEI UAM+CSIC, Nutrition and Clinical Trials Unit, Madrid, Spain 
 IMDEA-Food Institute, CEI UAM+CSIC, Biostatistics and Bioinformatics Unit, Madrid, Spain (GRID:grid.482878.9) (ISNI:0000 0004 0500 5302) 
 GENYAL Platform IMDEA-Food Institute, CEI UAM+CSIC, Nutrition and Clinical Trials Unit, Madrid, Spain (GRID:grid.482878.9) 
 IMDEA-Food Institute, CEI UAM+CSIC, Molecular Oncology and Nutritional Genomics of Cancer, Madrid, Spain (GRID:grid.482878.9) (ISNI:0000 0004 0500 5302) 
 IMDEA-Food Institute, CEI UAM+CSIC, Production and Development of Foods for Health, Madrid, Spain (GRID:grid.482878.9) (ISNI:0000 0004 0500 5302); CEI UAM+CSIC, Department of Production and Characterization of Novel Foods. Institute of Food Science Research (CIAL), Madrid, Spain (GRID:grid.473520.7) (ISNI:0000 0004 0580 7575) 
 GENYAL Platform IMDEA-Food Institute, CEI UAM+CSIC, Nutrition and Clinical Trials Unit, Madrid, Spain (GRID:grid.473520.7) 
Publication year
2021
Publication date
2021
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2479576779
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.