Full text

Turn on search term navigation

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Background: Training of machine learning algorithms on dish images collected in other countries requires possible sources of systematic discrepancies, including country-specific food composition databases (FCDBs), to be tackled. The US Nutrition5k project provides for ~5000 dish images and related dish- and ingredient-level information on mass, energy, and macronutrients from the US FCDB. The aim of this study is to (1) identify challenges/solutions in linking the nutritional composition of Italian foods with food images from Nutrition5k and (2) assess potential differences in nutrient content estimated across the Italian and US FCDBs and their determinants. Methods: After food matching, expert data curation, and handling of missing values, dish-level ingredients from Nutrition5k were integrated with the Italian-FCDB-specific nutritional composition (86 components); dish-specific nutrient content was calculated by summing the corresponding ingredient-specific nutritional values. Measures of agreement/difference were calculated between Italian- and US-FCDB-specific content of energy and macronutrients. Potential determinants of identified differences were investigated with multiple robust regression models. Results: Dishes showed a median mass of 145 g and included three ingredients in median. Energy, proteins, fats, and carbohydrates showed moderate-to-strong agreement between Italian- and US-FCDB-specific content; carbohydrates showed the worst performance, with the Italian FCDB providing smaller median values (median raw difference between the Italian and US FCDBs: −2.10 g). Regression models on dishes suggested a role for mass, number of ingredients, and presence of recreated recipes, alone or jointly with differential use of raw/cooked ingredients across the two FCDBs. Conclusions: In the era of machine learning approaches for food image recognition, manual data curation in the alignment of FCDBs is worth the effort.

Details

Title
Tailoring the Nutritional Composition of Italian Foods to the US Nutrition5k Dataset for Food Image Recognition: Challenges and a Comparative Analysis
Author
Bianco, Rachele 1   VIAFID ORCID Logo  ; Marinoni, Michela 2 ; Coluccia, Sergio 2   VIAFID ORCID Logo  ; Carioni, Giulia 3   VIAFID ORCID Logo  ; Fiori, Federica 1   VIAFID ORCID Logo  ; Gnagnarella, Patrizia 4   VIAFID ORCID Logo  ; Edefonti, Valeria 5   VIAFID ORCID Logo  ; Parpinel, Maria 1   VIAFID ORCID Logo 

 Department of Medicine—DMED, Università degli Studi di Udine, 33100 Udine, Italy; [email protected] (R.B.); [email protected] (G.C.); [email protected] (F.F.); [email protected] (M.P.) 
 Branch of Medical Statistics, Biometry and Epidemiology “G. A. Maccacaro”, Department of Clinical Sciences and Community Health, Dipartimento di Eccellenza 2023–2027, Università degli Studi di Milano, 20133 Milan, Italy; [email protected] (M.M.); [email protected] (S.C.) 
 Department of Medicine—DMED, Università degli Studi di Udine, 33100 Udine, Italy; [email protected] (R.B.); [email protected] (G.C.); [email protected] (F.F.); [email protected] (M.P.); Division of Epidemiology and Biostatistics, European Institute of Oncology, IRCCS, 20141 Milan, Italy; [email protected] 
 Division of Epidemiology and Biostatistics, European Institute of Oncology, IRCCS, 20141 Milan, Italy; [email protected] 
 Branch of Medical Statistics, Biometry and Epidemiology “G. A. Maccacaro”, Department of Clinical Sciences and Community Health, Dipartimento di Eccellenza 2023–2027, Università degli Studi di Milano, 20133 Milan, Italy; [email protected] (M.M.); [email protected] (S.C.); Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, 20122 Milan, Italy 
First page
3339
Publication year
2024
Publication date
2024
Publisher
MDPI AG
e-ISSN
20726643
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3116690453
Copyright
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.