Full text

Turn on search term navigation

This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication: https://creativecommons.org/publicdomain/zero/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Accurately predicting grain yield remains a major challenge in sorghum breeding, particularly across genetically and geographically diverse germplasm. To address this, we applied a phenotype-informed machine learning (PIML) framework to analyze nine phenotypic traits in 179 Ethiopian and Senegalese accessions. Using hierarchical clustering and oversampling with ADASYN, we achieved high classification accuracy (0.99) for phenotypic group assignment. Grain yield prediction was most effective with a Neural Boosted model (NTanH(3)NBoost(8)), achieving a mean R2 of 0.36 and RASE (equivalent to RMSE) of 4.87. Feature importance analysis consistently identified seed weight and germination rate as the strongest predictors of grain yield, while disease resistance traits showed limited predictive value. These findings suggest that early selection based on seed quality traits may provide a practical strategy for improving sorghum yield under field conditions, especially in resource-limited environments.

Details

Title
Seed quality drives grain yield in Ethiopian and Senegalese sorghum: Insights from machine learning
Author
Ahn, Ezekiel  VIAFID ORCID Logo  ; Baek, Insuck; Tukuli, Adama R; Lim, Seunghyun; Hong, Seok Min  VIAFID ORCID Logo  ; Kim, Moon S; Meinhardt, Lyndel W; Park, Sunchung; Magill, Clint
First page
e0329366
Section
Research Article
Publication year
2025
Publication date
Aug 2025
Publisher
Public Library of Science
e-ISSN
19326203
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3239700841
Copyright
This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication: https://creativecommons.org/publicdomain/zero/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.