Content area

Abstract

Accurate labeling is critical for training reliable deep learning models in agricultural applications. However, manual labeling is often error-prone, especially when performed by non-experts, and such errors (modeled as noise) can significantly degrade model performance. This study addresses the problem of correcting labeling errors in object detection datasets without human intervention. We hypothesize that label noise can be reduced by exploiting the feature space representation of the data, enabling automatic refinement through repeated model-based filtering. To test this, we propose a recursive methodology that employs a YOLOv5 detector to iteratively relabel a dataset of Prunaxx and Paipai tomato images captured in greenhouse environments. The correction process involves training the detector, predicting new labels, and replacing existing labelings over multiple iterations. Experimental results show substantial improvements: the mean Average Precision at an IoU threshold of 0.50 (mAP-50) increased from 0.8 to 0.86, the mean Average Precision across IoU thresholds from 0.50 to 0.95 (mAP-50:95) increased from 0.46 to 0.63, and Recall improved from 0.68 to 0.82. These results demonstrate that the model was able to detect more true positives after filtering, while also achieving more accurate bounding box predictions. Although a slight decrease in Precision was observed in later iterations due to false positives, the overall quality of the dataset improved consistently. In conclusion, the proposed filtering method effectively enhances label quality without manual intervention and offers a scalable solution for improving object detection datasets in precision agriculture.

Details

1009240
Business indexing term
Title
Automatic Correction of Labeling Errors Applied to Tomato Detection
Author
Zamora Suárez Ángel Eduardo 1   VIAFID ORCID Logo  ; Alvarez Hernandez Gerardo Antonio 2   VIAFID ORCID Logo  ; Vasquez, Juan Irving 2   VIAFID ORCID Logo  ; Taud Hind 2   VIAFID ORCID Logo  ; Uriarte-Arcia Abril Valeria 2   VIAFID ORCID Logo  ; Zamora, Erik 3   VIAFID ORCID Logo 

 Unidad Profesional Interdisciplinaria de Biotecnología, Instituto Politécnico Nacional, Av. Acueducto S/N, La Laguna Ticoman, Gustavo A. Madero, Mexico City 07340, Mexico; [email protected] 
 Centro de Innovación y Desarrollo Tecnológico en Cómputo, Instituto Politécnico Nacional, Av. Juan de Dios Bátiz S/N, Nueva Industrial Vallejo, Gustavo A. Madero, Mexico City 07340, Mexico; [email protected] (G.A.A.H.); [email protected] (H.T.); [email protected] (A.V.U.-A.) 
 Centro de Investigación en Computación, Instituto Politécnico Nacional, Av. Juan de Dios Bátiz S/N, Nueva Industrial Vallejo, Gustavo A. Madero, Mexico City 07738, Mexico 
Publication title
Volume
15
Issue
12
First page
1291
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
20770472
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-06-15
Milestone dates
2025-05-09 (Received); 2025-06-11 (Accepted)
Publication history
 
 
   First posting date
15 Jun 2025
ProQuest document ID
3223857869
Document URL
https://www.proquest.com/scholarly-journals/automatic-correction-labeling-errors-applied/docview/3223857869/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-06-25
Database
ProQuest One Academic