Full text

Turn on search term navigation

© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Cocoa is a multi-billion-dollar industry but research on improving yields through pollination remains limited. New embedded hardware and AI-based data analysis is advancing information on cocoa flower visitors, their identity and implications for yields. We present the first cocoa flower visitor dataset containing 5,792 images of Ceratopogonidae, Formicidae, Aphididae, Araneae, and Encyrtidae, and 1,082 background cocoa flower images. This dataset was curated from 23 million images collected over two years by embedded cameras in cocoa plantations in Hainan province, China. We exemplify the use of the dataset with different sizes of YOLOv8 models and by progressively increasing the background image ratio in the training set to identify the best-performing model. The medium-sized YOLOv8 model achieved the best results with 8% background images (F1 Score of 0.71, mAP50 of 0.70). Overall, this dataset is useful to compare the performance of deep learning model architectures on images with low contrast images and difficult detection targets. The data can support future efforts to advance sustainable cocoa production through pollination monitoring projects.

Details

Title
Identifying Cocoa Flower Visitors: A Deep Learning Dataset
Author
Xu, Wenxiu 1 ; Barzegar, Saba Ghorbani 2 ; Sheng, Dong 1   VIAFID ORCID Logo  ; Toledo-Hernández, Manuel 3 ; Lan, ZhenZhong 4 ; Wanger, Thomas Cherico 5   VIAFID ORCID Logo 

 College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, China (ROR: https://ror.org/00a2xv884) (GRID: grid.13402.34) (ISNI: 0000 0004 1759 700X); Sustainable Agricultural Systems & Engineering Laboratory, School of Engineering, Westlake University, Hangzhou, China (ROR: https://ror.org/05hfa4n20) (GRID: grid.494629.4) (ISNI: 0000 0004 8008 9315) 
 Sustainable Agricultural Systems & Engineering Laboratory, School of Engineering, Westlake University, Hangzhou, China (ROR: https://ror.org/05hfa4n20) (GRID: grid.494629.4) (ISNI: 0000 0004 8008 9315) 
 Sustainable Agricultural Systems & Engineering Laboratory, School of Engineering, Westlake University, Hangzhou, China (ROR: https://ror.org/05hfa4n20) (GRID: grid.494629.4) (ISNI: 0000 0004 8008 9315); Sustainable Development Department, Instituto Tecnológico Vale, Belém, Brazil (ROR: https://ror.org/05wnasr61) (GRID: grid.512416.5) (ISNI: 0000 0004 4670 7802) 
 School of Engineering, Westlake University, Hangzhou, China (ROR: https://ror.org/05hfa4n20) (GRID: grid.494629.4) (ISNI: 0000 0004 8008 9315) 
 Sustainable Agricultural Systems & Engineering Laboratory, School of Engineering, Westlake University, Hangzhou, China (ROR: https://ror.org/05hfa4n20) (GRID: grid.494629.4) (ISNI: 0000 0004 8008 9315); Key Laboratory of Coastal Environment and Resources of Zhejiang Province, Westlake University, Hangzhou, China (ROR: https://ror.org/05hfa4n20) (GRID: grid.494629.4) (ISNI: 0000 0004 8008 9315); Production Technology & Cropping Systems Group, Department of Plant Production, AgroScope, Nyon, Switzerland (ROR: https://ror.org/04d8ztx87) (GRID: grid.417771.3) (ISNI: 0000 0004 4681 910X) 
Pages
1309
Section
Data Descriptor
Publication year
2025
Publication date
2025
Publisher
Nature Publishing Group
e-ISSN
20524463
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3234112793
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.