Full text

Turn on search term navigation

© 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Cheap and ubiquitous sensing has made collecting large agricultural datasets relatively straightforward. These large datasets (for instance, citizen science data curation platforms like iNaturalist) can pave the way for developing powerful artificial intelligence (AI) models for detection and counting. However, traditional supervised learning methods require labeled data, and manual annotation of these raw datasets with useful labels (such as bounding boxes or segmentation masks) can be extremely laborious, expensive, and error‐prone. In this paper, we demonstrate the power of zero‐shot computer vision methods—a new family of approaches that require (almost) no manual supervision—for plant phenomics applications. Focusing on insect detection as the primary use case, we show that our models enable highly accurate detection of insects in a variety of challenging imaging environments. Our technical contributions are two‐fold: (a) We curate the Insecta rank class of iNaturalist to form a new benchmark dataset of approximately 6 million images consisting of 2526 agriculturally and ecologically important species, including pests and beneficial insects. (b) Using a vision‐language object detection method coupled with weak language supervision, we are able to automatically annotate images in this dataset with bounding box information localizing the insect within each image. Our method succeeds in detecting diverse insect species present in a wide variety of backgrounds, producing high‐quality bounding boxes in a zero‐shot manner with no additional training cost. This open dataset can serve as a use‐inspired benchmark for the AI community. We demonstrate that our method can also be used for other applications in plant phenomics, such as fruit detection in images of strawberry and apple trees. Overall, our framework highlights the promise of zero‐shot approaches to make high‐throughput plant phenotyping more affordable.

Details

Title
Zero‐shot insect detection via weak language supervision
Author
Feuer, Benjamin 1 ; Joshi, Ameya 1 ; Cho, Minsu 1 ; Chiranjeevi, Shivani 2 ; Deng, Zi Kang 3 ; Balu, Aditya 2 ; Singh, Asheesh K. 2   VIAFID ORCID Logo  ; Sarkar, Soumik 2 ; Merchant, Nirav 3 ; Singh, Arti 2   VIAFID ORCID Logo  ; Ganapathysubramanian, Baskar 2   VIAFID ORCID Logo  ; Hegde, Chinmay 1   VIAFID ORCID Logo 

 Department of Electrical and Computer Engineering, New York University, New York, New York, USA 
 Translational AI Center, Iowa State University, Ames, Iowa, USA 
 Data Science Institute, University of Arizona, Tucson, Arizona, USA 
Section
SPECIAL SECTION: AFFORDABLE PHENOMICS
Publication year
2024
Publication date
Dec 1, 2024
Publisher
John Wiley & Sons, Inc.
e-ISSN
25782703
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3149131349
Copyright
© 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.