CV.eDNA: A hybrid approach to invertebrate

Abstract

Automated invertebrate classification using computer vision has shown significant potential to improve specimen processing efficiency. However, challenges such as invertebrate diversity and morphological similarity among taxa can make it difficult to infer fine-scale taxonomic classifications using computer vision. As a result, many invertebrate computer vision models are forced to make classifications at coarser levels, such as at family or order. Here we propose a novel modular method to combine computer vision and bulk DNA metabarcoding specimen processing pipelines to improve the accuracy and taxonomic granularity of individual specimen classifications. To improve specimen classification accuracy, our methods use multimodal fusion models that combine image data with DNA-based assemblage data. To refine the taxonomic granularity of the models classifications, our methods cross-references the classifications with DNA metabarcoding detections from bulk samples. We demonstrated these methods using a continental-scale, invertebrate bycatch dataset collected by the National Ecological Observatory Network. We also introduce the CV.eDNA R package, which aims to assist practitioners looking to implement our methods. Using our methods, we reached a classification accuracy of 79.6% across the 17 taxa using real DNA assemblage data, and 83.6% when the assemblage data was error-free, resulting in a 2.2% and 6.2% increase in accuracy when compared to a model trained using only images. After cross-referencing with the DNA metabarcoding detections, we improved taxonomic granularity in up to 72.2% of classifications, with up to 5.7% reaching species-level. By providing computer vision models with coincident DNA assemblage data, and refining individual classifications using DNA metabarcoding detections, our methods the potential to greatly expand the capabilities of biological computer vision classifiers. Our methods allow computer vision classifiers to infer taxonomically fine-grained classifications when it would otherwise be difficult or impossible due to challenges of morphologic similarity or data scarcity. These methods are not limited to terrestrial invertebrates and could be applied in any instance where image and DNA metabarcoding data are concurrently collected.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

* This version of the manuscript has been revised to reflect changes to the associated GitHub repository, including the creation of the CV.eDNA R package.

* https://github.com/Jarrett-Blair/CV-DNA-Hybrid

Details

Title

CV.eDNA: A hybrid approach to invertebrate biomonitoring using computer vision and DNA metabarcoding

Author

Blair, Jarrett D; Weiser, Michael D; Siler, Cameron D; Kaspari, Michael; Smith, Sierra N; Mclaughlin, J F; Marshall, Katie Elizabeth

University/institution

Cold Spring Harbor Laboratory Press

Section

New Results

Publication year

2025

Publication date

Feb 10, 2025

Publisher

Cold Spring Harbor Laboratory Press

ISSN

2692-8205

Source type

Working Paper

Language of publication

English

DOI

https://doi.org/10.1101/2024.09.02.610558

ProQuest document ID

3165216412

Full text outside of ProQuest

https://www.biorxiv.org/content/10.1101/2024.09.02.610558v3

© 2025. This article is published under http://creativecommons.org/licenses/by/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

CV.eDNA: A hybrid approach to invertebrate biomonitoring using computer vision and DNA metabarcoding

Jump to:

Abstract

Details

Suggested sources