Abstract

Biomedical image analysis algorithm validation depends on high-quality annotation of reference datasets, for which labelling instructions are key. Despite their importance, their optimization remains largely unexplored. Here we present a systematic study of labelling instructions and their impact on annotation quality in the field. Through comprehensive examination of professional practice and international competitions registered at the Medical Image Computing and Computer Assisted Intervention Society, the largest international society in the biomedical imaging field, we uncovered a discrepancy between annotators’ needs for labelling instructions and their current quality and availability. On the basis of an analysis of 14,040 images annotated by 156 annotators from four professional annotation companies and 708 Amazon Mechanical Turk crowdworkers using instructions with different information density levels, we further found that including exemplary images substantially boosts annotation performance compared with text-only descriptions, while solely extending text descriptions does not. Finally, professional annotators consistently outperform Amazon Mechanical Turk crowdworkers. Our study raises awareness of the need for quality standards in biomedical image analysis labelling instructions.

High-quality annotation of datasets is critical for machine-learning-based biomedical image analysis. However, a detailed examination of recent image competitions reveals a gap between annotators’ needs and the quality of labelling instructions. Annotator performance can also be substantially improved by providing exemplary images.

Details

Title
Labelling instructions matter in biomedical image analysis
Author
Rädsch, Tim 1; Reinke, Annika 2; Weru, Vivienn 3; Tizabi, Minu D. 4; Schreck, Nicholas 5; Kavur, A. Emre 6; Pekdemir, Bünyamin 7; Roß, Tobias 8; Kopp-Schneider, Annette 5; Maier-Hein, Lena 9

1 German Cancer Research Center (DKFZ), Division of Intelligent Medical Systems, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584); German Cancer Research Center (DKFZ), Helmholtz Imaging, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584)
2 German Cancer Research Center (DKFZ), Division of Intelligent Medical Systems, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584); German Cancer Research Center (DKFZ), Helmholtz Imaging, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584); Heidelberg University, Faculty of Mathematics and Computer Science, Heidelberg, Germany (GRID:grid.7700.0) (ISNI:0000 0001 2190 4373)
3 German Cancer Research Center (DKFZ), Division of Biostatistics, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584); National Center for Tumor Diseases (NCT), Heidelberg, Germany (GRID:grid.461742.2) (ISNI:0000 0000 8855 0365)
4 German Cancer Research Center (DKFZ), Division of Intelligent Medical Systems, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584); National Center for Tumor Diseases (NCT), Heidelberg, Germany (GRID:grid.461742.2) (ISNI:0000 0000 8855 0365)
5 German Cancer Research Center (DKFZ), Division of Biostatistics, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584)
6 German Cancer Research Center (DKFZ), Division of Intelligent Medical Systems, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584); German Cancer Research Center (DKFZ), Helmholtz Imaging, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584); German Cancer Research Center (DKFZ), Division of Medical Image Computing, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584)
7 Helmholtz Zentrum München, Helmholtz Pioneer Campus, München, Germany (GRID:grid.4567.0) (ISNI:0000 0004 0483 2525)
8 German Cancer Research Center (DKFZ), Division of Intelligent Medical Systems, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584); Quality Match GmbH, Heidelberg, Germany (GRID:grid.7497.d)
9 German Cancer Research Center (DKFZ), Division of Intelligent Medical Systems, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584); German Cancer Research Center (DKFZ), Helmholtz Imaging, Heidelberg, Germany (GRID:grid.7497.d) (ISNI:0000 0004 0492 0584); Heidelberg University, Faculty of Mathematics and Computer Science, Heidelberg, Germany (GRID:grid.7700.0) (ISNI:0000 0001 2190 4373); National Center for Tumor Diseases (NCT), Heidelberg, Germany (GRID:grid.461742.2) (ISNI:0000 0000 8855 0365); Heidelberg University, Medical Faculty, Heidelberg, Germany (GRID:grid.7700.0) (ISNI:0000 0001 2190 4373)
Pages
273-283
Publication year
2023
Publication date
Mar 2023
Publisher
Nature Publishing Group
e-ISSN
2522-5839
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2789608010
Copyright
© The Author(s) 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.