Full Text

Turn on search term navigation

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Background: Machine learning methods for clinical assistance require a large number of annotations from trained experts to achieve optimal performance. Previous work in natural language processing has shown that it is possible to automatically extract annotations from the free-text reports associated with chest X-rays. Methods: This study investigated techniques to extract 49 labels in a hierarchical tree structure from chest X-ray reports written in Danish. The labels were extracted from approximately 550,000 reports by performing multi-class, multi-label classification using a method based on pattern-matching rules, a classic approach in the literature for solving this task. The performance of this method was compared to that of open-source large language models that were pre-trained on Danish data and fine-tuned for classification. Results: Methods developed for English were also applicable to Danish and achieved similar performance (a weighted F1 score of 0.778 on 49 findings). A small set of expert annotations was sufficient to achieve competitive results, even with an unbalanced dataset. Conclusions: Natural language processing techniques provide a promising alternative to human expert annotation when annotations of chest X-ray reports are needed. Large language models can outperform traditional pattern-matching methods.

Details

Title
Effective Machine Learning Techniques for Non-English Radiology Report Classification: A Danish Case Study
Author
Schiavone, Alice 1   VIAFID ORCID Logo  ; Pehrson, Lea Marie 2   VIAFID ORCID Logo  ; Ingala, Silvia 3   VIAFID ORCID Logo  ; Bonnevie, Rasmus 4   VIAFID ORCID Logo  ; Fraccaro, Marco 4   VIAFID ORCID Logo  ; Li, Dana 5   VIAFID ORCID Logo  ; Michael Bachmann Nielsen 2   VIAFID ORCID Logo  ; Elliott, Desmond 1   VIAFID ORCID Logo 

 Department of Computer Science, University of Copenhagen, 2100 Copenhagen, Denmark 
 Department of Computer Science, University of Copenhagen, 2100 Copenhagen, Denmark; Department of Diagnostic Radiology, Copenhagen University Hospital Rigshospitalet, 2100 Copenhagen, Denmark; Department of Clinical Medicine, University of Copenhagen, 2100 Copenhagen, Denmark 
 Department of Diagnostic Radiology, Copenhagen University Hospital Rigshospitalet, 2100 Copenhagen, Denmark; Cerebriu A/S, 1434 Copenhagen, Denmark 
 Unumed Aps, 1055 Copenhagen, Denmark 
 Department of Diagnostic Radiology, Copenhagen University Hospital Rigshospitalet, 2100 Copenhagen, Denmark; Department of Clinical Medicine, University of Copenhagen, 2100 Copenhagen, Denmark 
First page
37
Publication year
2025
Publication date
2025
Publisher
MDPI AG
e-ISSN
26732688
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3170854905
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.