Content area

Abstract

Data augmentation is a central component of joint embedding self-supervised learning (SSL). Approaches that work for natural images may not always be effective in medical imaging tasks. This study systematically investigated the impact of data augmentation and preprocessing strategies in SSL for lung ultrasound. Three data augmentation pipelines were assessed: (1) a baseline pipeline commonly used across imaging domains, (2) a novel semantic-preserving pipeline designed for ultrasound, and (3) a distilled set of the most effective transformations from both pipelines. Pretrained models were evaluated on multiple classification tasks: B-line detection, pleural effusion detection, and COVID-19 classification. Experiments revealed that semantics-preserving data augmentation resulted in the greatest performance for COVID-19 classification—a diagnostic task requiring global image context. Cropping-based methods yielded the greatest performance on the B-line and pleural effusion object classification tasks, which require strong local pattern recognition. Lastly, semantics-preserving ultrasound image preprocessing resulted in increased downstream performance for multiple tasks. Guidance regarding data augmentation and preprocessing strategies was synthesized for developers working with SSL in ultrasound.

Details

1009240
Title
The Efficacy of Semantics-Preserving Transformations in Self-Supervised Learning for Medical Ultrasound
Author
Blake, VanBerlo 1   VIAFID ORCID Logo  ; Hoey, Jesse 1   VIAFID ORCID Logo  ; Wong, Alexander 2 ; Arntfield, Robert 3   VIAFID ORCID Logo 

 David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, ON N2L 3G1, Canada 
 Systems Design Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada 
 Schulich School of Medicine and Dentistry, Western University, London, ON N6A 3K7, Canada 
Publication title
Volume
12
Issue
8
First page
855
Number of pages
35
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
23065354
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-08-08
Milestone dates
2025-06-17 (Received); 2025-08-04 (Accepted)
Publication history
 
 
   First posting date
08 Aug 2025
ProQuest document ID
3243983881
Document URL
https://www.proquest.com/scholarly-journals/efficacy-semantics-preserving-transformations/docview/3243983881/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-08-27
Database
2 databases
  • Coronavirus Research Database
  • ProQuest One Academic