Abstract

Next generation sequencing of cell-free DNA (cfDNA) is a promising method for treatment monitoring and therapy selection in metastatic breast cancer (MBC). However, distinguishing tumor-specific variants from sequencing artefacts and germline variation with low false discovery rate is challenging when using large targeted sequencing panels covering many tumor suppressor genes. To address this, we built a machine learning model to remove false positive variant calls and augmented it with additional filters to ensure selection of tumor-derived variants. We used cfDNA of 70 MBC patients profiled with both the small targeted Oncomine breast panel (Thermofisher) and the much larger Qiaseq Human Breast Cancer Panel (Qiagen). The model was trained on the panels’ common regions using Oncomine hotspot mutations as ground truth. Applied to Qiaseq data, it achieved 35% sensitivity and 36% precision, outperforming basic filtering. For 20 patients we used germline DNA to filter for somatic variants and obtained 245 variants in total, while our model found seven variants, of which six were also detected using the germline strategy. In ten tumor-free individuals, our method detected in total one (potentially germline) variant, in contrast to 521 variants detected without our model. These results indicate that our model largely detects somatic variants.

Details

Title
Machine learning-based somatic variant calling in cell-free DNA of metastatic breast cancer patients using large NGS panels
Author
Jongbloed, Elisabeth M. 1 ; Jansen, Maurice P. H. M. 1 ; de Weerd, Vanja 1 ; Helmijr, Jean A. 1 ; Beaufort, Corine M. 1 ; Reinders, Marcel J. T. 2 ; van Marion, Ronald 3 ; van IJcken, Wilfred F. J. 4 ; Sonke, Gabe S. 5 ; Konings, Inge R. 6 ; Jager, Agnes 1 ; Martens, John W. M. 1 ; Wilting, Saskia M. 1 ; Makrodimitris, Stavros 7 

 Erasmus University Medical Center, Department of Medical Oncology, Erasmus MC Cancer Institute, Rotterdam, The Netherlands (GRID:grid.5645.2) (ISNI:000000040459992X) 
 Delft University of Technology, Delft Bioinformatics Lab, Delft, The Netherlands (GRID:grid.5292.c) (ISNI:0000 0001 2097 4740) 
 Erasmus MC Cancer Institute, Department of Pathology, Rotterdam, The Netherlands (GRID:grid.508717.c) (ISNI:0000 0004 0637 3764) 
 Erasmus University Medical Center Rotterdam, Erasmus Center for Biomics, Rotterdam, The Netherlands (GRID:grid.5645.2) (ISNI:000000040459992X) 
 Netherlands Cancer Institute, Department of Medical Oncology, Amsterdam, The Netherlands (GRID:grid.430814.a) (ISNI:0000 0001 0674 1393) 
 Amsterdam UMC, location Vrije Universiteit Amsterdam, Department of Medical Oncology, Amsterdam, The Netherlands (GRID:grid.509540.d) (ISNI:0000 0004 6880 3010) 
 Erasmus University Medical Center, Department of Medical Oncology, Erasmus MC Cancer Institute, Rotterdam, The Netherlands (GRID:grid.5645.2) (ISNI:000000040459992X); Delft University of Technology, Delft Bioinformatics Lab, Delft, The Netherlands (GRID:grid.5292.c) (ISNI:0000 0001 2097 4740) 
Pages
10424
Publication year
2023
Publication date
2023
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2830012364
Copyright
© The Author(s) 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.