Full text

Turn on search term navigation

© 2025 Rawlinson et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Application of transcriptomics, proteomics and metabolomics technologies to clinical cohorts has uncovered a variety of signatures for predicting disease. Many of these signatures require the full ‘omics data for evaluation on unseen samples, either explicitly or implicitly through library size normalisation. Translation to low-cost point-of-care tests requires development of signatures which measure as few analytes as possible without relying on direct measurement of library size. To achieve this, we have developed a feature selection method (Forward Selection-Partial Least Squares) which generates minimal disease signatures from high-dimensional omics datasets with applicability to continuous, binary or multi-class outcomes. Through extensive benchmarking, we show that FS-PLS has comparable performance to commonly used signature discovery methods while delivering signatures which are an order of magnitude smaller. We show that FS-PLS can be used to select features predictive of library size, and that these features can be used to normalize unseen samples, meaning that the features in the complete model can be measured in isolation for making new predictions. By enabling discovery of small, high-performance signatures, FS-PLS addresses an important impediment for the further development of precision medical care.

Details

Title
A flexible framework for minimal biomarker signature discovery from clinical omics studies without library size normalisation
Author
Rawlinson, Daniel; Zhou, Chenxi  VIAFID ORCID Logo  ; Kaforou, Myrsini; Kim-Anh Lê Cao; Coin, Lachlan J M  VIAFID ORCID Logo  ; RAPIDS Study Group
First page
e0000780
Section
Research Article
Publication year
2025
Publication date
Mar 2025
Publisher
Public Library of Science
e-ISSN
27673170
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3252726098
Copyright
© 2025 Rawlinson et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.