Full text

Turn on search term navigation

© 2021 Wieder et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Over-representation analysis (ORA) is one of the commonest pathway analysis approaches used for the functional interpretation of metabolomics datasets. Despite the widespread use of ORA in metabolomics, the community lacks guidelines detailing its best-practice use. Many factors have a pronounced impact on the results, but to date their effects have received little systematic attention. Using five publicly available datasets, we demonstrated that changes in parameters such as the background set, differential metabolite selection methods, and pathway database used can result in profoundly different ORA results. The use of a non-assay-specific background set, for example, resulted in large numbers of false-positive pathways. Pathway database choice, evaluated using three of the most popular metabolic pathway databases (KEGG, Reactome, and BioCyc), led to vastly different results in both the number and function of significantly enriched pathways. Factors that are specific to metabolomics data, such as the reliability of compound identification and the chemical bias of different analytical platforms also impacted ORA results. Simulated metabolite misidentification rates as low as 4% resulted in both gain of false-positive pathways and loss of truly significant pathways across all datasets. Our results have several practical implications for ORA users, as well as those using alternative pathway analysis methods. We offer a set of recommendations for the use of ORA in metabolomics, alongside a set of minimal reporting guidelines, as a first step towards the standardisation of pathway analysis in metabolomics.

Details

Title
Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis
Author
Wieder, Cecilia  VIAFID ORCID Logo  ; Frainay, Clément  VIAFID ORCID Logo  ; Poupin, Nathalie  VIAFID ORCID Logo  ; Rodríguez-Mier, Pablo  VIAFID ORCID Logo  ; Vinson, Florence  VIAFID ORCID Logo  ; Cooke, Juliette  VIAFID ORCID Logo  ; Lai, Rachel PJ  VIAFID ORCID Logo  ; Bundy, Jacob G  VIAFID ORCID Logo  ; Jourdan, Fabien  VIAFID ORCID Logo  ; Ebbels, Timothy
First page
e1009105
Section
Research Article
Publication year
2021
Publication date
Sep 2021
Publisher
Public Library of Science
ISSN
1553734X
e-ISSN
15537358
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2582586824
Copyright
© 2021 Wieder et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.