Introduction
Lipids (fats) are essential and diverse families of molecules that play structural, energy storage and signalling roles. They are connected through complex metabolic pathways, which comprise linked series of enzymatic reactions (several are outside cells,
As the quantity and detail of quantitative lipid data continues to grow it has become considerably more challenging to interpret the complex sets of changes within the lipidome
8
. Relevant biological perturbations usually happen not at the level of a single lipid molecular species in isolation, but as broad sets of changes over entire lipid classes and subclasses
9
. For example, a group of phosphatidylcholines will tend to change together in the same direction, since they are being regulated by the same enzyme isoform(s). This introduces characteristic patterns in the data that can reveal important clues as to the underlying level of genetic and transcriptomic regulation. Consequently, there is a need for software tools to automate and facilitate biosynthetic pathway scale analyses with lipids grouped according to structural motifs. Nguyen
BioPAN is openly available on the LIPID MAPS ® Lipidomics Gateway, at https://lipidmaps.org/biopan/. The tool is designed to allow users to upload and analyse their own data, and example datasets are also provided for evaluation. BioPAN is fully integrated with LipidLynxX, allowing users to convert different naming (short notation) conventions from diverse software inputs to be able to make it readable on BioPAN. Additionally, LipidLynxX will convert lipid results with fatty acid position, double bond location, and stereochemistry into a sum composition of carbon number and double bond equivalents. For example, DAG (16:0/20:1(Z11)), is converted by LipidLynxX to DG 36:1. BioPAN offers a link with LIPID MAPS Structure Database (LMSD) 9 . So, BioPAN aids for automatic integration of lipid metabolism with lipid profiles, finding strong relationships between lipid substrates and lipid products catalysed by active or suppressed enzymes.
Methods
Implementation
The statistical model used in BioPAN was originally described by Nguyen
The BioPAN workflow relies on the calculation of the Z-score, which considers the mean and the standard deviation of the experiment assuming a normally distributed data of lipid subclasses. A probability function (P) is computed and subtracting from one to obtain Q, which in turn is the probability that a Z-score is due to chance. The Z-score is then used to predict whether a particular reaction is significantly (
Operation
BioPAN is implemented as an open access web-based tool. The front end uses HTML, jQuery (v3.5.1) for the interface and the Cytoscape.js library (v3.10.2) 13 to draw and manipulate the results graphs. The server-side code uses PHP (v7.4.11) for request handling and R (v4.0.2) for data processing and statistical analysis. BioPAN runs on all modern web browsers and is available at https://lipidmaps.org/biopan/.
BioPAN allows uploading of lipidomics datasets containing quantitative data from two different experiments or conditions (
The first step in BioPAN is to load a file of quantitative lipidomics data. After uploading, BioPAN uses the LipidLynxX
15
tool to cross-match some lipid names into the LIPID MAPS
® Lipidomics Gateway nomenclature style according to the guidelines from COMP_DB (
https://www.lipidmaps.org/resources/tools/bulk_structure_searches_documentation.php)
9,
14,
16
. BioPAN then classifies the submitted lipids molecular species as either unrecognised, processed, or unprocessed. Unrecognised molecular species are lipids whose subclasses are not included into BioPAN database, while unprocessed species are part of BioPAN, but were not associated with any reactions. The unprocessed category corresponds to lipid molecular species that did not have a matching molecular species, as a reactant and product, with the same number of carbons and double bonds. For example, a substrate molecular species, such as PA 34:2 should have a matching product with the same sum composition (
The second step within BioPAN requires users to associate each sample with a condition. (
Figure 1.
Main results page view for BioPAN.
Pathways are displayed as an interactive graph at the centre of the screen. Users have control over the information presented in the graph using the menus on the left side panel and results tables are displayed at the bottom of the screen.
The bottom panel shows a tabular view of statistically significant (
Users can also choose to have the nodes in the graph aggregated into lipid subclasses, or they can view individual lipid molecular species. by using the level drop down menu. Alternatively, the user can opt to view only reaction chains using the subset of lipid data menu, which contains pathways stored within BioPAN (
An all lipid displayed view can be filtered to display only a user-selected subset of lipids. These can be chosen either from a text box, or using a tree of checkboxes for each subclass, allowing users to focus on particular lipids of interest (see Extended Data, section S-2, Figure S-1). The search box allows the user to search for one or two queries using the logical operators AND / OR. Selecting the AND operator, the user can view lipid subclasses or molecular species for which the name implies both queries. For example, searching for “PC” and “34:0” on the lipid molecular species graph displays lipids “PC(34:0)” and “O-PC(34:0)”. Choosing the OR operator allow the user to visualise lipids whose names includes one of the two searches. For instance, searching “LPC” or “LPA” on the subclasses graph displays lipids “LPC”, “O-LPC” and “LPA”.
Example use case
Lipidomics data from cerebral cortex and liver of young and aged mice
17
were used here as an example to illustrate how to use BioPAN and interpret its results. Ando
The datasets from the original paper, were downloaded as TXT files from Metabolomics Workbench (Project ID PR000713, Studies ID ST001065 and ST001066) and are available as underlying data (see
The lipid network at the lipid subclass level is shown in
Figure 2A for the liver and
Figure 2B for the cerebral cortex. The lipid pathway graph allows the user to quickly spot trends and differences between the two sets of samples.
Figure 2 shows that reactions using ether-linked diacylglycerol (O-DG) as a substrate are suppressed in both liver and cerebral cortex in old versus young, which agrees with the reported accumulation of this particular lipid subclass by Ando
Figure 2.
BioPAN lipid networks.
Lipid network graphs exported from BioPAN for the liver (
A) and the cerebral cortex (
B) of aged mice compared to young mice
17
. Green nodes correspond to active lipids and green shaded arrows to active pathways. Reactions with a positive Z score have green arrows while negative Z scores are coloured purple. Pathways options: aged condition of interest, young control condition, lipid type, active status, subclass level, reaction subset of lipid data,
BioPAN also shows more differences in the metabolism of sphingolipids between the tissues. Specifically, in the liver the pathway is shifted towards the formation of ceramide-1-phosphate (Cer1P), while in the cerebral cortex sphingolipid metabolism points towards the formation of sphingomyelin (SM). Moreover, in the liver, glycerophospholipids and lysophospholipids synthesis is active with exception of LPA, contrary to what it is observed in the cerebral cortex, where glycerophospholipids and lysophospholipids pathways are suppressed, but production of LPA is favoured.
BioPAN provides tables showing the suggested genes known to be involved in each reaction.
Table 1 shows the lipid subclass active reaction results generated for the liver dataset. For example, the pathway PE→PC→PS→LPS designates the genes
Table 1.
BioPAN predicted genes of the lipid active reactions in liver of aged vs young mice.
Table contains the active reactions chains found by BioPAN according to the Z-score values. Several genes are predicted as being involved in the current status of each reaction based on their function. Pathway options: aged condition of interest, young control condition, lipid type, active status, subclass level, reaction subset of lipid data,
lipid subclass active reaction (Aged vs Young) | ||
---|---|---|
Reaction chains | Z-score | Predicted genes |
PE→PC→PS→LPS | 3.978 |
|
DG→PC→PS→LPS | 3.686 |
|
PE→PC→LPC | 3.519 |
|
LPA→PA→PG→LPG | 3.367 |
|
PE→PC→PA→PG→LPG | 3.179 |
|
DG→PC→LPC | 3.161 |
|
PG→LPG | 3.077 |
|
LPA→PA→PS→LPS | 2.898 |
|
DG→PE→LPE | 2.815 |
|
PE→PS | 2.700 |
|
PC→PS→LPS | 2.657 |
|
DG→MG | 2.507 |
|
DG→TG | 2.301 |
|
DG→PC→PA | 2.280 |
|
dhSM→dhCer→Cer→Cer1P | 2.218 |
|
DG→PA | 2.050 |
|
O-PE→O-LPE | 1.942 |
|
LPA→PA→PI | 1.878 |
|
PC→LPC | 1.843 |
|
PE→PC→PA→PI | 1.837 |
|
PS→LPS | 1.668 |
|
PE→PC→CL | 1.667 |
|
Looking at fatty acids (FA) pathways at the molecular species level with BioPAN (
Figure 3) shows the FA network obtained for liver and cerebral cortex of aged in comparison to young mice. The results show active metabolism towards the production of monounsaturated FA 16:1 and polyunsaturated FA 20:4, FA 20:5, and FA 22:4 in the cerebral cortex compared to the liver. Additionally, BioPAN has highlighted accumulation of the long acyl chain, polyunsaturated FA 24:5 in liver, which is associated with the activation of the
Figure 3.
BioPAN fatty acids networks.
FA graphs exported from BioPAN tool for the liver (
A) and the cerebral cortex (
B) of aged mice compared to young mice
17
. Green nodes correspond to active lipids and green shaded arrows to active pathways. Reactions with a positive Z score have green arrows while negative Z scores are coloured purple. Pathways options: aged condition of interest, young control condition, lipid type, active status, subclass level, reaction subset of lipid data,
Table 2.
BioPAN predicted genes for fatty acid active reactions in liver of aged vs young mice.
Table contains the active reaction chains found by BioPAN according to the Z-score values. Several genes are predicted as being involved in the current status of each reaction based on their function. Pathway options: aged condition of interest, young control condition, fatty acid type, active status,
fa active reaction (Aged vs Young) | ||
---|---|---|
Reaction chains | Z-score | Predicted genes |
FA(22:5)→FA(24:5) | 2.108 |
|
FA(18:3)→FA(20:3) | 1.831 |
|
Conclusion
Modern lipidomics techniques generate a huge amount of data and information on many lipid subclasses, making the interpretation of results challenging. BioPAN provides several advantages over simpler analyses of individual lipids. Mapping of the lipid network graph provides a bigger picture allowing a new perspective on the data where new connections can be made, and a study can be moved forward into particular areas of interest. BioPAN offers a quick visualisation of differences in lipid pathways in mammalian studies and suggests genes and their enzyme products involved in those differences. Further work into the BioPAN database will be carried out to incorporate new biosynthetic pathways, such as: cholesterol, eicosanoids, and oxidised lipid subclasses, among others. Addition of glucosyl and galactosyl ceramides (Glc/Gal-Cer) to complement to the existing sphingolipids pathway in BioPAN will also be considered.
Data availability
Underlying data
The datasets analysed as an example are available at the NIH Common Fund's National Metabolomics Data Repository (NMDR) website, the Metabolomics Workbench, https://www.metabolomicsworkbench.org/ where it has been assigned Project ID PR000713 (studies ST001065 and ST001066).
The data can be accessed directly via its Project DOI: https://dx.doi.org/10.21228/M8HM48, https://www.metabolomicsworkbench.org/data/DRCCMetadata.php?Mode=SetupDownloadResults&StudyID=ST001065 and https://www.metabolomicsworkbench.org/data/DRCCMetadata.php?Mode=SetupDownloadResults&StudyID=ST001066.
Open Science Framework: BioPAN: a web-based tool to explore mammalian lipidome metabolic pathways on LIPID MAPS, https://doi.org/10.17605/OSF.IO/PKD6B 18 .
This project contains the modified dataset from Ando
Cerebral cortex dataset: https://osf.io/reavk/
Liver dataset: https://osf.io/zd4as/
Extended data
Open Science Framework: BioPAN: a web-based tool to explore mammalian lipidome metabolic pathways on LIPID MAPS, https://doi.org/10.17605/OSF.IO/PKD6B 18 .
This project contains the following extended data:
Table S1, reaction table.
Section S2, filtering and tree checkbox discussion.
Figure S1, Filter tool example for acyl chains 38:4 and 38:5.
Software availability
Software, documentation and tutorial video are available from: https://lipidmaps.org/biopan/.
Archived source code at the time of the publication: https://doi.org/10.17605/OSF.IO/PKD6B 18 .
The user can try BioPAN locally by following the readme instruction file located in: https://osf.io/7g2dx/.
License: GNU GPL 3.0
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright: © 2021 Gaud C et al. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Lipidomics increasingly describes the quantification using mass spectrometry of all lipids present in a biological sample. As the power of lipidomics protocols increase, thousands of lipid molecular species from multiple categories can now be profiled in a single experiment. Observed changes due to biological differences often encompass large numbers of structurally-related lipids, with these being regulated by enzymes from well-known metabolic pathways. As lipidomics datasets increase in complexity, the interpretation of their results becomes more challenging. BioPAN addresses this by enabling the researcher to visualise quantitative lipidomics data in the context of known biosynthetic pathways. BioPAN provides a list of genes, which could be involved in the activation or suppression of enzymes catalysing lipid metabolism in mammalian tissues.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer