INTRODUCTION
Bacteria have evolved to colonize nearly every ecosystem on the planet. A key to their survival is the ability to sense and respond to diverse input (1). In prokaryotes, the sensor histidine kinases (HKs) are the primary signaling transduction system involved in environmental sensing (2, 3). Through their sensing domains, HKs can detect chemical and physical stimuli including pH, osmolarity, photons, toxins, proteins, and small molecules (4), and allow cells to react by deploying diverse cellular programs including cell division, biofilm formation, quorum sensing, antibiotic resistance, and virulence (5, 6). HKs are an extremely diverse class of proteins that, outside of a few well-studied archetypes, are still largely uncharacterized (7, 8). These proteins can be identified using their two conserved domains (Pfam: HATPase/PF02518, HisKA/PF00512) and can have zero, one or more sensory domains, which are the “eyes” used to detect fluctuations in their microenvironment (9). Currently, sensory domains are defined with Pfam domains; however, a single Pfam domain family can contain thousands of unique sensors that each respond to different environmental stimuli. For example, the well-characterized “PhoQ-like” family of HK proteins can sense shifts in pH, osmolarity, small antimicrobial peptides, and membrane proteins, suggesting that within this family there is enough sequence dissimilarity to create a diversity of sensor domain repertoire that detects different signals (8).
HK engineering is an effective target for biotechnology, medicine, and ecosystem monitoring. HK proteins have been engineered to be biosensors that accurately monitor stimuli (10), improve titer in bioindustry (5), and identify and trace environmental contamination (11). HKs in pathogenic bacteria regulate virulent secretion systems and antibiotic resistance and therefore are of interest for targeted pharmaceuticals (12). Targeting HKs is an attractive method to precisely block a microbial response without destroying the functionality of the surrounding healthy consortia, as is the typical result of antibiotic treatment (4, 13). For example, deletion of the HK PhoP in
It has been proposed that there is a direct relationship, up to a limit, between the number of sensory proteins and the complexity of the environment (9). For example, parasitic microbes that live in highly constrained and controlled environments tend to have small genomes with small numbers of sensors (9). Evolutionary events like domain shuffling are common in HK proteins, allowing for the rewiring of signaling networks without necessarily adding new sensing domains (15, 16). Prior research suggests a stable microbial state that is defined best by the community’s cumulative attributes; therefore, measurement of genomic indicators, such as the sensory profile, can more accurately represent a community than the more volatile taxonomic abundance (17, 18). Species phylogenetic information has been shown to be correlated with distinct ecosystems (19), to vary according to temporal shifts in their environment (20–22), and to predict functional microbial traits (23, 24). Functional protein domains can “cluster” microorganisms at the macro- (25) or micro-scale (26) by their environmental niche, suggesting that ecosystem prediction using genetic content is possible. For example, recently, a sensory protein index (SPI) metric was defined that was found to correlate loosely with
Recent efforts have been made to utilize “big data” microbial genomics and sequence features derived from metagenomic sampling over coarser taxonomic abundance to understand strain-level genetic variation in different ecosystems (28). For example, a study used clustering of public metagenomes to describe gene distribution in different biospheres and to elucidate gene families that are rare, abundant, or specific to certain habitats (29). Meanwhile, metagenomic analysis surveys have been used to identify new potential pathogens in certain urban areas from airborne samples (30) and to document the site specificity for microbial genetic signatures on the human body from skin samples (31). Recently, the first systematic prediction of ecosystem niche using prokaryotic genomic content was shown for different physical parameter gradients (ρ = 0.7–0.81) (32). The authors suggest this already strong association could be further improved after refining specific genes or functional families since variation in many protein families in biogeochemically distinct environments is minimal (33). Moreover, the sheer data size of genomic models using full metagenomic profiles limits analytical ability and speed.
In the following research, we aimed to see whether an environmental niche’s distinct HK sensor domain profile can serve as a basis for classification. To this end, we clustered sensory domains from 20,712 metagenomes covering a diverse set of ecosystems and taxa to create a sensor catalog. We then used machine learning modeling and feature importance to explore how the new sensor profile sheds insight into how microbes interact, sense, and respond to their environment.
RESULTS
HK sensor identification and clustering from environmentally diverse metagenomes
HK sensor profiles for individual metagenomes were constructed as a substrate for machine learning (ML) approaches to classify ecosystems and predict environmental parameters, discover new sensors, and facilitate a better understanding of the sensory repertoire in different ecosystems. We first identified HK proteins from 20,712 metagenomes spanning over 75 ecosystems, and extracted the amino-acid sequence for each HK’s sensory domain(s) (Fig. 1). Since HK domains are incredibly diverse even within classified domain families like Pfam, we opted to cluster domains at a higher degree of similarity than these standard methods to more precisely represent domains with common function. We hypothesized the use of HK domain clusters would result in a more precise sensory profile and would enhance the sensitivity of our models.
Fig 1
Overview of the data collection, sensory protein extraction, and curation of the matrix for machine learning. Experimental design. We first surveyed 20,712 metagenomes spanning over 75 ecosystems. We identified each metagenome’s HK proteins using two Pfam conserved domains (“CD”) and then extracted just the sensory domain from each protein using Pfam sensor domain annotations. We clustered 21,984,304 total proteins using MMseqs2 with 17,703,911 unique sensory domain sequences, leading to 113,186 clusters that may each respond to unique stimuli. We focused on clusters that were present in at least 100 metagenomes, dropping any rare features, leading to 14,990 final clusters for analysis for our ML models. These sequence clusters were used for machine learning, hierarchical clustering, and ecosystem sensor taxonomy. In the text, “full-length” proteins contain all sensory domains and conserved domains, while “sensor” references just the sensor domain of the HK protein. Both full-length proteins and isolated sensors used for clustering are protein-coding genes.
We next used MMseqs2 to group proteins by their sequence similarity. When performing MMseqs2, we first benchmarked with
Comparison of sensor clusters to Pfam domains
After clustering, we examined the relationship between the MMseqs2 sensor clusters and Pfam domains to determine whether the increased resolution due to finer groups would lead to higher precision. We found that most Pfam domains are associated with thousands of clusters, suggesting one Pfam contains proteins reacting to a huge variety of environmental stimuli (Fig. 2a). We next explored our HK sensory domains more closely and found the 11.5 × 106 full-length histidine kinase proteins, most (7.6 × 106) contained only one annotated sensory domain, although some contain up to 32 distinct sensory domains (Fig. 2b). Upon further investigation, the latter HKs consist of many small repeating transmembrane proteins. Finally, we used hierarchical clustering to better visualize the distribution of clusters across our ecosystems in a heatmap and tree enrichment diagram (Fig. 2c; Fig. S1). We found predictable groupings of similar ecosystems using these techniques, indicating there is structure to the cluster matrix that is interpretable on the ecosystem level. 98.7% of the sensor domain clusters have unknown functions, and only a handful have been extensively characterized, creating a valuable new catalog for both ecosystem and sensor exploration.
Fig 2
Similar sensory-domain clusters can predictably organize ecosystem types by increasing the complexity over Pfam identifiers. (a) The count of sensory clusters associated with each Pfam sensor domain. Many Pfams have a large number of clusters associated with them, for example, pfam13426 (PAS domain) is present in almost 30,000 clusters. The bottom 20 Pfam domains are summed in the final bar of the figure. (b) Shows the number of HK proteins that have a certain number of sensory domains. 43% of proteins have one sensory domain. The graph is split into two ranges to visualize all bars. (c) Heatmap (log2 scale) built using hierarchical clustering. Similar ecosystems group together in the heatmap, indicated with black bars, confirming that the information in the sensor profile matrix lends a predictable structure to the data set. For example, the
Landscape of ecosystem diversity using MMseqs2 domain clusters
Prior research suggests HK sensor diversity scales with ecosystem complexity and that constant environments require a minimal number of sensing proteins (9). We explored the diversity of sensor clusters across the ecosystems in our data set using two calculations. First, Relative Ecosystem Richness (RER) is the fraction of all unique clusters found in an ecosystem. Second, Ecosystem Typical Sample Richness (ESTR) is the fraction of an ecosystem’s clusters found in the average sample from that ecosystem. Together, these two measurements show the sensor space covered by an ecosystem and how much average samples within an ecosystem cover its sensor space (Fig. 3a). By the nature of the calculations, these two measures are expected to be anticorrelated because a sample is less likely to cover all clusters if it is from an ecosystem with a large number of unique clusters. However, we observe that the ecosystems with high RER and low ETSR are more spatially and environmentally heterogeneous, which may indicate their class labeling is too broad and are labeling physically distinct biome types. The degree of spatiotemporal variation in a single geographic region may also drive defined microbial successions and therefore require distinct functional profiles (34). This can lead to high RER due to higher diversity required and lower ESTR because different samples are caught at different moments in space time. Finally, some of these ecosystems have been measured hundreds of times and others only a few. In Fig. 3, the ecosystems with the highest RER and lowest ESTR have labels most likely to be affected by all three of these effects, namely label specificity, degree of spatiotemporal variation, and ecosystem observation count. For example, the label “Plant:Rhizosphere” might group together microbial communities that are proximal and distal to plant roots from many cropping conditions. Indeed, this class contains 981 metagenomes, from 179 different geographic locations, and some studies only contain a single sample. This reduces the chance of highly conserved features for this type of biome and may make it difficult to unambiguously predict a sample’s class from its profiles because the variance uncertainty is so large.
Fig 3
Relative Ecosystem Richness, Ecosystem Typical Sample Richness, and Sensor Fraction using Mmseqs2 clusters (a) Scatter plot of the ecosystem sensor Relative Ecosystem Richness (the proportion of total gene cluster diversity found within that ecosystem) and Ecosystem Typical Sample Richness (the proportion of the ecosystem’s gene cluster diversity typically found in a single sample). The size of each point is proportional to the number of samples taken from that ecosystem, indicating the extent of sampling. Generally, host-associated ecosystems display lower Relative Ecosystem Richness, with the exception of
We also calculated the sensor percentage, the percentage of all proteins in a metagenome that are sensors (Fig. 3b).
A gradient boosting on decision tree regression model can predict ecosystem physical parameters using sensory profiles
Our central hypothesis is that sensory domains are predictive of ecosystem class. The definition of class, however, is imperfect as real ecosystems are not cleanly separable by discrete ontological labels. For example, where two types of environments “interface” such as water and land at the littoral zone there is a continuum of change from water to dry land. Furthermore, even within a seemingly homogeneous class like “Sediment,” there may be variation over parameters such as pH or temperature among others. Thus, we used ML classifiers or regressors to make broad predictions of class or precise predictions in parameter variation within a class, respectively. In both cases, we used the Shapley Additive exPlanations (SHAP) (35, 36) to rank our features most important predictors, corresponding to the specific sensor domain clusters.
As a first demonstration, we show that domain profiles can predict levels of continuous environmental parameters within an ecosystem class. That is, they can act as biosensors for these physical signals. We selected metagenomes from the
A gradient-boosting decision tree classification model can accurately classify ecosystems using sensory profiles
Although the regression models indicate the sensory profile can predict characteristics with a reasonable R2 score, few ecological subsets of metagenomes in our data set have enough members labeled with a quantitative physical parameter such that regression is possible. Moreover, we noticed that the
The confusion matrix for classification can be used to diagnose which ecosystems are predicted well or poorly by the model and for those predicted poorly which ecosystem they are misassigned to. We found that most mispredicted ecosystems are in
Annotations of feature-important sensor clusters help interpret physical differences among ecosystems
The top two most predictive features in our ecosystem label classifier are
Important domain clusters identify sensors for oxygen and disease status in human tissues
To examine in detail the power and limitations of sensor profiles in interpreting the differences among ecosystems, we chose to more deeply explore the
A t-SNE plot derived from a matrix of these domain clusters from relevant metagenomes shows clear separable groupings of the metagenomes from different tissue classes based on these domain cluster profiles. An initial t-SNE indicated that the sensory matrix can create clear groupings along the human tissues (
Fig 4
Feature importance of sensory clusters explain differences between gut and mouth. (a) t-SNE plot for all metagenomes in
Oxygen sensors can distinguish gut and oral metagenomes
It is logical that oxygen would be a differentiator between the gut, which typically consists of facultative and obligate anaerobes and the aerobic mouth. Indeed, high values of
A biofilm-associated domain is predictive of health-associated bacteria in the mouth, and other human-associated ecosystems
In the
Indicators for disease state and conditions in the
In our analysis, the
Hierarchical clustering revealed the
Fig 5
Disease classes can be predicted from the sensor profile to make meaningful insights. (a) Hierarchical clustering across normal and disease conditions in
Encouraged by these results, we next looked at feature importance and initially noticed multiple high-rank features have similar patterns between classes, indicating those sensors were stable across patients. However, certain clusters diverged from normal. For example, we found the absence of
Meanwhile, we found
The cluster containing the QseC domain also stood out in the metagenome collection, since this HK is a recognized identifier for enterohemorrhagic
DISCUSSION
Extensive research has used metagenomic functional and genomic diversity to explore ecosystem conditions, but few systematic models have been presented for ecosystem classification. Efforts have been made to use functional profiles to predict physical parameter gradients in an ecosystem (44, 45). Tools have been created to predict the flow of organisms or functions from one environment to another, for example, the package SourceTracker is a Bayesian approach that can predict environmental contamination from metagenomic profiles (46). However, SourceTracker is most useful when there is a known physical relationship for dispersal among the systems being studied. In our method, we are not considering such relationships although there is evidence for dispersal in environments which are confused in our predictions. However, to utilize the SourceTracker tool, community sources and sinks must be considered and standardized, and for our multi-environment data set, there is not an obvious or consistent way to define sources and sinks. A second typically disregarded but important limitation for predictions that use the full taxonomic or functional profile is the sheer size of data sets. Our method does not require taxonomic prediction, and the data set used for analysis consists of a greatly reduced matrix with counts of clustered protein families.
In this research, we have found that sensor-abundance profiles can lead to accurate ecosystem classification and prediction of physical parameters. We found domain clusters are a powerful predictor of classes and physical parameters in diverse environments and can be used to identify the critical signals that adapt microbial communities to different niches. Well-annotated sensor clusters can identify possibly critical environmental resources or stressors that most distinguish an ecosystem and its state from others. If a domain cluster is localized phylogenetically, it can indicate critical taxa for the exploitation of these environmental stressors. We also found the sensor profile was precise enough to cluster and identify rare HKs. For example, we found a cluster annotated to be
We found in the human oral and gut ecosystems results of our models showed consistency with other data from the field. For example, it has been well documented that initial infant gut colonizers are aerobic (49), and indeed, we found high feature importance for the VicK (
We find in this research that the
Once a sensor profile is defined in an ecosystem, it can be used to monitor deviations from a “normal” healthy state. Our CatBoost regression model results indicate the sensor profile can be used to predict physical parameters, and there is therefore great potential for HK research in
The selection of HK sensory domains has allowed us to directly interpret environments in the context of what stimuli organisms that inhabit those environments respond to. We propose that our selection was also fortuitous in terms of ecosystem label predictive capability. Gene content in certain taxa can be highly variable, and many genes carried in a genome are not specifically necessary for survival or operation in a given environment. Prior work suggests that most gene families are extremely rare or unique after analyzing the gene distribution and UniGene richness of clustered genomes (29) and that most bacterial species that have inhabited the Earth are extinct (61). Therefore, species composition, taxonomy, and arbitrary gene function may not be exceptionally predictive of the environment in which they are found. Instead, by focusing on gene classes expected to be enriched in functions necessary for survival, such as sensors, we are more likely to obtain predictive features. Although the inclusion of additional features such as taxonomy or sequence domains might lead to improved model performance, the inclusion of additional features would decrease the model interpretability and lead to a competitive selection of sensory domain compared to other features. We demonstrate that feature importance from classification models is a convenient tool to determine the most impactful sensors in ecosystems or disease states, and we believe this methodology can be applied to similar research especially when a model is built using features with biological interpretability. Our results indicate the microbial sensor profile has the potential to be applied to an array of tasks from ecosystem adaptation and management, medicinal diagnostics, and pharmaceutical targets, for selecting and designing new biosensors in the industry. Since 98.7% of the HKs we use in our analysis are uncharacterized, this can cause difficulties in initial interpretability; however, feature importance allows us to prioritize HKs that are most predictive and discriminatory among different environments. These prioritized sensing domains are critical targets for more in-depth characterization to understand the importance of the signals they are sensing. This work provides a microbial biosensor resource to the scientific community and explores practical applications of sensory domains in future research.
MATERIALS AND METHODS
Identification and extraction of HK sensory domains from IMG using Pfam domains
Our focus for this research was on HKs, the most abundant signaling protein in prokaryotes, and their sensory domains. The Pfam database (62), the best-in-class collection of protein families represented using hidden Markov models, was used to identify relevant proteins and their annotations. Specifically, we downloaded all recorded HK proteins from the Interpro website (63, 64) by custom query for the HK IPR conserved domains (IPR003594, IPR003661). The IPR/Pfam annotation names for every sensory domain contained in these HK proteins were extracted, and we added descriptive features for each domain
The Integrated Microbial Genomes & Microecosystems system (IMG/M) (65) contains an immense wealth of metagenomic data, pulled from sources such as the Department of Energy’s Joint Genome Institute, external scientific studies, and The National Center for Biotechnology Information (NCBI) public sequence archives (65). IMG metagenomes are linked to the Genomes Online Database (GOLD) environmental classification framework (66), which provides ecosystem labels for three classes:
MMseqs2 clustering of HK sensory proteins
MMseqs2 (Many-against-Many searching) (68) is a software to cluster huge sequence sets into groups of similar sequences, reaching the same sensitivity as PSI-BLAST (37) but magnitudes faster. It is used to maintain key databases like UniProtKB (69). We compared the clusters of full-length HK proteins to individual (extracted) sensory domains but found clustering of full-length proteins led to a dramatic increase in the number of final protein clusters, impacting computational analysis feasibility (Table S2). For this and biologically motivated reasons, we clustered extracted sensory domains only. We also varied MMseqs2 parameters for coverage mode and fraction to obtain the best clusters for analysis (Table S2), along with MMseqs2 methods for taxonomic profiling and protein annotation outside of PSI-BLAST (Fig. S5). In the end, we clustered 21,984,304 proteins with 17,703,911 unique sensory domain sequences, leading to 113,186 clusters. From these, we obtained ~226,000 annotations with 1842 accessions, which after manual data cleaning led to 310 well-characterized HK proteins.
Histidine kinase annotation with BLASTp and MMseqs2
Annotation of the histidine kinase sensors was desired to assist analysis of the data and focus our attention on specific clusters. To annotate histidine kinase present in metagenomes, we used the MMseqs2 search function to map sensory sequences to Uniprot IDs, using the Uniref50 database (69) and the full-length HK protein sequence. We created a custom script to query Uniprot for name, description, and other identifiers from the MMseqs2 provided Uniprot IDs. From an initial list of proteins, we obtained ~226,000 annotations with 1842 accessions, which after manual data cleaning led to 310 well-characterized HK proteins. Finally, BLASTp was used to verify search results from MMseqs2 for sequences studied in the final results. Where referenced, BLASTp was used to annotate specific sensory domain amino acid sequences, using the default “nr” parameters (results obtained 06/2022-09/2022).
Ecosystem and sample richness
To provide a broad view of HK domain cluster diversity across and within ecosystems, we calculate two measures. First, Relative Ecosystem Richness is the total of all unique clusters in a given ecosystem divided by the total unique clusters across all ecosystems. This metric shows how much each ecosystem contributes to the global cluster diversity. Ecosystems with high values contribute more to the observed global diversity. Second, Ecosystem Typical Sample Richness is the average number of unique gene clusters in the samples from a given ecosystem divided by the total unique clusters of that ecosystem. This metric is a rough measure of how representative a typical sample is of the cluster diversity within the ecosystem. Ecosystems with high values capture a larger proportion of the cluster diversity in that ecosystem.
CatBoost input matrix preparation
Prominent machine learning algorithms include LightGBM (Light Gradient Boosted Machine) (70), XGBoost (eXtreme Gradient Boosting) (71), CatBoost (Category Boosting) (72), TabNet (73), and various packages in the scikit-learn package such as RandomForest (74). Initially, we tested our data set using TabNet and CatBoost. CatBoost uses gradient boosting to build decision trees, and it is considered a faster and slightly more accurate method than similar packages LightGBM and XGBoost (75–77). We found CatBoost improved greatly upon TabNet in initial testing (data not shown) and we proceeded to use this package exclusively since a comparison between different machine learning tools has been performed elsewhere and was not the focus of this work. In this work, CatBoost was used for three separate analyses in both regression and classification tasks. First, we built a regression model with the
To construct the model training matrix, we selected sensor clusters present in at least 100 metagenomes to limit input data sparsity for CatBoost (14,990 final clusters). For the ecosystem-classifier model, to have enough training samples we selected ecosystems with >30 metagenomes (71 GOLD ecosystems). We also removed “Unclassified” ecosystems from training as these were deemed too ambiguous for comprehensive analysis (66 ecosystems). Our final matrix has 14,990 columns for sensor clusters, and 20,712 rows for metagenomes. The matrix values correspond to the metagenomic sensor abundance, calculated by summing the total cluster’s gene copy number, and dividing by a total protein gene copy number (Fig. 1). The analyses underlying Fig. 1 to 3 in this work use the full 113,876 clusters for an accurate picture of the sensor landscape, but we used the filtered 14,990 with CatBoost to limit matrix sparsity and improve model accuracy. For the disease state classifier, we used conditions with >14 metagenomes and all
CatBoost learning methods for ecosystem and disease-state classification tasks
CatBoost version 1.0.6 was used for all models, and scikit-learn (78) was used for data set splitting and to generate the confusion matrix and F1 score. Shapley is a popular feature importance package, using a polynomial time algorithm to compute optimal explanations based on game theory. SHAP v0.41.0 with TreeExplainer was used (35). Models were trained on the National Energy Research Scientific Computing Center (NERSC) Perlmutter GPU using the “MultiClass” loss function and “Accuracy” custom metric. We used grid search, a standard optimization technique, to tune hyperparameters such as learning rate, tree-depth, and L2 leaf regularization. We found feature selection to have a little or negative impact on model accuracy. Additional details about grid search parameters and feature selection methods are provided in Fig. S4/github.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2023 Park et al. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
ABSTRACT
Microbial communities have evolved to colonize all ecosystems of the planet, from the deep sea to the human gut. Microbes survive by sensing, responding, and adapting to immediate environmental cues. This process is driven by signal transduction proteins such as histidine kinases, which use their sensing domains to bind or otherwise detect environmental cues and “transduce” signals to adjust internal processes. We hypothesized that an ecosystem’s unique stimuli leave a sensor “fingerprint,” able to identify and shed insight on ecosystem conditions. To test this, we collected 20,712 publicly available metagenomes from
IMPORTANCE
Microbes infect, colonize, and proliferate due to their ability to sense and respond quickly to their surroundings. In this research, we extract the sensory proteins from a diverse range of environmental, engineered, and host-associated metagenomes. We trained machine learning classifiers using sensors as features such that it is possible to predict the ecosystem for a metagenome from its sensor profile. We use the optimized model’s feature importance to identify the most impactful and predictive sensors in different environments. We next use the sensor profile from human gut metagenomes to classify their disease states and explore which sensors can explain differences between diseases. The sensors most predictive of environmental labels here, most of which correspond to uncharacterized proteins, are a useful starting point for the discovery of important environment signals and the development of possible diagnostic interventions.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer