INTRODUCTION
Shipworms (family Teredinidae) are bivalve mollusks found throughout the world’s oceans (1, 2). Many shipworms eat wood, assisted by cellulases from the intracellular symbiotic gammaproteobacteria that inhabit their gills (Fig. 1) (3–6). Other shipworms use sulfide metabolism, also relying on gill-dwelling gammaproteobacteria for sulfur oxidation (7). Shipworm gill symbionts of several different species are thus essential to shipworm nutrition and survival. One of the most remarkable features of the shipworm system is that wood digestion does not take place where the bacteria are located, such that the bacterial cellulase products are transferred from the gill to a nearly sterile cecum (8), where wood digestion occurs (Fig. 1) (9). This enables the host shipworms to directly consume glucose and other sugars derived from wood lignocellulose rather than from the less-energetic-fermentation by-products of cellulolytic gut microbes as found in other symbioses. Shipworm symbionts are also essential for the nitrogen fixation that helps to offset the low nitrogen content of wood (10, 11). Thus, shipworms have evolved structures and mechanisms enabling bacterial metabolism that supports animal host nutrition.
FIG 1
(Top) Diagram of generic shipworm anatomy. Insets are from Fig. 2, panels B and D, in Betcher et al. (8). Bars, 20 μm. Red, signal from a fluorescent universal bacterial probe, indicating large numbers of bacterial symbionts in the bacteriocytes of the gill and paucity of bacteria in the cecum; green, background fluorescence. (Bottom) Collection locations of specimens included in this study. See Table S1 for details.
While the bacteria in many nutritional symbioses are difficult to cultivate, shipworm gill symbiotic gammaproteobacteria have been brought into stable culture (5, 12, 13). This led to the discovery that these bacteria are exceptional sources of secondary metabolites (14). Of bacteria with sequenced genomes, the gill symbiont
An early analysis of the
While the potential of
To address those issues, we used comparative metagenomics, selecting six species of wood-eating shipworms (
RESULTS AND DISCUSSION
Sequencing data.
Most of the genomes and metagenomes are described here for the first time, or, in a few cases, previously reported genomes/metagenomes were resequenced/reassembled/reanalyzed (see Materials and Methods). Two bacterial genomes of
Mapping gill metagenomes to cultivated bacteria.
A phylogenetic tree created from the 16S rRNA genes of the cultivated bacteria (Fig. 2A; see also Fig. S1 in the supplemental material) revealed that the strains were all gammaproteobacteria. Of these, the two sulfur oxidizers were members of the order Chromatiales (Thiosocius and allies) whereas the remaining 21 were strains of cellulolytic bacteria that were members of the order Cellvibrionales, including 11 strains of
FIG 2
Cultivated bacterial isolates represent the major shipworm gill symbionts. (A) Isolated bacteria analyzed in this study are shown in abstracted schematic of a 16S rRNA phylogenetic tree. The complete tree with accurate branch lengths and bootstrap numbers is shown in Fig. S1.
Further, whole-genome-based average nucleotide identity (gANI) measurements reinforce the 16S rRNA-based phylogenetic tree of sequenced strains (Fig. 2 and 3; see also Fig. S1 and Table S2 in the supplemental material). Previously proposed cutoffs for bacterial species differentiation suggest that bacterial strains with gANI values of ≥0.95 are conspecific, although several well-known species have lower gANI values (29). The concatenated
FIG 3
Heat map of relationships between symbiont isolate genomes and gill metagenome bins. The scale bar is shaded according to identity on the basis of calculated values (AF × gANI). Color bars in the phylogenetic tree indicate bacterial species identity, either in the metagenomes or in the genome, and they are identical to the codes shown in Fig. 2. This figure indicates the high degree of certainty that the cultivated isolates are the same species as the major bacteria present in the gill.
The bacteria living in gills were grouped into bins that represent individual species of bacteria (Fig. 2B). For example, in Kuphus spp., >95% of bacterial reads could be mapped to cultivated isolate strain
TABLE 1
Example gANI values for shipworm gills in comparison to sequenced isolates, extracted from Table S2 data
Comparison | Total metagenomic bin size (bp) | gANI |
---|---|---|
Kuphus spp. to T. teredicincola 2141T | 23,758,169 | 0.963839 |
D. mannii and B. thoracites to 2753L | 26,758,239 | 0.990273 |
D. mannii to 2719K | 14,895,102 | 0.992012 |
B. setacea to BSC2 | 19,687,153 | 0.974108 |
Teredo sp. to 1162T | 1,489,154 | 0.984036 |
B. setacea to BS08 | 3,513,639 | 0.995137 |
Similarly, Cellvibrionaceae strain 2753L was mapped to 20 bins in
In other cases, either because we had multiple strains representing a species (as in the case of
In sum, these data demonstrate conclusively that the cultivated isolates obtained from shipworm gills accurately represent the strains found within shipworms. The data suggest that the isolates are the same species as the naturally occurring symbionts in the animals and that in many cases their DNA sequences are >99% identical at the whole-genome level. More than 85% of the DNA in each specimen’s gill metagenome is represented by a cultivated isolate in our collection (with the exception of one specimen), and the remaining <15% of the DNA belongs to multiple, low-abundance species, most of which are not reproducibly found among isolates from different shipworm specimens. Further, the shipworm literature focuses on the readily cultivable species
Strain variation increases genetic diversity of shipworm microbiota.
Metagenome binning showed that each gill contained 1 to 3 major bacterial species. Since we had data representing deep sequencing of the major metagenomic bacterial species, we expected to provide complete assemblies. Previously, using similarly deep data, we had obtained relatively complete assemblies or even assembled whole bacterial genomes from metagenomes (30). Here, however, our metagenome bin N50 values were only in the very low thousands.
Investigating why the assembly was difficult, we noted that we often obtained very similar contigs with different copy numbers. For example, a single metagenome bin containing Bsc2-like contigs is shown in Table S3. The pairwise identities between contigs were between 93% and 98% in DNA sequence, indicating that these bins were comprised of mixtures of very closely related bacteria. We saw a very similar phenomenon in a recent investigation of
In the Kuphus study, we cut the DNA gyrase B gene into 50-bp segments and aligned single reads to each 50-bp segment (7). By quantifying the reads for each observed single nucleotide polymorphism (SNP), we confirmed that the gill symbiont species consisted of several strains, and we quantified their relative abundances. Here, we investigated the major strains found in the remaining shipworm species using the same method and show four representative examples in Fig. S3. This analysis showed that similar levels of strain variation represent a widespread phenomenon in shipworm gills and that such variation is not just restricted to
Discovery and analysis of BGCs.
We compared secondary metabolic pathways in the isolates and in the animal specimens. An inventory of the BGC content performed using antiSMASH (31) revealed a large number of BGCs: 431 BGCs were identified in the 23 cultivated isolates alone. Because raw antiSMASH output includes many hypothetical or poorly characterized BGCs, we focused on well-characterized classes of secondary metabolic proteins and pathways: polyketide synthases (PKSs), nonribosomal peptide synthetases (NRPSs), siderophores, terpenes, homoserine lactones, and thiopeptides. Using these criteria, we identified 168 BGCs from 23 cultivated isolates and 401 BGCs from 22 shipworm gill metagenomes (Fig. 4). Because the genomes of cultivated isolates were well assembled, we could discern and analyze entire BGCs. In contrast, gill metagenomes had smaller contigs such that the BGCs were fragmented.
FIG 4
Most BGCs found in the metagenomes and in the bacterial isolate genomes are shared. A total of 401 BGCs from metagenome sequences were compared to the bacterial isolate genomes, 305 of which were found in isolates. Conversely, 148 of 168 BGCs from sequenced bacterial isolates were found in the metagenomes. The shared numbers likely differ because the contigs assembled from the metagenome sequences were shorter on average such that several metagenome fragments can map to a single BGC in an isolate.
The BGCs identified in this study originated nearly universally from the cellulolytic Cellvibrionales strains, with very few BGCs found in the sulfide-oxidizing strains of Chromatiales. We found only five BGCs that were similar to previously identified clusters outside shipworms, based upon >70% of genes conserved in antiSMASH. The remainder appeared to be unknown or uncharacterized BGCs. In turn, the new BGCs are likely to represent new compounds, while the characterized BGCs represent those corresponding to previously identified compounds. In addition, it is possible that some of the new BGCs may represent known compounds for which biosynthetic pathways have not yet been discovered.
To facilitate comparison between metagenomes, we grouped all 569 BGCs into 122 gene cluster families (GCFs), where each GCF was comprised of closely related BGCs (32, 33) (Fig. 5; see also Table S4). BGCs grouped into a single GCF are highly likely to encode the production of identical or closely related secondary metabolites.
FIG 5
GCFs found in (A) bacterial genomes and (B) gill metagenomes. (A) A list of strains of cultivated bacterial genomes is provided in the x axis, while the number of total GCFs in different sequenced strains is shown in the y axis. Colors indicate bacteria as described for Fig. 2A. Because there were 11 isolates of
Some important BGCs were excluded using our method. One of these is worth describing briefly, since it illustrates a major limitation of the methods that we describe and is potentially important for symbiosis. In the genome of Chromatiales strain 2719K, we discovered a gene cluster for tabtoxin (34, 35) or a related compound (Fig. 6). This cluster does not contain common PKS/NRPS elements and thus is not one of the GCFs shown in Fig. 5, 7, or 8. A key biosynthetic gene in the tabtoxin-like cluster was pseudogenous in strain 2719K, but the
FIG 6
A possible tabtoxin pathway is found in the
FIG 7
GCF distribution across shipworm species. Shown is a similarity network diagram, in which circles indicate individual BGCs from sequenced isolates (gray) and gill metagenomes (colors indicate species of origin; see legend). Lines indicate the MultiGeneBlast scores from comparisons between identified BGCs, with thinner lines indicating a lower degree of similarity. For example, the cluster labeled “GCF_8” encodes the pathway for the siderophore turnerbactin, the structure of which is shown at the right. The main cluster, circled by a light blue box, includes BGCs that are very similar to the originally described turnerbactin gene cluster. More distantly related BGCs, with fewer lines connecting them to the majority nodes in GCF_8, might represent other siderophores. The GCF_11 data likely all represent tartrolon D/E, a boronated polyketide shown at the right. For detailed alignments of BGCs, see Fig. S4.
FIG 8
Integration of tBLASTn and networking analyses reveals the pattern of occurrence of GCFs in isolates and metagenomes. Here, we show only the most commonly occurring GCFs. The values in each box indicate the number of BGC occurrences per specimen for each GCF (see Fig. S5 for details). When the number equals 1, then the BGC is found in all specimens of that species. When the number is less than 1, it then indicates the fraction of specimens in which the pathway is found. A number greater than 1 is specific to GCF_3, for which two different types are possible (see Fig. 8). In that case, there were two different classes of GCF_3 in two
Comparison of isolate and gill BGCs.
Of 401 BGCs identified in the metagenomes, 305 had close relatives in cultivated isolates, indicating that ∼75% of BGCs in the metagenomes are covered in our sequenced culture collection (Fig. 4). Conversely, among 168 isolate BGCs, 148 (90%) were found in the metagenomes. Thus, sequencing of additional cultivated isolates in our strain collections would likely yield additional novel BGCs. Since the 11
It is difficult to quantify BGCs in metagenomes, which usually contain relatively small contigs. Since the BGC classes analyzed were >10 kbp in length, each BGC was usually represented by multiple, short contigs, which are not easily linked. Here, we had an advantage in that the cultivated isolates accurately represented the gill metagenomes; thus, we were able to map the identified metagenomic contigs to the assembled BGCs found in cultivated isolates.
Using this mapping, we accurately estimated the number of unique BGCs in the gill symbiont community. For example, 305 metagenome BGCs were found to be synonymous with 148 isolate BGCs, indicating that the metagenome BGC count can be estimated to be approximately double the actual number of BGCs. To verify this estimate, we selected GCFs 2, 3, 5, and 8, aligning the metagenomic contigs against the BGCs from cultivated isolates (Fig. S4). In the metagenomes, of the total of 401 BGCs identified, 100 were members of these four GCFs, but some of them were just fragments of the full-length BGCs found in cultivated isolates. When the 100 metagenomic BGCs were aligned to their congeners in cultivated isolates, they could be collapsed into 46 unique BGCs. Thus, using two different approaches, we were able to estimate that the 401 metagenomic BGCs of all GCFs represented ∼200 actual BGCs in the shipworm gills. To the best of our knowledge, this has not been possible for other metagenomes/symbioses and represents a powerful aspect of this system.
Most GCFs detected were unique or nearly so, occurring in only one or two of the examined strains (Fig. 7 and 8). Only 8 GCFs were found to be distributed in 10 or more isolates, and these mostly represented pathways that are universal or nearly universal in
We used MultiGeneBlast (32) output to construct a GCF similarity network (Fig. 7), which provides an easily interpretable diagram of how GCFs are distributed among bacteria. However, a notable shortcoming was observed. In a long-term drug discovery campaign, we have found the tartrolon BGC in nearly all
To remedy this problem, we obtained GCFs from cultivated isolates and searched them against metagenome contigs using tBLASTn (Fig. 8). This provided an orthogonal view of secondary metabolism in shipworms, revealing the presence of the tartrolon pathway, as well as of other pathways that do not assemble well in metagenomes because of characteristics such as repetitive DNA sequences. A weakness of this second method is that it does not tell us whether two pathways are related closely enough to encode the production of similar compounds. Thus, these two methods provide different insights into BGCs in shipworm gills.
The high level of similarity of BGCs between cultivated isolates and metagenomes further reinforced the species identities determined by gANI (Table 1). Since secondary metabolism is often one of the most variable genomic features in bacteria, the sharing of multiple pathways between gills and isolates provides further evidence that the isolates are representative of the true symbionts found in gills.
We identified three categories of GCFs: (i) GCFs that are widely shared among shipworm species, (ii) GCFs that were specific to select shipworm and symbiont species, and (iii) GCFs that were distributed among specimens without an obvious relationship to host or symbiont species identity. These pathways are described in the following sections.
(i) Widely shared GCFs. Four pathways (GCF_2, GCF_3, GFC_5, and GCF_8) were prevalent in all wood-eating shipworms, regardless of sample location (Fig. 7 and 8). These GCFs were harbored in the genomes of
The most widely occurring pathway in shipworm gill metagenomes is GCF_3. It was identified in all gill metagenomes with cellulolytic symbionts, including the metagenome of specimen
FIG 9
Three types of GCF_3 gene clusters were found to be distributed in all cellulolytic shipworms in this study. tBLASTx was used to compare the clusters, demonstrating the presence of three closely related GCF_3 gene families in all cellulolytic shipworm gills.
GCF_2 encodes a NRPS/trans-AT PKS pathway, the chemical products of which are unknown. It is found in all shipworm specimens in this study and in all
GCF_5 harbors a combination of terpene cyclase and predicted arylpolyene biosynthetic genes (39). Although the cyclase and surrounding regions have all of the genes necessary to make and export hopanoids, the GCF_5 biosynthetic product is unknown. In addition to occurring in all
GCF_8 is exemplified by the previously described turnerbactin BGC, from
(ii) Bacterial species-specific GCFs. CFs 1, 4, and 11 were found in all
The gill metagenomes of
Brazilian shipworms Bankia sp. and Teredo sp. contain
The
The
(iii) GCFs for which patterns of occurrence are not obviously related to host species identity. Overall, the most abundant pathways in shipworms were identical to those seen in the corresponding cultivated bacterial symbionts (Fig. 7 and 8). Since the pattern of bacterial distribution in shipworm hosts follows host species identity and life habits, the presence of abundant GCFs also follows similar patterns. However, as described above, many pathways were found only once or occurred relatively rarely among symbiont genomes and gill metagenomes. In these cases, trends of host symbiont co-occurrence could not be discerned. This observation is reinforced in Fig. 7, where most GCFs in the diagram occur only once (represented by single, unlinked spots). Thus, while the occurrence of common biosynthetic pathways is evolutionarily conserved among host species and thus likely has a uniquely critical role in the symbiosis, most are not conserved. These observations suggest that a more comprehensive sampling of shipworm specimens, species, and cultivated isolates would yield many additional, unanticipated BGCs.
Variability in conserved shipworm GCFs increases potential compound diversity.
Even among conserved GCFs, variability was observed, as revealed by bulges in the network diagram (Fig. 7). For example, in ubiquitous GCF_3, three different pathway variants are visible. Siderophore pathway GCF_8 contains one central cluster, encoding turnerbactin pathways, and an extended arm that appears to encode compounds related to, but not identical to, turnerbactin.
Conclusions.
In shipworms, cellulolytic bacteria have long been known to specifically inhabit gills and have been hypothesized to cause an evolutionary path that leads to wood specialization in most of the members of the family, along with drastic morphophysiological modifications (1, 5, 44). These symbionts could be cultivated, although we have only recently been able to sample the full spectrum of major symbionts present in gills. The unexpected finding that
Here, we show that cultivated isolates obtained from shipworm gills accurately represent the bacteria living within the gills. They represent the same species and often are nearly identical at the strain level. They contain many of the same BGCs. The gills of shipworms contain about 1 to 3 major species of symbiotic bacteria, along with a small percentage of other, less consistently occurring bacteria. Complicating this relatively simple picture, there is significant strain variation within shipworms. The observed symbiont species mixtures are representative of the animal lifestyles. For example,
The key finding is that the BGCs in the metagenomes are represented in the strains in our culture collection. This is a rare event in the biosynthetic literature. In most other marine systems, it has been very challenging to cultivate the symbiotic bacteria responsible for secondary metabolite production (45). In some organisms, such as humans, there are many representative cultivated isolates that produce secondary metabolites, but connecting those metabolites to human biology, or even to their existence in humans, is quite challenging (16, 46). Here, we have defined an experimentally tractable system to investigate chemical ecology that circumvents these limitations. Our results reveal potentially important chemical interactions that would affect a variety of marine ecosystems and a novel and underexplored source of bioactive metabolites for drug discovery.
It has not escaped our notice that this work provides the foundation for understanding the connections between symbiont community composition, secondary metabolite complement, and host lifestyle and ecology. It has proven difficult to link these factors together in relevant models. The existence of methods for aquaculture and transformation for shipworms and their symbiotic bacteria will enable a rigorous, hypothesis-driven understanding of the role of complex metabolism in symbiosis.
MATERIALS AND METHODS
Collection and processing of biological material.
Shipworm samples (see Table S1 in the supplemental material) were collected from found wood. Briefly, infested wood was collected and transported immediately to the laboratory or stored in the shade until extraction (<1 day). Specimens were carefully extracted using woodworking tools to avoid damage. Extracted specimens were processed immediately or stored in individual containers of filtered seawater at 4°C until processing. Specimens were checked for viability by siphon retraction in response to stimulation and observation of heartbeat and live specimens selected. Specimens were assigned a unique code, photographed, and identified. Specimens were dissected using a dissecting stereoscope. Taxonomic vouchers (valves, pallets, and siphonal tissue for sequencing host phylogenetic markers) were retained and stored in 70% ethanol. The gill was dissected, rinsed with sterile seawater, and divided for bacterial isolation and metagenomic sequencing. Once the gill was dissected, it was processed immediately or flash-frozen in liquid nitrogen.
All collections followed Nagoya Protocol requirements. Brazilian sampling were performed under SISBIO license number 48388, and genetic resources were accessed under the authorization of the Brazilian National System for the Management of Genetic Heritage and Associated Traditional Knowledge (SisGen permit number A2F0DA0).
Among the animals that we obtained in field collections, we analyzed three specimens each of
Bacterial isolation, DNA extraction, and analysis.
Genomic DNA used for whole-genome sequencing of novel isolates and select
Metagenomic DNA extraction.
Gill tissue samples from Philippine shipworm specimens (Table S1B) were flash-frozen in liquid nitrogen and stored at –80°C prior to processing. Bulk gill genomic DNA was purified by the use of a Qiagen blood and tissue genomic DNA kit using the manufacturer’s suggested protocol.
Gill tissue samples from Brazil shipworm specimens were pulverized by flash-freezing in liquid nitrogen and submitted to metagenomic DNA purification by adapting a protocol previously optimized for total DNA extraction from cnidaria tissues (47, 48). Briefly, shipworms gills were carefully dissected (with care taken not to include contamination from other organs), submitted to a series of five washes with 3:1 sterile seawater/distilled water for removal of external contaminants, and macerated until they were powdered in liquid nitrogen. Powdered tissues (∼150 mg) were then transferred to 2-ml microtubes containing 1 ml of lysis buffer (2% [w/v] CTAB [Sigma-Aldrich], 1.4 M NaCl, 20 mM EDTA, 100 mM Tris-HCl [pH 8.0], with freshly added 5 μg proteinase K [vol/vol] [Invitrogen] and 1% 2-mercaptoethanol [Sigma-Aldrich]) and submitted to five freeze-thawing cycles (–80°C to 65°C). Proteins were extracted by washing twice with phenol-chloroform-isoamyl alcohol (25:24:1) and once with chloroform. Metagenomic DNA was precipitated with isopropanol and 5 M ammonium acetate, washed with 70% ethanol, and eluted in TE buffer (10 mM Tris-HCl, 1 mM EDTA). Metagenomic libraries were prepared using a Nextera XT DNA sample preparation kit (Illumina) and sequenced with 600-cycle MiSeq reagent kit chemistry (v3; Illumina) (300-bp paired-end runs) using a MiSeq desktop sequencer.
Metagenome sequencing and assembly.
Five
Identification of bacterial sequences in metagenomic data.
Assembly-assisted binning was used to sort and analyze trimmed reads and the contigs were assembled into clusters putatively representing single genomes using MetaAnnotator beta version (55). Each binned genome was retrieved using SAMtools (version: 1.10) (56, 57). To identify bacterial genomes, genes for each bin were identified with Prodigal (58). Protein sequences for bins with coding density levels of >50% were searched against the NCBI nr database with DIAMOND (v0.9.32) (59). Bins with 60% of the genes hitting the bacterial subject in the nr database were considered to have originated from bacteria.
For the
gANI comparisons and calculation of read counts.
Each bacterial bin was compared to the 23 shipworm isolate genomes using gANI and AF values (63). With a cutoff AF value of >0.5 and gANI value of >0.9, the bacterial bins from each metagenome were mapped to cultivated bacterial genomes and the cultivated bacterial genomes mapped against each other (Table S2). The major but not mapped bins in each genome were classified using gtdb-tk (version 1.1.1) (64). The read counts for each mapped bin were either retrieved from the output of MetaAnnotator (beta version) or calculated using bbwrap.sh (https://sourceforge.net/projects/bbmap/) with the following parameters: kfilter = 22 subfilter = 15 maxindel = 80.
Building BGC similarity networks.
BGCs were predicted from the bacterial contigs of each metagenome and from cultivated bacterial genomes using antiSMASH 4.0 (31) (see Fig. S6 in the supplemental material). From the predictions, only the BGCs for PKSs, NRPSs, siderophores, terpenes, homoserine lactones, and thiopeptides (as well as combinations of these biosynthetic enzyme families) were included in the succeeding analyses. An all-versus-all comparison of these BGCs was performed using MultiGeneBlast (v1.1.14) (32) and a previously reported protocol (65). The bidirectional MultiGeneBlast BGC-to-BGC hits were considered to be reliable. In the metagenome data, some truncated BGCs showed only single-directional correlation to a full-length BGC. Those single-directional hits were refined as follows: protein translations of all coding sequences from the BGCs were compared in an all-versus-all fashion using blastp search. Only those protein hits that had at least 60% identity to and at least 80% coverage of both query and subject were considered to represent valid hits. Single-directional MultiGeneBlast BGC-to-BGC hits were retained if the number of proteins represented at least n − 2 (n is the number of proteins in the truncated BGC) passing the blastp refining. The remaining MultiGeneBlast hits were used to construct a network in Cytoscape (v3.7.0) (66). Finally, each BGC cluster (GCF) that had a relatively low number of bidirectional correlations was manually checked by examining the MultiGeneBlast alignment.
Occurrence of GCFs in metagenomes.
On the basis of the GCFs identified in previous step, the core biosynthetic proteins from each GCF were extracted and queried (NCBI tblastn) against each metagenome assembly. Thresholds of query coverage of >50% and identity of >90% were applied to remove the nonspecific hits, and the remaining hits, in combination with the MultiGeneBlast hits, were used to make the matrix of occurrences of GCFs in metagenomes.
Data availability.
The raw sequencing data are available in GenBank under accession numbers SRX7665675, SRX7665685, SRX7665686, SRX7665676, SRX7665684, SRX7665687, SRX7665688, SRX7665689, SRX7665690, SRX7665691, SRX7665677, SRX7665678, SRX7665679, SRX7665680, SRX7665681, SRX7665682, and SRX7665683 or in JGI (https://img.jgi.doe.gov/cgi-bin/m/main.cgi?) under IMG Genome identifiers 3300000111, 3300000024, 3300000110, 3300000107, 2070309010, 2541046951, 2510917000, 2513237135, 2513237099, 2519899652, 2519899664, 2519899663, 2524614873, 2523533596, 2540341229, 2571042908, 2579779156, 2558309032, 2541046951, 2545555829, 2767802764, 2531839719, 2528768159, 2503982003, 2524614822, 2574179784, 2751185674, 2574179721, and 2751185671. For details, please see Table S1.
b The Marine Science Institute, University of the Philippines Diliman, Quezon City, Philippines
c Department of Medicinal Chemistry, University of Utah, Salt Lake City, Utah, USA
d Bioinformatic and Microbial Ecology Laboratory—BIOME, Federal University of Bahia, Salvador, Bahia, Brazil
e Drug Research and Development Center, Department of Physiology and Pharmacology, Federal University of Ceará, Ceará, Brazil
f Philippine Genome Center, University of the Philippines Diliman, Quezon City, Philippines
g Institute of Marine Science, School of Biological Sciences, University of Portsmouth, Portsmouth, United Kingdom
University of Vienna
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2020 Altamia et al. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
ABSTRACT
Shipworms play critical roles in recycling wood in the sea. Symbiotic bacteria supply enzymes that the organisms need for nutrition and wood degradation. Some of these bacteria have been grown in pure culture and have the capacity to make many secondary metabolites. However, little is known about whether such secondary metabolite pathways are represented in the symbiont communities within their hosts. In addition, little has been reported about the patterns of host-symbiont co-occurrence. Here, we collected shipworms from the United States, the Philippines, and Brazil and cultivated symbiotic bacteria from their gills. We analyzed sequences from 22 shipworm gill metagenomes from seven shipworm species and from 23 cultivated symbiont isolates. Using (meta)genome sequencing, we demonstrate that the cultivated isolates represent all the major bacterial symbiont species and strains in shipworm gills. We show that the bacterial symbionts are distributed among shipworm hosts in consistent, predictable patterns. The symbiotic bacteria harbor many gene cluster families (GCFs) for biosynthesis of bioactive secondary metabolites, only <5% of which match previously described biosynthetic pathways. Because we were able to cultivate the symbionts and to sequence their genomes, we can definitively enumerate the biosynthetic pathways in these symbiont communities, showing that ∼150 of ∼200 total biosynthetic gene clusters (BGCs) present in the animal gill metagenomes are represented in our culture collection. Shipworm symbionts occur in suites that differ predictably across a wide taxonomic and geographic range of host species and collectively constitute an immense resource for the discovery of new biosynthetic pathways corresponding to bioactive secondary metabolites.
IMPORTANCE We define a system in which the major symbionts that are important to host biology and to the production of secondary metabolites can be cultivated. We show that symbiotic bacteria that are critical to host nutrition and lifestyle also have an immense capacity to produce a multitude of diverse and likely novel bioactive secondary metabolites that could lead to the discovery of drugs and that these pathways are found within shipworm gills. We propose that, by shaping associated microbial communities within the host, the compounds support the ability of shipworms to degrade wood in marine environments. Because these symbionts can be cultivated and genetically manipulated, they provide a powerful model for understanding how secondary metabolism impacts microbial symbiosis.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer