Introduction
Phages, viruses that infect bacteria, are key players in the modulation of ecological and evolutionary processes in diverse bacterial communities throughout the Earth’s ecosystems [1]. For example, in marine environments, phage infection is responsible for the daily turnover of 20%-40% of marine bacterial biomass and, in turn, impacts nutrient cycling [2, 3]. Additionally, during phage infection, the bacterial host metabolism is drastically reprogrammed through alterations in transcription, protein production, metabolite production, and the import and export of resources that can have diverse impacts on other organisms within the community [4]. Finally, phages drive evolution by transferring genetic material from one bacterium to another via transduction [2]. This can impact essential metabolic functions like photosynthesis, carbon cycling, and nutrient acquisition [5]. Beyond the oceans, phage-bacteria interactions are increasingly studied across many ecosystem types, including the human gut [6, 7], permafrost [8, 9], fracking [10, 11], glacial ice brines [12, 13], and many other environments [1, 14].
The roles of phages in soils are among the most challenging to elucidate due to soil complexity and the lack of cultivated model systems. However, overcoming these challenges is critical to understanding phage impacts in soils. Soil phages are abundant, with phage concentrations ranging from ∼103 to 109 particles measured per gram dry weight depending on the soil [15]. Particularly at higher concentrations, it is predicted that there are more frequent phage-bacteria encounters in soils that could lead to higher infection rates than in marine systems [16], and these phage infections are predicted to infect key microbes involved in carbon cycling, which could impact the biogeochemical cycles of these microbes [8, 9]. Hence, phages are now increasingly recognized as major players in soil ecosystems [15, 16].
Problematically, however, there is a lack of cultured and experimentally tested soil phage-host systems that are needed to advance the field from ecological studies to mechanistic hypothesis testing. To date, in terrestrial systems, such work has predominantly focused on phages that target plant bacterial pathogens [17–19], even though phages infecting plant growth-promoting rhizobacteria can drastically impact root microbial communities [18, 20–24]. For example, adding phages to natural and mock soil communities has been found to impact nutrient availability [16, 21], bacterial community structure [22–24], and, in turn, plant health.
Klebsiella is an ideal target host for phage model system development as the genus is widespread across soils, freshwater, plants, and humans [25–27]. While 565 Klebsiella phages have already been isolated and sequenced [28], these are almost exclusively derived from studies focused on pathogenic Klebsiella strains infecting encapsulated strains. No studies have examined phages infecting soil-associated, nonencapsulated Klebsiella species. Klebsiella sp. M5al has been a model strain for molecular genetics of N2-fixation, production of 1,3-propanediol and 2,3-butanediol, and has been found to colonize rice roots without causing soft rot disease [29–31]. Here, we isolated 24 phages infecting Klebsiella sp. M5al. These 24 novel, nonredundant phages were genomically sequenced and compared to 565 complete, genome-sequenced Klebsiella phages. Additionally, we examined the host range for these 24 phages by assaying against a panel of Klebsiella and Raoultella isolates to see if phages isolated on a nonencapsulated strain can infect multiple species or genera as previously found for other capsule-independent Klebsiella phages. Together, these data serve as a framework to expand the understanding of phage-Klebsiella interactions within diverse environmental contexts.
Results and discussion
Isolation, genomic characterization, and diversity of phages infecting Klebsiella sp. M5al
To isolate phages infecting Klebsiella sp. M5al, we assembled a collection of 32 samples from soil, water, and sewage around Ohio S1 Table and screened the collection using standard plaque isolation techniques [32] (see Methods). Of the 32 plaquing assays, 14 samples yielded lysis zones, indicating the possible presence of phages. Plaques were isolated and purified from these samples per standard protocol [32], and phage genomes were sequenced. This resulted in 24 novel, nonredundant phages isolated on Klebsiella sp. M5al Table 1. The resultant genome sequences revealed a range of genome sizes (38,140–177,853 base pairs), predicted open reading frames (ORFs) (47–310 ORFs), and GC content (38.93%– 56.69%; Tables 1 and S2).
[Figure omitted. See PDF.]
To assign taxonomies, we compared the complete genomes of our 24 isolated phages to all complete viruses in the NCBI virus database [28]. The closest relative was identified via BLAST [31] for each phage and used to calculate intergenomic similarity S3 Table. Taxon rank was assigned according to current guidelines from the International Committee on the Taxonomy of Viruses (ICTV; see methods; 32). Using these standards, 16 of the 24 phages were assigned to four existing genera spanning three families: Autographiviridae, Drexlerviridae, and Straboviridae Table 1. One phage, KLB26, was unclassified at the family level but had 59% intergenomic similarity to a Drexlerviridae phage S3 Table. To confirm that KLB26 belongs to a novel genus in the Drexlerviridae phage family, we used core genome analyses, gene-sharing network family prediction [33], and phylogenetic comparisons S4 and S5 Tables [32]. Finally, seven phages could not be classified into existing taxa. Using gene-sharing networks [33] and core genome clustering, these phages were found to belong to two novel phage families that we are proposing: Candidatus (Ca.) Mavericviridae (n = 3) and Ca. Rivulusviridae (n = 4).
Occurrence of the isolated phages in soils and rivers
Since these phages were isolated from soils, rivers, and sewage, we aimed to determine whether they occurred in recently available large-scale metagenomic samples from soil or freshwater environments. To this end, we screened our phage genomes against the Global Soil Virus Atlas (GSVA) [34], which contains virus genome fragments identified from 2,953 sequenced soil metagenomes and the Genome Resolved Open Watersheds database (GROWdb) [35], where we identified virus genome fragments from the 163 surface water riverine samples that offer representation of approximately 90% of the U.S. watersheds [35].
Due to privacy restrictions on the part of the underlying GSVA data and the recent availability of the GROWdb dataset, we were unable or hesitant, respectively, to use the original dataset reads for read-mapping-based detection. Instead, we asked whether the genomes of our 24 phages or the genome sequencing reads from our 24 phages shared significant similarities to the uncultivated virus genomes (UViGs) from the GSVA and GROWdb datasets. This revealed that none of our genomes showed significant similarity to the uViGs in either dataset but that at least some minimal number of reads from three of the phage genomes–KLB19 (0.004%), KLB22 (0.02%), and KLB29 (0.05%)–mapped to three sample sites in the GSVA dataset (S6 Table), though notably, these uViGs were less than 10kb and low quality, leading to their exclusion after quality control [34]. Thus, our phage isolates do not appear to represent widespread ecological genotypes, at least in these soil and river datasets.
However, we posit that this limited detection is due to the high diversity of soil viruses, which are often unique to specific environments even when sampled just mere meters apart [34, 36–38]. Indeed, within the GSVA dataset, only 13.9% of (vOTUs) were found in more than one sample, and fewer than 1% were present in more than five samples [34]. This suggests that the soil and river virology community has many new taxa, phenomena, and biological mysteries remaining to be explored.
Genomic diversity of newly isolated phages compared to all known Klebsiella phages
Next, we explored how our newly isolated and sequenced phage genomes compare to previously sequenced Klebsiella dsDNA phage genomes. These phages were predominantly derived from Klebsiella strains that are both capsule producers and human pathogens [39, 40]. As of February 6th, 2023, there were 565 nonredundant dsDNA phages with complete genomes in the NCBI Virus [28]–within the class Caudoviricetes S7 Table. To compare our phages to all known Klebsiella phages, we created a concatenated phylogenetic tree using the large terminase, small terminase, portal protein, and major capsid protein from representatives of the 565 nonredundant phages (defined as having <80% coverage and <95% percent identity to another phage) (Fig 1, see Methods). These analyses revealed that of the 523 representative Klebsiella phage genomes, 441 assort into nine ICTV-recognized phage families [41], and 124 were unclassified at the family level. Most (64.6%) belonged to three phage families [Autographiviridae (n = 210), Drexlerviridae (n = 98), and Straboviridae (n = 57)], with minimal representation (11.1%) from the remaining six families [Dermercviridae (n = 17), Peduoviridae (n = 16), Schitoviridae (n = 10), Ackermannviridae (n = 10), Casjensviridae (n = 9), and Zobellviridae (n = 1)].
[Figure omitted. See PDF.]
Phylogenetic tree of 523, Klebsiella infecting, NCBI representative phage genomes, and the phages isolated in this study. The partitioned tree was constructed using concatenated sequences from the small terminase, large terminase, portal, and major capsid proteins. Phage names are colored according to their assigned phage family, and phages isolated in this study are marked by red stars. UF bootstrap represents the ultra-fast bootstrap values calculated by IQTree. The tree scale refers to the number of amino acid substitutions per site.
With this taxonomic framework from published phages, we next sought to assess where our phages taxonomically belonged. This revealed that the 24 phages belonged to five families, including 17 from the three largest phage families [Autographivridae (n = 3), Drexlerviridae (n = 2), and Straboviridae (n = 12)], and seven that assorted into two novel families as described above, Ca. Mavericviridae (n = 3) and Ca. Rivulusviridae (n = 4) (Fig 1). Given these taxonomic assignments and a diverse phage collection that infects nonencapsulated Klebsiella, we next sought to characterize the host range and genomic content of the phages to assess signatures of niche differentiation within and between these families.
Straboviridae family.
The Straboviridae phage family originally belonged to the Myoviridae phage family until ICTV updates in 2022, which separated the T4-like phages into the Straboviridae infecting Gammaproteobacteria and the Kyanoviridae infecting cyanobacteria [41]. Focusing only on the former, there are currently 633 Straboviridae representatives, 57 of which infect Klebsiella species. Their genome sizes range from 59 to 349 kb, with an average genome size of 172.4 kb S8 Table [28]. These 57 Klebsiella infecting Straboviridae phages represent three phage genera, Jiaodavirus (n = 32), Slopekvirus (n = 22), and Pseudotevenvirus (n = 1), and two phages unclassified at the genus level S8 Table.
Our study isolated 12 Straboviridae phages from sewage, water, and soil that belong to three phage genera, Jiaodavirus (n = 2), Kanagawavirus (n = 1), and Slopekvirus (n = 9) Tables 1 and S1. The genome sizes of our 12 Straboviridae phages range from 165.7–177.8 kb with 262–310 predicted open reading frames (ORFs) and 1–17 tRNAs Table 1 (Figs 2A, 2B and S1). The phage-encoded tRNAs were nearly identical within all phages isolated in each genus (Fig 2A). We compared codon usage between Klebsiella sp. M5al to see if tRNA presence was due to codon usage or amino acid usage bias, comparing all ORFs and predicted late ORFs [42]. No correlation existed between phage codon or amino acid usage and tRNAs encoded by the phage S1 Fig. However, this may not be surprising, as studies of codon usage bias in other phages have had mixed results [42–50].
[Figure omitted. See PDF.]
(A) tRNA abundances of all Straboviridae phages in the Kanagawavirus, Jiaodavirus, and Slopekvirus genera. Genome comparisons of (B) Straboviridae (C) Drexlerviridae (D) Autographiviridae phages to model phages in each family. Arrows represent forward (right) or reverse (left) open reading frames (ORFs), ORFs are color-coded by function, and ORFs shared across genomes are connected by shading that denotes their percent identity. Core ORFs shared across all RefSeq genomes in this family are marked with a black star. The genome figure was made using Clinker. S2 Fig. shows the Straboviridae genome broken into four parts for a zoomed-in comparison of the phage genomes.
Drexlerviridae family.
The Drexlerviridae family originally belonged to the Siphoviridae phage family until ICTV updates in 2020 [28]. This phage family has 367 complete genome-sequenced phages infecting Gammaproteobacteria (n = 345) and Campylobacterota (n = 22), including 98 that infect Klebsiella. The genome sizes range from 22–78 kb. Of the 98 Klebsiella Drexlerviridae phages, 92 derive from the phage genus Webervirus, one from the Vilniusvirus genus, and five are currently unclassified at the genus level S9 Table.
Our study isolated two Drexlerviridae phages, KLB26 and KLB19, from 2 environmental river water samples S1 Table. KLB19 belongs to the genus Vilniusvirus and has a phage genome length of 52.6 kb consisting of 88 ORFs, including one tRNA for serine and an intron within the tape measure chaperone Table 1 (Fig 2C). Like the Straboviridae phage family, there was no correlation between phage codon and amino acid usage for KLB19 S3 Fig. KLB26 is unclassified at the genus level, has a genome length of 51.7 kb, and comprises 86 ORFs with an intron in the tape measure chaperone Table 1 (Fig 2C).
Autographiviridae family.
The Autographiviridae phage family originally belonged to the Podoviridae family until 2020 ICTV updates [28]. The Autographiviridae phage family comprises 1,106 complete genomes with genome sizes from 30 kb ‐ 61 kb. The Autographiviridae phages infect six classes of bacteria: Alphaproteobacteria (n = 63), Bacilli (n = 2), Betaproteobacteria (n = 49), Cyanophyceae (n = 44), Gammaproteobacteria (n = 947), and Myxococcota (n = 1). Of the phages that infect Gammaproteobacteria, 210 infect Klebsiella and represent five phage genera Przondovirus (n = 100), Drulisvirus (n = 78), Teetrevirus (n = 14), Ningirsuvirus (n = 2), Eapunavirus (n = 1), and 14 phages unclassified at the genus level S10 Table.
Our study isolated and sequenced three Klebsiella phages—KLB4, KLB7, and KLB12—in the genus Teetrevirus from two sewage plants S1 Table. The genome sizes of our 12 Straboviridae phages range from 38.1–38.7 kb with 47–49 predicted ORFs. KLB7 and KLB12 belong to the same phage species and have 99.9% intergenic similarity. The differences include one single nucleotide polymorphism across the 38kb genomes, and KLB12’s genome is 55bp larger than KLB7’s genome (in a non-coding region of the phage genome; Fig 2D). KLB4 belongs to a different phage species (∼89% intergenomic similarity to KLB7 and KLB12), with 11 unique or significantly divergent ORFs from KLB7 and KLB12 (Fig 2D).
Ca. Mavericviridae.
Three phages—JVSB2, KLB16, and KLB22—were unclassified at the family level based on BLAST S3 Table [31]. These phages were isolated from soil, water, and sewage S1 Table and had genome sizes ranging from 44.8 to 47.3 kb with 65–72 ORFs Table 1 (Fig 3A).
[Figure omitted. See PDF.]
(A) Phage genome comparison of our Ca. Mavericviridae phages against the closest reference genomes. Arrows represent forward (to the right) or reverse (to the left) ORFs, ORFs are color-coded by function, and ORFs shared across genomes are connected by shading that denotes their percent identity. Core ORFs shared across all RefSeq genomes in this family are marked with a black star. The genome figure was made using Clinker. (B) Concatenated phylogenetic tree of the ten core genes found in >95% of all Ca. Mavericviridae phage families. Red stars denote the phages isolated in this study. UFBootstrap support is shown as a number on each node. The tree scale refers to the number of amino acid substitutions per site.
Comparing these phages to all NCBI viruses with complete genomes using BLAST [51] and gene-sharing networks [33], we identified 337 phages clustered into a single family. However, based on core genome and phylogenetic analyses, these 337 phages were reduced to 95, which shared core genes found in >95% of all phages and formed a monophyletic lineage.
We sought to determine if these phages were a novel phage family. Core genome analysis of the 95 phages identified ten genes present in at least 95% of the phages S11 Table. We used ViPTree [52] to generate a phylogenetic tree based on the viral proteome, which revealed that these phages represent a monophyletic group and are closely related to Drexlerviridae S4 Fig. To determine if this group of phages could be a novel family or belonged to, Drexlerviridae, we ran core genome analyses using all 95 phages and all Drexlerviridae phages. We identified no core genes shared between Drexlerviridae and the Ca. Mavericviridae S12 Table and found that Ca. Mavericviridae showed a monophyletic lineage, supporting this group as a novel family (See methods; Fig 3B). With these 95 phages, we further evaluated taxonomic ranks based on ICTV recommendations. Genomes with intergenomic similarity ≥70% were classified in the same genus and ≥95% in the same species [53]. Based on the intergenomic similarity calculated via VIRDIC [54], we propose 11 genera; our phages belong to three genera: Ca. Ashvirus, Ca. Buckeyevirus, and Ca. Alumvirus S5 Fig.
Ca. Rivulusviridae.
Four phages–KLB5, KLB24, KLB28, and KLB29 –were unclassified at the family level based on BLAST. These phages were isolated from soil and water and had genome sizes ranging from 39.7–40.2 kb with 53–60 ORFs Table 1 (Fig 4A). KLB24 and KLB28 share 96.4% intergenomic similarity. KLB29 is more closely related to KLB24 than KLB28, with 64.1% and 65.2% intergenomic similarities, respectively. KLB5 is distantly related to KLB24, KLB28, and KLB29 with 5.1–5.6% intergenomic similarity S6 Fig.
[Figure omitted. See PDF.]
(A) Arrows represent forward (to right) or reverse (to left) ORFs, ORFs are color-coded by function, and ORFs shared across genomes are connected by shading that denotes their percent identity. Core ORFs, genes shared across all RefSeq genomes, in this family are marked with a black star. The genome figure was made using Clinker. (B) Phylogenetic tree of the large terminase of representatives of each dsDNA family compared to the Ca. Rivulusviridae phages. Red stars denote the phages isolated in this study. UFBootstrap support is shown as a circle on each node. The tree scale refers to the number of amino acid substitutions per site.
These phages had no close relatives found via BLAST [51], but gene-sharing networks [33] identified 17 phages predicted to be related at the family level. Core genome analyses found 21 core genes. These core genes formed a monophyletic lineage using all proteomes in VipTree and through phylogenetic analyses of the core genes of Ca. Rivulusviridae (Figs 4B and S4, see methods), further supporting a novel phage family. With these 17 phages, we propose five phage genera based on intergenomic similarity [54], with our phages belonging to three: Colbvirus, Sherbvirus, and Darbyvirus S6 Fig.
Core genes of phages
With recent ICTV shifts to assigning phage families based on genetic similarity and core genes instead of phage morphology [41], we examined family-level core gene sets for each phage family. For previously identified phage families, we compared our phage genomes to RefSeq phages [Straboviridae (n = 222), Drexlerviridae (n = 122), Autographiviridae (n = 376)] and for the novel phage families with significantly fewer RefSeq phages we used all RefSeq and non-RefSeq phages [Mavericviridae (n = 96) and Rivulusviridae (n = 17)]. To account for the impact of misannotation or poor-quality sequences, we looked at the soft-core genome, defined as the core genes found in >95% of all phage genomes [55–57]. These analyses revealed that, on average, 18% of phage genes were core genes. The number of core genes in each phage family ranges from 1 to 43 genes, corresponding to 2 to 36.4% of genes. The majority of these genes were structural proteins that are part of the capsid and tail structures (S4 and S11–15 Tables and Fig 5).
[Figure omitted. See PDF.]
Core genes of the (A) Straboviridae, (B) Drexlerviridae, (C) Autographivirdae, (D) Ca. Mavericviridae and (E) Ca. Rivulusviridae phage families.
The Straboviridae family had the most functionally diverse core genes, with 15.7% of the ORFs being core genes (n = 43). These genes include transcription, DNA replication, structural, DNA packaging, nucleases, and other functions such as polynucleotide kinase, metallo-phosphoesterase, and an exonuclease (Fig 5A), and many of these genes have previously been identified when examining T4-like phages [58–62]. The Straboviridae phage family encoded two core genes involved in transcription, a late transcription coactivator and a late sigma transcription factor, to guide the host RNA polymerase to genes involved in the late stages of phage infection. In contrast, none of the other phage families transcription factors were core genes. This is likely due to the phage family encoding its machinery for DNA replication and the larger phage genome compared to the other phage families. However, in Escherichia coli T4, ∼1/3 of the ORFs, >100 genes, are involved in the takeover of the host RNA polymerase and transcription during early, middle, and late gene expression [63, 64]. Even with the number of genes involved in host takeover presumably, the lack of shared transcription- and replication-associated core genes indicates host-interaction strategies that more strongly adapt to individual hosts.
The remainder of the phage families presented much simpler cases. First, Drexlerviridae had 15 core genes, representing 19.2% of all ORFs (Fig 5B). Unlike Straboviridae, no genes are involved in DNA replication; this is unsurprising since Drexlerviridae phages, like T1, rely on bacterial host machinery for DNA replication and transcription [65]. Second, Autographiviridae had one core gene (Fig 5C), the large subunit terminase involved in DNA packaging. While Autographiviridae was initially proposed partly due to the conserved large, single-subunit RNA polymerase responsible for middle and late phage transcription, our analyses did not find that as a core gene. MMseqs2 [66] generated 17 unique clusters for the single-subunit RNA polymerase. Additionally, protein-sharing networks using all proteins show low protein sharing between some Autographiviridae subfamilies [67]. These findings indicate that further evaluation for this phage family is likely needed. Third, Ca. Mavericviridae had ten core genes, representing 16.9% of all ORFs (Fig 5D). These genes were involved in DNA packaging and structural proteins. Finally, Ca. Rivulusviridae had 21 core genes, representing 36.4% of all ORFs (Fig 5E). These genes were involved in DNA replication, structural, DNA packaging, nucleases, and hypothetical proteins. One of those core genes was a bifunctional DNA primase and polymerase that is predicted to synthesize DNA directly from dNTPs without a DNA primer bound to the DNA template. This class of primase and polymerase has previously been found in archaea, bacteria, humans, and some bacteriophages [68].
Auxiliary metabolic genes across the Klebsiella phages
When phages infect a bacterial host, they can drastically reprogram bacterial metabolism toward producing phage proteins [4]. Though challenging to in silico predict such reprogramming from genomes alone, some phages encode auxiliary metabolic genes (AMGs) that represent bacterial-derived genes that help redirect energy and resources toward phage production [69] and have been the focus of >100 papers in the viral ecology literature because of their prominence in environmental phages [70]. Thus, AMGs can indicate metabolic ‘knobs’ that phages tune to maximize phage production under varied conditions of phage-host co-evolution and ecological niche differentiation.
To identify AMGs, we screened our phages using current standard, scalable procedures emerging for virus ecogenomics [70–72]. This identified 48 AMGs found in 12 of our 24 phages. These AMGs were only found in the Straboviridae family. Among these 48 AMGs, predicted functional roles range from amino acid biosynthesis to NAD recycling and nucleotide metabolism (Fig 6A and 6B).
[Figure omitted. See PDF.]
(A) Phylogenetic tree of five of the types of AMGS identified in Klebsiella phages. The inner ring is colored by the phage family. Outer rings are colored by the AMG type identified using Dram-v. This tree was pruned from the tree in Fig 1. (B) Graphical representation of the AMGs identified in the phages isolated in this study created with BioRender.com. (C) Bar graph showing the number of phages in each family that had an AMG or did not have an AMG. The unclassified phages were clustered into family groupings using a gene-sharing network.
Since all these AMGs were found in a single phage family, we screened the Klebsiella phage dataset to see if AMG presence was family-dependent. Among the nine ICTV classified families, four of the families had the majority of the phages encoding AMGs. One family, Casjensviridae, exhibited AMGs in five out of nine phages, and four showed no AMGs (Fig 6C). We further explored potential family trends among unclassified phages by clustering them at the family level using a gene-sharing network. Among the 18 family clusters identified, four families exclusively encoded AMGs, seven had none, and seven displayed a mix of phages with and without AMGs S16 and S17 Tables (Fig 6C). To gain insights into why certain phage families contained AMGs, we assessed the presence and abundance of AMGs relative to genome size. Interestingly, phages with AMGs tended to have significantly larger genomes (p = <0.0001). Additionally, the number of AMGs was correlated with genome size (R2 = 0.28 p = <0.0001) S7 Fig. This analysis elucidates a nuanced distribution pattern of AMGs across Klebsiella phage families. It suggests a potential association between genome size and AMG presence, highlighting the intricate nature of viral genomic content.
Beyond family-level AMG carriage observations, we next explored what kinds of functions Klebsiella phage genomes encode via AMGs. Among the Straboviridae, our phages encode several genes involved in nucleotide biosynthesis. Viruses are well known for reprogramming nucleotide metabolism, as nucleotides are necessary for DNA replication and transcription. It has been debated whether nucleotide biosynthesis genes should be classified as AMGs due to their involvement in common viral functions [73]. However, these nucleotide biosynthesis AMGs help us understand how phages are reprogramming the metabolic potential of the bacterial host and serve an essential role in increasing the host’s metabolic capacity to aid in nucleotide production for phages with high nucleotide demands. To illustrate this, our phages in the genera Slopekvirus and Jiaodavirus encoded an anaerobic ribonucleotide reductase that reduces ribonucleotides to 2’-deoxyribonucleotides which can then be used as a precursor for dNTPs [74]. The Kanagawavirus KLB31 encodes a thymidylate synthase that catalyzes the synthesis of dTMP [75]. We hypothesize that these phage-specific AMG differences indicate two unique approaches to acquiring nucleotides during DNA replication, either through ribonucleotides or via the thymidine metabolic route.
Beyond nucleotide-related AMGs, we also found amino acid-related AMGs. KLB31 encodes a 3-deoxy-D-arabinoheptulosonate-7-phosphate (DAHP) synthetase. This enzyme catalyzes the first step in the seven-step shikimate pathway for the biosynthesis of three aromatic amino acids (phenylalanine, tyrosine, and tryptophan), effectively controlling the amount of carbon entering the pathway [76]. Though not yet observed in terrestrial phages, genes related to the shikimate pathway were recently identified in a global oceans AMG study [70]. We hypothesize that during infection, these AMGs may help generate aromatic amino acids or products that use these amino acids, such as precursors for other metabolites, such as those involved in electron transport, communication, cofactors, and lipid production [77].
Finally, we identified AMGs associated with phages potentially affecting redox balance. Specifically, all Klebsiella Slopekvirus genus phages encode enzymes involved in salvaging nicotinamide adenine dinucleotide (NAD), which plays crucial roles in electron transfer and various metabolic processes during bacterial and phage infection [78, 79]. These enzymes facilitate the conversion of nicotinamide to NAD+, essential for nucleotide biosynthesis and modifying transcription and DNA synthesis during lytic phage infection [80, 81]. This suggests that phages with NAD+ salvaging pathways utilize NAD+ for multiple functions, including metabolic processes and modifying host RNA polymerase activity to favor phage promoters.
Host range of phages
Many Klebsiella isolates are covered by a capsule, a thick and diverse polysaccharide matrix surrounding the bacterium that masks cell surfaces [82]. Klebsiella phages have evolved a depolymerase to digest the capsule and reach the cell surface [83]. These capsules are highly diverse, with >79 capsular serotypes [82]. As such, phages recognize specific serotypes, which results in many Klebsiella phages infecting only a narrow range of hosts [84, 85]. Phages that do not rely on the capsule for adsorption have been found to have a broader host range, even against encapsulated Klebsiella isolates [84, 85]. Since our phages infect a nonencapsulated Klebsiella and do not rely on the capsule for adsorption, we sought to see if our phages had broader host ranges.
We performed host-range analyses for the 24 phages against 22 bacterial isolates that represented 4 Klebsiella species (19 isolates) and 3 Raoultella species closely related to Klebsiella (3 isolates) S18 Table (Fig 7). The phages with the broadest host range were the Straboviridae and Autographiviridae phage families, while the Ca. Mavericviridae, KLB5 and KLB26, had the narrowest host ranges, infecting 1–2 Klebsiella or Raoultella isolates (Fig 7). This indicated that not all phages isolated on nonencapsulated Klebsiella have broad host ranges. The host range of specific phages may be due to other phage and host genomic characteristics such as tail receptors, phage defense systems, etc.
[Figure omitted. See PDF.]
(A) Heatmap of the 24 Klebsiella phages screened against 19 Klebsiella isolates and 3 Raoultella isolates.
The phages in the Straboviridae phage family infected 5–14 isolates (average = 11.75). It has been previously found that phages in this family, specifically the Slopekvirus and the Jiaodavirus genera, have broader host ranges, with many of these phages in this genus infecting >50% of Klebsiella isolates tested, including infecting several Klebsiella genera [7, 84, 86]. It is hypothesized that the broader host range of several phage genera in these groups could be due to homing endonucleases [86, 87] or tRNAs [43]. We compared the abundance of two types of homing endonucleases to the host range for our phages and the abundance of tRNAs to the host range. Still, we found no correlation between homing endonucleases (R2 = 0.2979) or tRNA abundances (R2 = 0.3908) and host range S8 Fig (Fig 7). Indicating in our Slopekvirus phages tRNAs nor homing endonucleases significantly impacted the host range, and other factors are likely involved in driving host range.
KLB4, KLB7, and KLB12 in the Autographivirdae family had the second broadest host range and could infect between 6–9 strains, including 2 in the Raoultella genus. These Autographiviridae phages encoded 1–2 phage defense elements. KLB4, KLB7, and KLB12 encoded a S-adenosyl-L-methionine (SAM) hydrolase, and KLB4 encoded a dGTPase inhibitor. These phage defense elements protect the phage from type I restriction-modification systems [88] and depletion of the nucleotide pool [89], respectively, and may help explain the broader host range.
Conclusion
Here, we isolated a diverse range of phages infecting soil-associated, plant growth-promoting, nonencapsulated Klebsiella sp. M5al. We found that many of these phages fell within the three largest families of Klebsiella phages. The isolation of these viruses led to the proposal of two novel phage families, Ca. Mavericiviridae and Ca. Rivulusviridae, which exemplifies the ever-expanding diversity and classification of known phages [41]. Identifying AMGs in the Straboviridae provides function-based hypotheses about phage and environment-specific metabolic reprogramming “niches” that parallel observations coming from focused “virocell” experimental systems [4, 90]. Host range analyses provide examples of phage families that are more specific versus general in their infection modalities, and as data scale across the field, will undoubtedly contribute to correlative inferences that will be predictive, a key advance for scaling phage therapy and other phage-based applications. Overall, this research significantly advances our understanding of Klebsiella phages infecting a soil-derived, nonencapsulated Klebsiella strain, which provides a foundation for in-depth “virocell” experiments to explore intricacies of phage biology, phage-bacteria interaction dynamics in situ, and ultimately applications in agriculture, environmental management, and biotechnology.
Methods
Sample collection for phage isolation
For phage isolation, soil, water, and sewage were collected around Ohio S1 Table. The soil was collected from personal properties by the owners. Water samples were collected from public rivers. For sewage, the influent was collected from the Jackson Pike and Southerly wastewater plants in Columbus, Ohio, with permission from the sewage plants. Influent samples collected from the Jackson Pike and Southerly WWTP’s were provided by Dr. Thomas Wittum’s laboratory. All samples were collected in Ohio, and phages were isolated at Ohio State University and Ashland University. No permits were required as all the sampling and phage collection were performed within Ohio state boundaries.
Bacteriophage isolation
Klebsiella sp. M5al was kindly provided by Dr. Adam Arkin from the University of Berkeley and stored at -80°C in 15% glycerol until use. Klebsiella sp. M5al was cultivated at 30°C, shaking at 150 rpm in Luria Broth (LB). The Klebsiella phages were isolated from soil, water, and sewage samples around Ohio using a modified version of previously described techniques S1 Table [32]. Briefly, for soil samples, 1–2 grams of soil were resuspended into 8 mL of LB. For water samples, equal volumes of the water sample were mixed with 2XLB. For sewage samples, bacteria were first removed by spinning the samples at 8000 xg for 10 minutes to pellet debris. This was followed by filtration of the sewage through a 0.22 μM PES filter. The filtered sewage was mixed with equal volumes of 2XLB for a total volume of 9mLs. All samples were supplemented with 10 mM CaCl2 and 10 mM MgSo4 and spiked with 100uL of an overnight culture of Klebsiella sp. M5al. These samples were then incubated overnight at 30°C shaking at 150 rpm to enrich for phages infecting Klebsiella sp. M5al, then centrifuged at 8000xg for 10 minutes to pellet the bacteria. The lysate was filtered through a 0.22 μM PES filter to remove bacterial contaminants. Then 5 μL of the filtered lysates were spot plated onto a bacterial lawn consisting of 300 μL of overnight culture and 4 mL of 0.7% LB soft agar overlaid on a 1.4% LB agar plate. After 24 hours of incubation at 30°C, spot plates were screened for zones of lysis. Lysates with positive lysis zones were serially diluted and then plated onto 4 mL of 0.7% LB soft agar overlay on a 1.4% LB agar plate with 300 μL of overnight bacterial culture added. Plates were incubated at 30°C overnight. Single, isolated plaques were resuspended in 500 μL of SM buffer (100mM NaCl, 8mM MgSO4, 50mM Tris, pH7,5), diluted, and sequentially plated three times to obtain a monoclonal phage population. Phages were then amplified using previously described techniques [91].
Phage DNA isolation and sequencing
Phage DNA was extracted as previously described [92]. Briefly, extracellular DNA and RNA were removed from ∼110 PFU lysates and treated with DNase and RNase for 90 minutes. These enzymes were then deactivated using EDTA (final concentration 20 mM). Next, the proteinaceous phage capsids were degraded using Proteinase K (20mg/mL) and incubated at 56°C for 90 minutes. The phage DNA was purified and concentrated using the DNeasy Blood and Tissue kit protocol for gram-negative bacteria for DNA purification (catalog #69506). Purified DNA was sequenced using NextSeq 2000 200Mbp at SeqCenter (previously MIGS), producing an average of 1.5 Mbp total read pairs per isolate.
Phage genome assembly
Phage genomes were assembled as previously described [93, 94]. The reads were trimmed using trimmomatic (v.0.39.2) and checked for quality using fastQC (v.0.11.8) [95, 96]. Reads were subsampled using VSEARCH (v.2.14.1) to obtain a coverage between 20-200X. The phage genome was assembled by SPAdes (v.3.15.5) using subsampled reads and visualized using Bandage to ensure one complete contig was assembled [97–99]. The BBTools (v.38.69) package, BBmap, was used to check coverage at this step to ensure coverage was between 20-200X, as outlined in Millard’s protocol [94, 100]. The output sam file from bbmap was converted to a bam file and indexed using samtools (v.1.10) (92). Error correction was performed by mapping the reads back to the assembled phage genome using Pilon (v.1.24) [101].
Phage termini were determined using several methods. First, all phage genomes and reads were run through PhageTerm (v.3.1.0) software to see if termini could be predicted using read mapping [102]. Phages that could not have their termini predicted were first checked for a circular genome permutation using the apc. pl script. This program also checks for repetitive artifacts from the assembly that were manually removed (https://github.com/jfass/apc/blob/master/apc.pl). Phages whose termini were not predicted via PhageTerm were then either (1) aligned based on the closest phage relative (>90% coverage*percent identity) or (2) for phages that did not have a close relative, these genomes were rearranged with an arbitrary gene like the small and large terminases S2 Table [93]. After the termini were determined for a phage, the coverage of all of the trimmed reads against the nearly finalized genome was checked with BBmap to determine the actual coverage [100], and the isolated genome underwent a final polishing step with Pilon to create the final genome assembly file [101].
Long read sequencing of incomplete phage genomes
Three phages (KLB24, KLB28, KLB30) had incomplete contigs after short-read assembly with SPades. These phage genomes were sequenced using Oxford Nanopore Technology’s MinION Mk1b long-read sequencer, and a hybrid, short, and long-read assembly was done to obtain a single, complete contig for each isolate. Briefly, reads were demultiplexed using the guppy basecaller (v2.3.1), and nanopore adapters were removed using porechop (v0.2.4) [103]. The BBtools reformat script removed reads shorter than 1000 base pairs [100]. Reads were then assembled using the hybrid option of SPAdes, which utilizes a short read assembly with correction by the long read sequences [98]. The subsampled short reads were used for the hybrid assembly. The same steps used in short read assembly are followed after hybrid SPAdes assembly of the reads.
Phage genome annotation
Multiple tools were used to identify the ORFs in phage genomes with a scoring system that has been shown to provide the most accurate genome annotations [104, 105]. Our scoring system included eight ORF prediction tools GeneMark, GeneMarkS, GeneMarkS2, GeneMarkHMM, Glimmer, Prodigal, MetaGeneAnnotator, and Phanotate [106–113]. For each gene the scoring system used the number of ORF program calls, gene length, gene overlap, protein identification, programming potential, and presence in an operon. Scores of 3 or more were kept while scores of 0–2 were only kept if there was a BLAST hit or predicted by two or more ORF programs.
ORF functions were assigned based on high-scoring hits to BLASTp, Swiss-prot protein database, and HHPRED [51, 114]. Introns, spanins, terminators, and tRNAs were identified using programs from the structural and functional phage annotation pipelines in Apollo [115].
To calculate whether tRNA presence were due to codon biases, the codon usage for the bacteria and each phage was calculated in R using uco from the seqinr package. Codon usage of the bacterium was subtracted from the phage to calculate differences in codon usage. Boxplots were generated for each codon using ggplot2 [116].
Phage taxonomy classifications
To identify the genus and family level of each phage, the complete genomes of all 24 phages were run through BLAST [51] to identify the closest phage relative. Using the top BLAST hit, the intergenomic similarity was calculated by multiplying the genome coverage by the percent identity. Using current ICTV recommendations, phages with ≥95% intergenomic similarity belonged to the same species and genus, while phages with ≥70% and <95% intergenomic similarity belonged to the same genus with a novel species [53].
The protein sequences of the 24 phages isolated in this study, the close relatives of those phages identified via BLAST, and all NCBI Virus RefSeq genomes (release 220) were run through a gene sharing network program (beta mode) [33] using default parameters S5 Table. With this gene-sharing network, distances are based on a Jaccard similarity-like distance metric, and instead of a single network, multiple networks at several clustering identities are employed. Optimal distance thresholds are identified for each network and each taxonomic rank from order to genus. By employing these disparate networks and adding group-specific markers, the gene-sharing network tool can classify realm, order, family, subfamily, and genus ranks with varying confidence levels for each realm. To find close proteomic relatives to the unclassified phages at the family level, we identified the RefSeq phages with the same phage family. KLB26 was classified in the Drexlerviridae phage family with a novel phage genus S5 Table. KLB24, KLB28, KLB29, and KLB5 were classified into a single phage family with 5 additional phages. JVSB2, KLB16, and KLB22 clustered in a single family with 334 additional phages. Non-RefSeq phages for the two novel phage families were identified via BLAST [51]. To define a phage family, we used ICTV recommended guidelines that these phages must (a) share a significant number of orthologues, (b) represent a cohesive and monophyletic group via proteome-based clustering tools, (c) if the phage family shares orthologues with another family, further confirm that these orthologues are a monophyletic lineage [53]. To confirm the phages formed a monophyletic lineage the phages from the two novel families were run through VipTree [52] and MMSeqs2 [66] to confirm that a significant number of genes were shared within the family.
Rivulusviridae formed a monophyletic lineage, while the Mavericviridae phages were polyphyletic gene found in >95% of all phages S19 Table and S4 Fig. To be on the conservative side, we only used the phages identified in VipTree as monophyletic and had ∼10% of ORFs shared in >95% of all phages, so 239 phages in the Mavericviridae phage family were removed. All further analyses were on our 3 phages with the 92 phages that supported a monophyletic lineage.
Phylogenetic analyses of phages
All Klebsiella infecting phages were retrieved from NCBI Virus using filters for viruses (taxid: 10239), nucleotide completeness, and Klebsiella (taxid: 570). As of February 6th, 2023, 581 phages in NCBI Virus were downloaded and used for this study [28]. We excluded 16 phages with 100% ANI, resulting in a final set of 565 Klebsiella phages.
To further reduce redundancy in the phage tree, these phages were clustered with MMseqs2 (60) using the options ‐‐min-seq-id 0.95 -c 0.8. This reduced the dataset to 523 sequences, which were subsequently used for the phylogenetic tree (Fig 1). For phylogenetic inference, we used two approaches. Firstly, we used a multi-loci gene tree comprising four genes: the large terminase, small terminase, major capsid, and portal gene. Multiple methods were used to extract the marker genes from every genome. Genes that were already annotated as the marker genes were extracted from the genomes in nucleotide format, and then each marker gene’s list of proteins was manually inspected to remove obvious incorrect annotations, such as having too short or too long of a sequence compared to others. Each marker gene was clustered with VSEARCH using the ‐‐cluster_fast ‐‐id 0.75 options, and then every cluster was aligned using MAFFT, which was used to build a hidden Markov model (HMM) using hmmbuild with default options [117]. The HMM for each cluster of each marker gene was used to search a file containing every gene of the 537 phage dataset. The results were collated for each HMM of each marker gene, duplicates were removed, and results with an e-value lower than 10−3 were kept. If there were multiple results for a marker gene for one genome, the result with the lower e-value was kept. For the tree of all Klebsiella phages, the evolutionary model used was Blosum62+R5 for the large terminase, PMB+F+R3 for the small terminase, VT+F+I+G4 for the major capsid protein and the portal protein using the partition model and Maximum Likelihood phylogenetic inference in IQ-TREE 2.0 [118–120]. The phylogenetic trees were then visualized, and tree tips annotated and color-coded in iTOL v6 [121].
Phylogenetic inference of the Ca. Mavericviridae and the novel Ca. Rivulusviridae family were both based on a gene tree of the large terminase gene. Briefly, among representative reference NCBI sequences, the large terminase was extracted from the respective genomes based on their annotation and combined with large terminase genes annotated from our sequences. Sequences were then aligned using the E-INS-i strategy with over 1,000 iterations in MAFFT v7.017 [117]. All trees’ aligned sequences were trimmed using Trimal [122] using the -gappyout option that utilized gap and similarities distribution to establish the appropriate trimming threshold. The trimmed sequence alignments were visualized and manually inspected in JalView [123], for overhangs and/or short sequences, which were subsequently removed. The most suitable evolutionary model for the large terminase sequences was inferred using ModelFinder [124]. The evolutionary model LG+I+G4 and LG+F+R5 was selected as the optimal evolutionary mode for the Ca. Mavericviridae and Ca. Rivulusviridae, respectively, and integrated into the Maximum Likelihood phylogenetic inference in IQ-TREE 2.0 [118].
Comparison of phages against metagenomic datasets
Viral contigs were predicted from the Genome Resolved Open Watershed database (GROWdb) assemblies by first filtering contigs to only those larger than 5kb using seqkit version 2.7.0. Viral contigs >5kb were then predicted with Genomad v1.7.4 using the option “end-to-end” [125]. Viral contigs from all samples were collected into a single reference file for virus operational taxonomic units (vOTU) clustering and clustered using CheckV version 0.8.1 and a custom script that leverages BLAST+ with the scripts CheckV anicalc.py and aniclust.py with options “—min-ani 95 –min-tcov80” [126].
The prevalence of the 24 Klebsiella phages was mapped against vOTUs derived from the Global Soil Virus Atlas [34] and the GROWdb [35]. The 24 phage genomes were mapped against these vOTUs using CoverM v0.6. 1–3 using two options ‐‐‐‐min-read-percent-identity .95 and ‐‐min-read-percent-identity .70 and ‐‐min-read-aligned-percent .75 ‐‐min-covered-fraction .70 -m trimmed_mean [127]. The depth of sequencing from each isolate then normalized coverage values. The reads from the 24 phage isolates were mapped against these vOTUs from both of the above datasets using CheckV’s anicalc.py and aniclust.py with options “‐‐min-ani 95 or ‐‐min-ani 70 and ‐‐min-tcov 80” [126].
Phage genome comparisons
Phage genome comparisons were done using Clinker genome analyses [128]. The phage nucleotide sequences were uploaded to Clinker with minimum alignment sequence identity set to 0.3 [128]. One representative for each phage genus was used to compare nucleotide sequence to all other phages in the genus. The coloring of each gene was done manually based on product function. Stars denoting core genes identified using MMSeqs2.0 [66] were manually added to the Clinker maps using Adobe Illustrator.
Identification of core genes in phages
The set of core genes for a single family was found using MMSeqs2. A database was made: mmseqs createdb prot.faa DB [66]. The database was clustered with a minimum sequence identity of 30% and a coverage of 80%: mmseqs cluster DB DB_clu ‐‐min-seq-id 0.30 -c 0.8 tmp. To analyze the data, a tsv file was made of the clustered database: mmseqs createtsv DB DB DB_clu mmseq_unchar_seq_0.3_cov_0.8.tsv. A gene was considered a core gene if present in >95% of the phages.
Identification of AMGs
AMGs were identified using DRAM-v, the viral mode of the software DRAM [72]. The input was the finalized phage genome or a file with all the nonredundant Klebsiella phage sequences, and the output was the annotations made by DRAM-v S16 and 17 Tables. The annotations were curated so that only genes with a rank of C or higher were considered. Annotations with the flag M were considered AMGs, while AMGs containing V, B, or T flags were removed. Since our study looked at complete genomes and not metagenomic sequences that must worry about prophage ends, the F flags were not removed, indicating that the gene was at the end of the genome. An overview of the functions of the AMGs in the manuscript was created using BioRender (Fig 6C). S7 Fig was generated in GraphPad. Statistical significance for S7A Fig statistical significance was calculated using a parametric, unpaired two-tail t-test. S7B and S7C Fig used linear regression to identify the R2 value and tested the null hypothesis that the overall slope of the linear regression is 0, calculated from an F test.
Host-range testing of phages
For host-range testing, each Klebsiella and Raoultella strain was incubated and grown at 30°C 150 rpm overnight S18 Table. The bacterial lawn was made by mixing 300uL of the overnight Klebsiella or Raoultella strain with 5mL of 0.7% LB agar. Each phage was diluted to 1E8 PFU/mL, and 5uL of the phage was plated onto a bacterial lawn. Plates were incubated at 30°C overnight, and the phage spots were scored for either having a clear zone (complete lysis), a hazy zone of clearance (turbid), or no visual difference in the bacterial lawn (no lysis).
Supporting information
S1 Fig. tRNA codon bias in the Straboviridae phage family.
(A) cosine similarity of codons of each phage compared to Klebsiella sp. M5a1 for all genes, (B) genes in the late stage of phage infection, and (C) structural genes in the late stage of phage infection.
https://doi.org/10.1371/journal.pone.0313947.s001
(TIF)
S2 Fig. Representative Straboviridae clinker genome comparison.
Arrows represent forward (right) or reverse (left) open reading frames (ORFs), ORFs are color-coded by function, and ORFs shared across genomes are connected by shading that denotes their percent identity. Core ORFs shared across all RefSeq genomes in this family are marked with a black star. The genome figure was made using Clinker. The genome was split into four sections to better zoom in on the individual genes.
https://doi.org/10.1371/journal.pone.0313947.s002
(TIF)
S3 Fig. tRNA codon bias in the Drexlerviridae phage family.
(A) Cosine similarity of codons of each phage compared to Klebsiella sp. M5a1 for all genes, (B) genes in the late stages of phage infection, and (C) structural genes in the late stage of phage infection.
https://doi.org/10.1371/journal.pone.0313947.s003
(TIF)
S4 Fig. Viptree of the Ca. Rivulusviridae and Ca. Mavericviridae phage families compared to all Ref-Seq phages.
The red stars are the Ca. Mavericidae and Ca. Rivulusviridae phages.
https://doi.org/10.1371/journal.pone.0313947.s004
(TIF)
S5 Fig. VIRDIC genome analysis of the phages in the Mavericviridae family.
Phages with an intergenomic similarity ≥70% were grouped into a phage genus. Eleven phage genera of Mavericviridae are colored (green) Bowlingvirus, (dark blue) Gwanakvirus, (red) Yanchengvirus, (orange) Alumvirus, (purple) Buckeyevirus, (light green) Ashvirus, (light blue) Kijivirus, (yellow) Tamuvirus, (brown) Auburnvirus, (pink) Hildvirus, (light purple) Dhillonvirus. Phages labeled in white are the phages isolated in this study.
https://doi.org/10.1371/journal.pone.0313947.s005
(TIF)
S6 Fig. VIRDIC genome analysis of the phages in the Rivulusviridae phage family.
Phages with an intergenomic similarity ≥70% were grouped into a phage genus. 5 phage genera of Rivulusviridae are color-coded (green) Colbvirus, (blue) Sherbvirus, (yellow) Lucvirus, (orange) Darbyvirus, and (pink) Cinnavirus. Phages labeled in white are the phages isolated in this study.
https://doi.org/10.1371/journal.pone.0313947.s006
(TIF)
S7 Fig. Correlation of AMGs and genome size.
(A) Comparison of the genome sizes of phages that contain AMGs against phages that did not encode AMGs. A parametric, two-tailed, unpaired T-test was used to test statistical significance between the two groups. (B) Linear regression of phage containing AMGS compares genome size and the number of AMGs. (C) Linear regression of all Klebsiella phages genome sizes and number of AMGs. R2 and statistical significance were calculated using standard settings of linear regression in GraphPad. All figures were made in GraphPad.
https://doi.org/10.1371/journal.pone.0313947.s007
(TIF)
S8 Fig. Comparison of the number of phage-encoded homing endonucleases and tRNA abundances to bacterial host range.
(A) Number of phage encoded homing endonucleases and number of bacteria a phage complete lyses and (B) completely or partially lyses. The number of phage-encoded tRNAs and the (C) number of bacteria a phage completely lyses and (D) completely or partially lyses. Linear regression of all Klebsiella phage genome sizes and number of AMGs. R2 and statistical significance were calculated using standard settings of linear regression in GraphPad. All figures were made in GraphPad.
https://doi.org/10.1371/journal.pone.0313947.s008
(TIF)
S9 Fig. VIRDIC genome analysis of all the phages isolated in this study.
https://doi.org/10.1371/journal.pone.0313947.s009
(TIF)
S1 Table. Information on sampling sites for phages isolated in this study.
Information on the sample site, type of sample, date of sampling, date of enrichment, and city that each phage was isolated from.
https://doi.org/10.1371/journal.pone.0313947.s010
(XLSX)
S2 Table. General information for phages isolated in this study.
https://doi.org/10.1371/journal.pone.0313947.s011
(XLSX)
S3 Table. Top BLAST results for each phage.
The top BLAST result for each phage includes the query coverage, e value, percent identity, genome accession, and taxonomy. The intergenomic similarity was calculated by multiplying the query coverage by the percent identity and taxonomy for isolated phages using current ICTV guidelines.
https://doi.org/10.1371/journal.pone.0313947.s012
(XLSX)
S4 Table. MMSeqs2 clustering of Drexlerviridae RefSeq phages.
https://doi.org/10.1371/journal.pone.0313947.s013
(XLSX)
S5 Table. vConTACT3.0 clustering of Ca. Mavericviridae and Ca. Rivulusviridae phages.
https://doi.org/10.1371/journal.pone.0313947.s014
(XLSX)
S6 Table. Similarity hits of the Klebsiella phages to the GSVA and GROWdb viruses.
https://doi.org/10.1371/journal.pone.0313947.s015
(XLSX)
S7 Table. Accession of all non-redundant Klebsiella phages.
https://doi.org/10.1371/journal.pone.0313947.s016
(XLSX)
S8 Table. List of RefSeq phages in the Straboviridae family.
https://doi.org/10.1371/journal.pone.0313947.s017
(XLSX)
S9 Table. List of RefSeq phages in the Drexlerviridae family.
https://doi.org/10.1371/journal.pone.0313947.s018
(XLSX)
S10 Table. List of RefSeq phages in the Autographiviridae family.
https://doi.org/10.1371/journal.pone.0313947.s019
(XLSX)
S11 Table. MMSeqs2 clustering of the 95 Ca. Mavericviridae phages.
https://doi.org/10.1371/journal.pone.0313947.s020
(XLSX)
S12 Table. MMSeqs2 clustering of the Ca. Mavericviridae and Drexlerviridae RefSeq phages.
https://doi.org/10.1371/journal.pone.0313947.s021
(XLSX)
S13 Table. MMSeqs2 clustering of the Ca. Rivulusviridae RefSeq phages.
https://doi.org/10.1371/journal.pone.0313947.s022
(XLSX)
S14 Table. MMSeqs2 clustering of the Straboviridae RefSeq phages.
https://doi.org/10.1371/journal.pone.0313947.s023
(XLSX)
S15 Table. MMSeqs2 clustering of the Autographiviridae RefSeq phages.
https://doi.org/10.1371/journal.pone.0313947.s024
(XLSX)
S16 Table. Dram-v output of the auxiliary metabolic genes from the NCBI Klebsiella phages.
https://doi.org/10.1371/journal.pone.0313947.s025
(XLSX)
S17 Table. Dram-v output of the auxiliary metabolic genes from the 24 phages isolated in this study.
https://doi.org/10.1371/journal.pone.0313947.s026
(XLSX)
S18 Table. List of bacterial strains used in this study.
https://doi.org/10.1371/journal.pone.0313947.s027
(XLSX)
S19 Table. MMSeqs2 clustering of the 334 phages identified with vConTACT3.0 and the three Mavericviridae phages isolated in this study.
https://doi.org/10.1371/journal.pone.0313947.s028
(XLSX)
Acknowledgments
We want to thank the Center of Microbiome Science at OSU for their bioinformatic support in calculating the occurrence of our phages in the GSVA and GROWdb datasets. We also thank Marie Burris for the soil and water samples used for phage isolation and Funing Tian for her feedback on the manuscript. We thank Brian Ahmer for the Klebsiella and Raoultella isolates, as well as the USDA-ARS Culture Collection (NRRL) for several microbial strains used in this work.
References
1. 1. Batinovic S, Wassef F, Knowler SA, Rice DTF, Stanton CR, Rose J, et al. Bacteriophages in Natural and Artificial Environments. Pathogens. 2019 Sep;8(3):100. pmid:31336985
* View Article
* PubMed/NCBI
* Google Scholar
2. 2. Suttle CA. Marine viruses ‐ Major players in the global ecosystem. Nat Rev Microbiol. 2007 Oct;5(10):801–12. pmid:17853907
* View Article
* PubMed/NCBI
* Google Scholar
3. 3. Fuhrman JA. Marine viruses and their biogeochemical and ecological effects. Nature. 1999 Jun;399(6736):541–8. pmid:10376593
* View Article
* PubMed/NCBI
* Google Scholar
4. 4. Howard-Varona C, Lindback MM, Bastien GE, Solonenko N, Zayed AA, Jang HB, et al. Phage-specific metabolic reprogramming of virocells. ISME J. 2020 Jan 2;14(4):881–95. pmid:31896786
* View Article
* PubMed/NCBI
* Google Scholar
5. 5. Sullivan MB, Lindell D, Lee JA, Thompson LR, Bielawski JP, Chisholm SW. Prevalence and Evolution of Core Photosystem II Genes in Marine Cyanobacterial Viruses and Their Hosts. PLOS Biol. 2006 Jul 4;4(8):e234. pmid:16802857
* View Article
* PubMed/NCBI
* Google Scholar
6. 6. Łusiak-Szelachowska M, Weber-Dąbrowska B, Żaczek M, Borysowski J, Górski A. The Presence of Bacteriophages in the Human Body: Good, Bad or Neutral? Microorganisms. 2020 Dec 16;8(12):2012. pmid:33339331
* View Article
* PubMed/NCBI
* Google Scholar
7. 7. Townsend EM, Kelly L, Gannon L, Muscatt G, Dunstan R, Michniewski S, et al. Isolation and Characterization of Klebsiella Phages for Phage Therapy. Phage. 2021;2(1):26–42. pmid:33796863
* View Article
* PubMed/NCBI
* Google Scholar
8. 8. Emerson JB, Roux S, Brum JR, Bolduc B, Woodcroft BJ, Jang HB, et al. Host-linked soil viral ecology along a permafrost thaw gradient. Nat Microbiol. 2018;3(8):870–80. pmid:30013236
* View Article
* PubMed/NCBI
* Google Scholar
9. 9. Trubl G, Jang HB, Roux S, Emerson JB, Solonenko N, Vik DR, et al. Soil Viruses Are Underexplored Players in Ecosystem Carbon Processing. mSystems. 2018 Oct 2;3(5):e00076–18.
* View Article
* Google Scholar
10. 10. Daly RA, Roux S, Borton MA, Morgan DM, Johnston MD, Booker AE, et al. Viruses control dominant bacteria colonizing the terrestrial deep biosphere after hydraulic fracturing. Nat Microbiol. 2019 Feb;4(2):352–61. pmid:30510171
* View Article
* PubMed/NCBI
* Google Scholar
11. 11. Amundson KK, Borton MA, Daly RA, Hoyt DW, Wong A, Eder E, et al. Microbial colonization and persistence in deep fractured shales is guided by metabolic exchanges and viral predation. Microbiome. 2022 Jan 16;10(1):5. pmid:35034639
* View Article
* PubMed/NCBI
* Google Scholar
12. 12. Zhong ZP, Tian F, Roux S, Gazitúa MC, Solonenko NE, Li YF, et al. Glacier ice archives nearly 15,000-year-old microbes and phages. Microbiome. 2021 Jul 20;9(1):160. pmid:34281625
* View Article
* PubMed/NCBI
* Google Scholar
13. 13. Zhong ZP, Rapp JZ, Wainaina JM, Solonenko NE, Maughan H, Carpenter SD, et al. Viral Ecogenomics of Arctic Cryopeg Brine and Sea Ice. mSystems. 2020 Jun 16;5(3): pmid:32546670
* View Article
* PubMed/NCBI
* Google Scholar
14. 14. Chevallereau A, Pons BJ, van Houte S, Westra ER. Interactions between bacterial and phage communities in natural environments. Nat Rev Microbiol. 2022 Jan;20(1):49–62. pmid:34373631
* View Article
* PubMed/NCBI
* Google Scholar
15. 15. Williamson KE, Fuhrmann JJ, Wommack KE, Radosevich M. Viruses in Soil Ecosystems: An Unknown Quantity Within an Unexplored Territory. Httpsdoiorg101146annurev-Virol-101416-041639. 2017 Sep 29;4:201–19. pmid:28961409
* View Article
* PubMed/NCBI
* Google Scholar
16. 16. Kuzyakov Y, Mason-Jones K. Viruses in soil: Nano-scale undead drivers of microbial life, biogeochemical turnover and ecosystem functions. Soil Biol Biochem. 2018 Dec 1;127:305–17.
* View Article
* Google Scholar
17. 17. Villalpando-Aguilar JL, Matos-Pech G, López-Rosas I, Castelán-Sánchez HG, Alatorre-Cobos F. Phage Therapy for Crops: Concepts, Experimental and Bioinformatics Approaches to Direct Its Application. Int J Mol Sci. 2022 Dec 25;24(1):325. pmid:36613768
* View Article
* PubMed/NCBI
* Google Scholar
18. 18. Svircev A, Roach D, Castle A. Framing the Future with Bacteriophages in Agriculture. Viruses. 2018 Apr 25;10(5):218. pmid:29693561
* View Article
* PubMed/NCBI
* Google Scholar
19. 19. Sieiro C, Areal-Hermida L, Pichardo-Gallardo Á, Almuiña-González R, de Miguel T, Sánchez S, et al. A Hundred Years of Bacteriophages: Can Phages Replace Antibiotics in Agriculture and Aquaculture? Antibiotics. 2020 Aug;9(8):493. pmid:32784768
* View Article
* PubMed/NCBI
* Google Scholar
20. 20. Yang K, Wang X, Hou R, Lu C, Fan Z, Li J, et al. Rhizosphere phage communities drive soil suppressiveness to bacterial wilt disease. Microbiome. 2023 Feb 1;11(1):16. pmid:36721270
* View Article
* PubMed/NCBI
* Google Scholar
21. 21. Braga LPP, Spor A, Kot W, Breuil MC, Hansen LH, Setubal JC, et al. Impact of phages on soil bacterial communities and nitrogen availability under different assembly scenarios. Microbiome. 2020 Apr 6;8(1):1–14.
* View Article
* Google Scholar
22. 22. Wang Y, Liu Y, Wu Y, Wu N, Liu W, Wang X. Heterogeneity of soil bacterial and bacteriophage communities in three rice agroecosystems and potential impacts of bacteriophage on nutrient cycling. Environ Microbiome. 2022 Apr 6;17(1):17. pmid:35387674
* View Article
* PubMed/NCBI
* Google Scholar
23. 23. Morella NM, Gomez AL, Wang G, Leung MS, Koskella B. The impact of bacteriophages on phyllosphere bacterial abundance and composition. Mol Ecol. 2018;27(8):2025–38. pmid:29457297
* View Article
* PubMed/NCBI
* Google Scholar
24. 24. Santos-Medellin C, Zinke LA, ter Horst AM, Gelardi DL, Parikh SJ, Emerson JB. Viromes outperform total metagenomes in revealing the spatiotemporal patterns of agricultural soil viral communities. ISME J. 2021 Jul;15(7):1956–70. pmid:33612831
* View Article
* PubMed/NCBI
* Google Scholar
25. 25. Thorpe HA, Booton R, Kallonen T, Gibbon MJ, Couto N, Passet V, et al. A large-scale genomic snapshot of Klebsiella spp. isolates in Northern Italy reveals limited transmission between clinical and non-clinical settings. Nat Microbiol. 2022 Dec;7(12):2054–67. pmid:36411354
* View Article
* PubMed/NCBI
* Google Scholar
26. 26. Rocha J, Henriques I, Gomila M, Manaia CM. Common and distinctive genomic features of Klebsiella pneumoniae thriving in the natural environment or in clinical settings. Sci Rep. 2022 Jun 21;12(1):10441. pmid:35729190
* View Article
* PubMed/NCBI
* Google Scholar
27. 27. Podschun R, Pietsch S, Höller C, Ullmann U. Incidence of Klebsiella Species in Surface Waters and Their Expression of Virulence Factors. Appl Environ Microbiol. 2001 Jul;67(7):3325–7. pmid:11425763
* View Article
* PubMed/NCBI
* Google Scholar
28. 28. NCBI Virus [Internet]. [cited 2023 Dec 20]. Available from: https://www.ncbi.nlm.nih.gov/labs/virus/vssi/#/
29. 29. Du C, Zhang Y, Li Y, Cao Z. Novel Redox Potential-Based Screening Strategy for Rapid Isolation of Klebsiella pneumoniae Mutants with Enhanced 1,3-Propanediol-Producing Capability. Appl Environ Microbiol. 2007 Jul;73(14):4515–21. pmid:17513581
* View Article
* PubMed/NCBI
* Google Scholar
30. 30. Du C, Yan H, Zhang Y, Li Y, Cao Z. Use of oxidoreduction potential as an indicator to regulate 1,3-propanediol fermentation by Klebsiella pneumoniae. Appl Microbiol Biotechnol. 2006 Jan;69(5):554–63. pmid:16021488
* View Article
* PubMed/NCBI
* Google Scholar
31. 31. Yu Z, Li S, Li Y, Jiang Z, Zhou J, An Q. Complete genome sequence of N2-fixing model strain Klebsiella sp. nov. M5al, which produces plant cell wall-degrading enzymes and siderophores. Biotechnol Rep. 2018 Mar 1;17:6.
* View Article
* Google Scholar
32. 32. Van Twest R, Kropinski AM. Bacteriophage Enrichment from Water and Soil. In: Clokie MRJ, Kropinski AM, editors. Bacteriophages: Methods and Protocols, Volume 1: Isolation, Characterization, and Interactions [Internet]. Totowa, NJ: Humana Press; 2009. p. 15–21. (Methods in Molecular BiologyTM). Available from: https://doi.org/10.1007/978-1-60327-164-6_2
33. 33. MAVERICLab / vcontact3—Bitbucket [Internet]. [cited 2024 Mar 29]. Available from: https://bitbucket.org/MAVERICLab/vcontact3/src/master/
34. 34. Graham EB, Camargo AP, Wu R, Neches RY, Nolan M, Paez-Espino D, et al. A global atlas of soil viruses reveals unexplored biodiversity and potential biogeochemical impacts. Nat Microbiol. 2024 Jul;9(7):1873–83. pmid:38902374
* View Article
* PubMed/NCBI
* Google Scholar
35. 35. Borton MA, McGivern BB, Willi KR, Woodcroft BJ, Mosier AC, Singleton DM, et al. A functional microbiome catalog crowdsourced from North American rivers [Internet]. bioRxiv; 2023 [cited 2024 Sep 10]. p. 2023.07.22.550117. Available from: pmid:37502915
* View Article
* PubMed/NCBI
* Google Scholar
36. 36. Sun CL, Pratama AA, Gazitúa MC, Cronin D, McGivern BB, Wainaina JM, et al. Virus ecology and 7-year temporal dynamics across a permafrost thaw gradient. Environ Microbiol. 2024;26(8):e16665. pmid:39101434
* View Article
* PubMed/NCBI
* Google Scholar
37. 37. Fudyma JD, ter Horst AM, Santos-Medellín C, Sorensen JW, Gogul GG, Hillary LS, et al. Exploring viral particle, soil, and extraction buffer physicochemical characteristics and their impacts on extractable viral communities. Soil Biol Biochem. 2024 Jul 1;194:109419.
* View Article
* Google Scholar
38. 38. Ma B, Wang Y, Zhao K, Stirling E, Lv X, Yu Y, et al. Biogeographic patterns and drivers of soil viromes. Nat Ecol Evol. 2024 Apr;8(4):717–28. pmid:38383853
* View Article
* PubMed/NCBI
* Google Scholar
39. 39. Herridge WP, Shibu P, O’Shea J, Brook TC, Hoyles L. Bacteriophages of Klebsiella spp., their diversity and potential therapeutic uses. J Med Microbiol. 2020;69(2):176–94. pmid:31976857
* View Article
* PubMed/NCBI
* Google Scholar
40. 40. Aranaga C, Pantoja LD, Martínez EA, Falco A. Phage Therapy in the Era of Multidrug Resistance in Bacteria: A Systematic Review. Int J Mol Sci. 2022 Jan;23(9):4577. pmid:35562968
* View Article
* PubMed/NCBI
* Google Scholar
41. 41. Turner D, Shkoporov AN, Lood C, Millard AD, Dutilh BE, Alfenas-Zerbini P, et al. Abolishment of morphology-based taxa and change to binomial species names: 2022 taxonomy update of the ICTV bacterial viruses subcommittee. Arch Virol. 2023;168(2):74. pmid:36683075
* View Article
* PubMed/NCBI
* Google Scholar
42. 42. Yang JY, Fang W, Miranda-Sanchez F, Brown JM, Kauffman KM, Acevero CM, et al. Degradation of host translational machinery drives tRNA acquisition in viruses. Cell Syst. 2021 Aug 18;12(8):771–779.e5. pmid:34143976
* View Article
* PubMed/NCBI
* Google Scholar
43. 43. Bailly-Bechet M, Vergassola M, Rocha E. Causes for the intriguing presence of tRNAs in phages. Genome Res. 2007 Oct;17(10):1486–95. pmid:17785533
* View Article
* PubMed/NCBI
* Google Scholar
44. 44. Delesalle VA, Tanke NT, Vill AC, Krukonis GP. Testing hypotheses for the presence of tRNA genes in mycobacteriophage genomes. Bacteriophage. 2016 Aug 5;6(3):e1219441. pmid:27738556
* View Article
* PubMed/NCBI
* Google Scholar
45. 45. Villion M, Chopin MC, Deveau H, Ehrlich SD, Moineau S, Chopin A. P087, a lactococcal phage with a morphogenesis module similar to an Enterococcus faecalis prophage. Virology. 2009 May 25;388(1):49–56. pmid:19349056
* View Article
* PubMed/NCBI
* Google Scholar
46. 46. Limor-Waisberg K, Carmi A, Scherz A, Pilpel Y, Furman I. Specialization versus adaptation: two strategies employed by cyanophages to enhance their translation efficiencies. Nucleic Acids Res. 2011 Aug;39(14):6016–28. pmid:21470965
* View Article
* PubMed/NCBI
* Google Scholar
47. 47. Samson JE, Moineau S. Characterization of Lactococcus lactis Phage 949 and Comparison with Other Lactococcal Phages. Appl Environ Microbiol. 2010 Oct;76(20):6843–52. pmid:20802084
* View Article
* PubMed/NCBI
* Google Scholar
48. 48. Cornelissen A, Hardies SC, Shaburova OV, Krylov VN, Mattheus W, Kropinski AM, et al. Complete Genome Sequence of the Giant Virus OBP and Comparative Genome Analysis of the Diverse ϕKZ-Related Phages. J Virol. 2012 Feb;86(3):1844–52.
* View Article
* Google Scholar
49. 49. Prabhakaran R, Chithambaram S, Xia X. Aeromonas phages encode tRNAs for their overused codons. Int J Comput Biol Drug Des. 2014;7(2–3):168–82. pmid:24878728
* View Article
* PubMed/NCBI
* Google Scholar
50. 50. Wittmann J, Dreiseikelmann B, Rohde M, Meier-Kolthoff JP, Bunk B, Rohde C. First genome sequences of Achromobacter phages reveal new members of the N4 family. Virol J. 2014 Jan 27;11(1). pmid:24468270
* View Article
* PubMed/NCBI
* Google Scholar
51. 51. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215(3):403–10. pmid:2231712
* View Article
* PubMed/NCBI
* Google Scholar
52. 52. Nishimura Y, Yoshida T, Kuronishi M, Uehara H, Ogata H, Goto S. ViPTree: the viral proteomic tree server. Bioinforma Oxf Engl. 2017 Aug 1;33(15):2379–80. pmid:28379287
* View Article
* PubMed/NCBI
* Google Scholar
53. 53. Turner D, Kropinski AM, Adriaenssens EM. A Roadmap for Genome-Based Phage Taxonomy. Viruses 2021 Vol 13 Page 506. 2021 Mar 18;13(3):506. pmid:33803862
* View Article
* PubMed/NCBI
* Google Scholar
54. 54. Moraru C, Varsani A, Kropinski AM. VIRIDIC—A Novel Tool to Calculate the Intergenomic Similarities of Prokaryote-Infecting Viruses. Viruses. 2020 Nov;12(11):1268. pmid:33172115
* View Article
* PubMed/NCBI
* Google Scholar
55. 55. Gautreau G, Bazin A, Gachet M, Planel R, Burlot L, Dubois M, et al. PPanGGOLiN: Depicting microbial diversity via a partitioned pangenome graph. PLOS Comput Biol. 2020 Mar 19;16(3):e1007732. pmid:32191703
* View Article
* PubMed/NCBI
* Google Scholar
56. 56. Yahara K, Lehours P, Vale FF. Analysis of genetic recombination and the pan-genome of a highly recombinogenic bacteriophage species. Microb Genomics. 2019 Aug;5(8). pmid:31310202
* View Article
* PubMed/NCBI
* Google Scholar
57. 57. Bukhari SAR, Irfan M, Ahmad I, Chen L. Comparative Genomics and Pan-Genome Driven Prediction of a Reduced Genome of Akkermansia muciniphila. Microorganisms. 2022 Jul;10(7):1350. pmid:35889069
* View Article
* PubMed/NCBI
* Google Scholar
58. 58. Petrov VM, Ratnayaka S, Nolan JM, Miller ES, Karam JD. Genomes of the T4-related bacteriophages as windows on microbial genome evolution. Virol J. 2010 Oct 28;7(1):292. pmid:21029436
* View Article
* PubMed/NCBI
* Google Scholar
59. 59. Sullivan MB, Huang KH, Ignacio-Espinoza JC, Berlin AM, Kelly L, Weigele PR, et al. Genomic analysis of oceanic cyanobacterial myoviruses compared with T4-like myoviruses from diverse hosts and environmentse mi_2280 3035‥3056. Environ Microbiol. 2010;12(11):3035–56.
* View Article
* Google Scholar
60. 60. Comeau AM, Bertrand C, Letarov A, Tétart F, Krisch HM. Modular architecture of the T4 phage superfamily: a conserved core genome and a plastic periphery. Virology. 2007 Jun 5;362(2):384–96. pmid:17289101
* View Article
* PubMed/NCBI
* Google Scholar
61. 61. Filée J, Bapteste E, Susko E, Krisch HM. A Selective Barrier to Horizontal Gene Transfer in the T4-Type Bacteriophages That Has Preserved a Core Genome with the Viral Replication and Structural Genes. Mol Biol Evol. 2006 Sep 1;23(9):1688–96. pmid:16782763
* View Article
* PubMed/NCBI
* Google Scholar
62. 62. Petrov VM, Nolan JM, Bertrand C, Levy D, Desplats C, Krisch HM, et al. Plasticity of the Gene Functions for DNA Replication in the T4-like Phages. J Mol Biol. 2006 Aug 4;361(1):46–68. pmid:16828113
* View Article
* PubMed/NCBI
* Google Scholar
63. 63. Miller ES, Kutter E, Mosig G, Arisaka F, Kunisawa T, Rüger W. Bacteriophage T4 Genome. Microbiol Mol Biol Rev. 2003 Mar;67(1):86–156. pmid:12626685
* View Article
* PubMed/NCBI
* Google Scholar
64. 64. Kutter E, Bryan D, Ray G, Brewster E, Blasdel B, Guttman B. From Host to Phage Metabolism: Hot Tales of Phage T4’s Takeover of E. coli. Viruses. 2018 Jul 21;10(7):387. pmid:30037085
* View Article
* PubMed/NCBI
* Google Scholar
65. 65. Roberts MD, Martin NL, Kropinski AM. The genome and proteome of coliphage T1. Virology. 2004 Jan 5;318(1):245–66. pmid:14972552
* View Article
* PubMed/NCBI
* Google Scholar
66. 66. Steinegger M, Söding J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat Biotechnol. 2017 Nov;35(11):1026–8. pmid:29035372
* View Article
* PubMed/NCBI
* Google Scholar
67. 67. Bujak K, Decewicz P, Kitowicz M, Radlinska M. Characterization of Three Novel Virulent Aeromonas Phages Provides Insights into the Diversity of the Autographiviridae Family. Viruses. 2022 May 10;14(5):1016. pmid:35632757
* View Article
* PubMed/NCBI
* Google Scholar
68. 68. Guo H, Li M, Wang T, Wu H, Zhou H, Xu C, et al. Crystal structure and biochemical studies of the bifunctional DNA primase-polymerase from phage NrS-1. Biochem Biophys Res Commun. 2019 Mar 19;510(4):573–9. pmid:30739783
* View Article
* PubMed/NCBI
* Google Scholar
69. 69. Warwick-Dugdale J, Buchholz HH, Allen MJ, Temperton B. Host-hijacking and planktonic piracy: How phages command the microbial high seas. Virol J. 2019 Feb 1;16(1). pmid:30709355
* View Article
* PubMed/NCBI
* Google Scholar
70. 70. Tian F, Wainaina JM, Howard-Varona C, Dominguez-Huerta G, Bolduc B, Gazitúa MC, et al. Virus-encoded auxiliary metabolic genes throughout the global oceans. Rev.
* View Article
* Google Scholar
71. 71. Pratama AA, Bolduc B, Zayed AA, Zhong ZP, Guo J, Vik DR, et al. Expanding standards in viromics: in silico evaluation of dsDNA viral genome identification, classification, and auxiliary metabolic gene curation. PeerJ. 2021 Jun 14;9:e11447. pmid:34178438
* View Article
* PubMed/NCBI
* Google Scholar
72. 72. Shaffer M, Borton MA, McGivern BB, Zayed AA, La Rosa SL 0003 3527 8101, Solden LM, et al. DRAM for distilling microbial metabolism to automate the curation of microbiome function. Nucleic Acids Res. 2020 Sep 9;48(16):8883. pmid:32766782
* View Article
* PubMed/NCBI
* Google Scholar
73. 73. Gold M, Hurwitz J. THE ENZYMATIC METHYLATION OF RIBONUCLEIC ACID AND DEOXYRIBONUCLEIC ACID. V. PURIFICATION AND PROPERTIES OF THE DEOXYRIBONUCLEIC ACID-METHYLATING ACTIVITY OF ESCHERICHIA COLI. J Biol Chem. 1964 Nov;239:3858–65. pmid:14257620
* View Article
* PubMed/NCBI
* Google Scholar
74. 74. Kolberg M, Strand KR, Graff P, Kristoffer Andersson K. Structure, function, and mechanism of ribonucleotide reductases. Biochim Biophys Acta BBA ‐ Proteins Proteomics. 2004 Jun 1;1699(1):1–34. pmid:15158709
* View Article
* PubMed/NCBI
* Google Scholar
75. 75. Sapienza PJ, Falk BT, Lee AL. Bacterial Thymidylate Synthase Binds Two Molecules of Substrate and Cofactor without Cooperativity. J Am Chem Soc. 2015 Nov 11;137(45):14260. pmid:26517288
* View Article
* PubMed/NCBI
* Google Scholar
76. 76. Herrmann KM, Weaver LM. The Shikimate Pathway. Annu Rev Plant Physiol Plant Mol Biol. 1999;50(1):473–503. pmid:15012217
* View Article
* PubMed/NCBI
* Google Scholar
77. 77. Shende VV, Bauman KD, Moore BS. The shikimate pathway: gateway to metabolic diversity. Nat Prod Rep. 2024;41:604. pmid:38170905
* View Article
* PubMed/NCBI
* Google Scholar
78. 78. Huang N, Sorci L, Zhang X, Brautigan C, Li X, Raffaelli N, et al. Bifunctional NMN adenylyltransferase/ADP ribose pyrophosphatase: structure and function in bacterial NAD metabolism. Struct Lond Engl 1993. 2008 Feb;16(2):196–209. pmid:18275811
* View Article
* PubMed/NCBI
* Google Scholar
79. 79. Lee JY, Li Z, Miller ES. Vibrio Phage KVP40 Encodes a Functional NAD+ Salvage Pathway. J Bacteriol. 2017 Apr 11;199(9):e00855–16. pmid:28167526
* View Article
* PubMed/NCBI
* Google Scholar
80. 80. Goff CG. Chemical Structure of a Modification of the Escherichia coli Ribonucleic Acid Polymerase α Polypeptides Induced by Bacteriophage T4 Infection. J Biol Chem. 1974 Oct 10;249(19):6181–90.
* View Article
* Google Scholar
81. 81. Depping R, Lohaus C, Meyer HE, Rüger W. The mono-ADP-ribosyltransferases Alt and ModB of bacteriophage T4: Target proteins identified. Biochem Biophys Res Commun. 2005 Oct 7;335(4):1217–23. pmid:16112649
* View Article
* PubMed/NCBI
* Google Scholar
82. 82. Pan YJ, Lin TL, Chen CT, Chen YY, Hsieh PF, Hsu CR, et al. Genetic analysis of capsular polysaccharide synthesis gene clusters in 79 capsular types of Klebsiella spp. Sci Rep. 2015 Oct 23;5(1):15573. pmid:26493302
* View Article
* PubMed/NCBI
* Google Scholar
83. 83. Dunstan RA, Bamert RS, Belousoff MJ, Short FL, Barlow CK, Pickard DJ, et al. Mechanistic Insights into the Capsule-Targeting Depolymerase from a Klebsiella pneumoniae Bacteriophage. Microbiol Spectr. 2021 Sep 3;9(1):e0102321. pmid:34431721
* View Article
* PubMed/NCBI
* Google Scholar
84. 84. Beamud B, García-González N, Gómez-Ortega M, González-Candelas F, Domingo-Calap P, Sanjuan R. Genetic determinants of host tropism in Klebsiella phages. Cell Rep. 2023 Feb;42(2):112048. pmid:36753420
* View Article
* PubMed/NCBI
* Google Scholar
85. 85. Lourenço M, Osbelt L, Passet V, Gravey F, Megrian D, Strowig T, et al. Phages against Noncapsulated Klebsiella pneumoniae: Broader Host range, Slower Resistance. Microbiol Spectr. 2023 Jun 20;0(0):e04812–22. pmid:37338376
* View Article
* PubMed/NCBI
* Google Scholar
86. 86. Smith-zaitlik T, Shibu P, Mccartney AL, Foster G, Hoyles L, Hoyles L, et al. Extended genomic analyses of the broad-host-range phages vB _ KmiM-2Di and vB _ KmiM-4Dii reveal slopekviruses have highly conserved genomes. 2022.
* View Article
* Google Scholar
87. 87. Maciejewska B, Roszniowski B, Espaillat A, Kęsik-Szeloch A, Majkowska-Skrobek G, Kropinski AM, et al. Klebsiella phages representing a novel clade of viruses with an unknown DNA modification and biotechnologically interesting enzymes. Appl Microbiol Biotechnol. 2017 Jan 1;101(2):673–84. pmid:27766357
* View Article
* PubMed/NCBI
* Google Scholar
88. 88. Guo X, Söderholm A, Kanchugal P S, Isaksen GV, Warsi O, Eckhard U, et al. Structure and mechanism of a phage-encoded SAM lyase revises catalytic function of enzyme family. Cui Q, Wolberger C, Moliner V, Tunon I, editors. eLife. 2021 Feb 10;10:e61818. pmid:33567250
* View Article
* PubMed/NCBI
* Google Scholar
89. 89. Klemm BP, Singh D, Smith CE, Hsu AL, Dillard LB, Krahn JM, et al. Mechanism by which T7 bacteriophage protein Gp1.2 inhibits Escherichia coli dGTPase. Proc Natl Acad Sci U S A. 2022 Sep 13;119(37):e2123092119. pmid:36067314
* View Article
* PubMed/NCBI
* Google Scholar
90. 90. Howard-Varona C, Lindback MM, Fudyma JD, Krongauz A, Solonenko NE, Zayed AA, et al. Environment-specific virocell metabolic reprogramming. ISME J. 2024 Mar 29;wrae055. pmid:38552150
* View Article
* PubMed/NCBI
* Google Scholar
91. 91. Kutter E, Sulakvelidze A, editors. Bacteriophages Biology and Applications. New York: CRC Press; 2005. 483–488 p.
92. 92. Jakočiūnė D, Moodley A. A Rapid Bacteriophage DNA Extraction Method. Methods Protoc. 2018 Jul 27;1(3):27. pmid:31164569
* View Article
* PubMed/NCBI
* Google Scholar
93. 93. Turner D, Adriaenssens EM, Tolstoy I, Kropinski AM. Phage Annotation Guide: Guidelines for Assembly and High-Quality Annotation. https://home.liebertpub.com/phage. 2021 Dec 16;2(4):170–82.
* View Article
* Google Scholar
94. 94. Shen A, Millard A. Phage Genome Annotation: Where to Begin and End. https://home.liebertpub.com/phage. 2021 Dec 16;2(4):183–93.
* View Article
* Google Scholar
95. 95. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014 Aug 1;30(15):2114–20. pmid:24695404
* View Article
* PubMed/NCBI
* Google Scholar
96. 96. Andrews S. FastQC [Internet]. 2023 [cited 2023 Jul 6]. Available from: https://github.com/s-andrews/FastQC
* View Article
* Google Scholar
97. 97. Rognes T, Flouri T, Nichols B, Quince C, Mahé F. VSEARCH: a versatile open source tool for metagenomics. PeerJ. 2016;4:e2584. pmid:27781170
* View Article
* PubMed/NCBI
* Google Scholar
98. 98. Prjibelski A, Antipov D, Meleshko D, Lapidus A, Korobeynikov A. Using SPAdes De Novo Assembler. Curr Protoc Bioinforma. 2020;70(1):e102. pmid:32559359
* View Article
* PubMed/NCBI
* Google Scholar
99. 99. Wick RR, Schultz MB, Zobel J, Holt KE. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics. 2015 Oct 15;31(20):3350–2. pmid:26099265
* View Article
* PubMed/NCBI
* Google Scholar
100. 100. BBMap [Internet]. SourceForge. 2022 [cited 2023 Jul 6]. Available from: https://sourceforge.net/projects/bbmap/
101. 101. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, et al. Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement. PLOS ONE. 2014 Nov 19;9(11):e112963. pmid:25409509
* View Article
* PubMed/NCBI
* Google Scholar
102. 102. Garneau JR, Depardieu F, Fortier LC, Bikard D, Monot M. PhageTerm: A tool for fast and accurate determination of phage termini and packaging mechanism using next-generation sequencing data. Sci Rep. 2017 Dec 1;7(1). pmid:28811656
* View Article
* PubMed/NCBI
* Google Scholar
103. 103. Wick R. rrwick/Porechop [Internet]. 2024 [cited 2024 May 7]. Available from: https://github.com/rrwick/Porechop
104. 104. Salisbury A, Tsourkas PK. A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation. Int J Mol Sci 2019 Vol 20 Page 3391. 2019 Jul 10;20(14):3391. pmid:31295925
* View Article
* PubMed/NCBI
* Google Scholar
105. 105. Lazeroff M, Ryder G, Harris SL, Tsourkas PK. Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs. https://home.liebertpub.com/phage. 2021 Dec 16;2(4):204–13.
* View Article
* Google Scholar
106. 106. Besemer J, Borodovsky M. Heuristic approach to deriving models for gene finding. Nucleic Acids Res. 1999 Oct 1;27(19):3911–20. pmid:10481031
* View Article
* PubMed/NCBI
* Google Scholar
107. 107. Besemer J, Lomsadze A, Borodovsky M. GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. Nucleic Acids Res. 2001 Jun 15;29(12):2607–18. pmid:11410670
* View Article
* PubMed/NCBI
* Google Scholar
108. 108. Lomsadze A, Gemayel K, Tang S, Borodovsky M. Modeling leaderless transcription and atypical genes results in more accurate gene prediction in prokaryotes. Genome Res. 2018 Jul;28(7):1079–89. pmid:29773659
* View Article
* PubMed/NCBI
* Google Scholar
109. 109. Zhu W, Lomsadze A, Borodovsky M. Ab initio gene identification in metagenomic sequences. Nucleic Acids Res. 2010 Jul 1;38(12):e132. pmid:20403810
* View Article
* PubMed/NCBI
* Google Scholar
110. 110. Noguchi H, Taniguchi T, Itoh T. MetaGeneAnnotator: Detecting Species-Specific Patterns of Ribosomal Binding Site for Precise Gene Prediction in Anonymous Prokaryotic and Phage Genomes. DNA Res Int J Rapid Publ Rep Genes Genomes. 2008 Dec;15(6):387–96. pmid:18940874
* View Article
* PubMed/NCBI
* Google Scholar
111. 111. Delcher AL, Bratke KA, Powers EC, Salzberg SL. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinforma Oxf Engl. 2007 Mar 15;23(6):673–9. pmid:17237039
* View Article
* PubMed/NCBI
* Google Scholar
112. 112. Hyatt D, Chen GL, LoCascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010 Mar 8;11(1):119. pmid:20211023
* View Article
* PubMed/NCBI
* Google Scholar
113. 113. McNair K, Zhou C, Dinsdale EA, Souza B, Edwards RA. PHANOTATE: a novel approach to gene identification in phage genomes. Bioinformatics. 2019 Nov 15;35(22):4537–42. pmid:31329826
* View Article
* PubMed/NCBI
* Google Scholar
114. 114. Zimmermann L, Stephens A, Nam SZ, Rau D, Kübler J, Lozajic M, et al. A Completely Reimplemented MPI Bioinformatics Toolkit with a New HHpred Server at its Core. J Mol Biol. 2018 Jul 20;430(15):2237–43. pmid:29258817
* View Article
* PubMed/NCBI
* Google Scholar
115. 115. Ramsey J, Rasche H, Maughmer C, Criscione A, Mijalis E, Liu M, et al. Galaxy and Apollo as a biologist-friendly interface for high-quality cooperative phage genome annotation. PLOS Comput Biol. 2020 Nov 2;16(11):e1008214. pmid:33137082
* View Article
* PubMed/NCBI
* Google Scholar
116. 116. Hadley Wickham. ggplot2: Elegant Graphics for Data Analysis [Internet]. Springer-Verlag New York; 2016. Available from: https://ggplot2.tidyverse.org
* View Article
* Google Scholar
117. 117. Katoh K, Standley DM. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol Biol Evol. 2013 Apr 1;30(4):772–80. pmid:23329690
* View Article
* PubMed/NCBI
* Google Scholar
118. 118. Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol Biol Evol. 2020 May 1;37(5):1530–4. pmid:32011700
* View Article
* PubMed/NCBI
* Google Scholar
119. 119. Hoang DT, Chernomor O, von Haeseler A, Minh BQ, Vinh LS. UFBoot2: Improving the Ultrafast Bootstrap Approximation. Mol Biol Evol. 2018 Feb 1;35(2):518–22. pmid:29077904
* View Article
* PubMed/NCBI
* Google Scholar
120. 120. Chernomor O, von Haeseler A, Minh BQ. Terrace Aware Data Structure for Phylogenomic Inference from Supermatrices. Syst Biol. 2016 Nov 1;65(6):997–1008. pmid:27121966
* View Article
* PubMed/NCBI
* Google Scholar
121. 121. Letunic I, Bork P. Interactive Tree of Life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. 2024 Apr 13;gkae268.
* View Article
* Google Scholar
122. 122. Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009 Aug 1;25(15):1972–3. pmid:19505945
* View Article
* PubMed/NCBI
* Google Scholar
123. 123. Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009 May 1;25(9):1189–91. pmid:19151095
* View Article
* PubMed/NCBI
* Google Scholar
124. 124. Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017 Jun;14(6):587–9. pmid:28481363
* View Article
* PubMed/NCBI
* Google Scholar
125. 125. Camargo AP, Roux S, Schulz F, Babinski M, Xu Y, Hu B, et al. Identification of mobile genetic elements with geNomad. Nat Biotechnol. 2024 Aug;42(8):1303–12. pmid:37735266
* View Article
* PubMed/NCBI
* Google Scholar
126. 126. Nayfach S, Camargo AP, Schulz F, Eloe-Fadrosh E, Roux S, Kyrpides NC. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat Biotechnol. 2021 May;39(5):578–85. pmid:33349699
* View Article
* PubMed/NCBI
* Google Scholar
127. 127. Aroney STN, Newell RJP, Nissen J, Camargo AP, Tyson GW, Woodcroft BJ. CoverM: Read coverage calculator for metagenomics [Internet]. 2024 [cited 2024 Sep 16]. Available from: https://github.com/wwood/CoverM
* View Article
* Google Scholar
128. 128. Gilchrist CLM, Chooi YH. Clinker & clustermap.js: Automatic generation of gene cluster comparison figures. Bioinformatics. 2021;37(16):2473–5.
* View Article
* Google Scholar
Citation: Gittrich MR, Sanderson CM, Wainaina JM, Noel CM, Leopold JE, Babusci E, et al. (2025) Isolation and characterization of 24 phages infecting the plant growth-promoting rhizobacterium Klebsiella sp. M5al. PLoS ONE 20(2): e0313947. https://doi.org/10.1371/journal.pone.0313947
About the Authors:
Marissa R. Gittrich
Roles: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Validation, Visualization, Writing – original draft, Writing – review & editing
Affiliations: Department of Microbiology, The Ohio State University, Columbus, Ohio, United States of America, Center of Microbiome Science, The Ohio State University, Columbus, Ohio, United States of America
ORICD: https://orcid.org/0000-0003-0314-1426
Courtney M. Sanderson
Roles: Data curation, Formal analysis, Methodology, Visualization, Writing – review & editing
Affiliations: Department of Microbiology, The Ohio State University, Columbus, Ohio, United States of America, Center of Microbiome Science, The Ohio State University, Columbus, Ohio, United States of America
James M. Wainaina
Roles: Formal analysis, Methodology, Visualization, Writing – review & editing
Affiliations: Department of Microbiology, The Ohio State University, Columbus, Ohio, United States of America, Center of Microbiome Science, The Ohio State University, Columbus, Ohio, United States of America, Department of Biology, Woods Hole Oceanographic Institution, Woods Hole, Massachusetts, United States of America
Cara M. Noel
Roles: Data curation, Formal analysis, Investigation, Writing – review & editing
Affiliation: Department of Microbiology, The Ohio State University, Columbus, Ohio, United States of America
ORICD: https://orcid.org/0009-0009-0433-2719
Jonathan E. Leopold
Roles: Data curation, Formal analysis, Investigation, Writing – review & editing
Affiliation: Department of Microbiology, The Ohio State University, Columbus, Ohio, United States of America
Erica Babusci
Roles: Formal analysis, Investigation, Writing – review & editing
Affiliation: School of the Environment and Natural Resources, The Ohio State University, Columbus, Ohio, United States of America
Sumeyra C. Selbes
Roles: Data curation, Investigation, Writing – review & editing
Affiliation: Department of Psychology, The Ohio State University, Columbus, Ohio, United States of America
Olivia R. Farinas
Roles: Data curation, Investigation, Writing – review & editing
Affiliation: College of Public Health, Division of Environmental Health Sciences, The Ohio State University, Columbus, Ohio, United States of America
Jack Caine
Roles: Data curation, Investigation
Affiliation: Department of Biology, Kenyon College, Gambier, Ohio, United States of America
Joshua Davis II
Roles: Data curation, Investigation
Affiliation: Department of Microbiology, The Ohio State University, Columbus, Ohio, United States of America
Vivek K. Mutalik
Roles: Conceptualization, Funding acquisition, Project administration, Supervision, Validation, Writing – original draft, Writing – review & editing
Affiliation: Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
Paul Hyman
Roles: Conceptualization, Funding acquisition, Methodology, Resources, Writing – original draft, Writing – review & editing
Affiliation: Department of Biology/Toxicology, Ashland University, Ashland, Ohio, United States of America
ORICD: https://orcid.org/0000-0002-0108-4993
Matthew B. Sullivan
Roles: Funding acquisition, Supervision, Writing – original draft, Writing – review & editing
E-mail: [email protected]
Affiliations: Department of Microbiology, The Ohio State University, Columbus, Ohio, United States of America, Center of Microbiome Science, The Ohio State University, Columbus, Ohio, United States of America, Department of Civil, Environmental, and Geodetic Engineering, The Ohio State University, Columbus, Ohio, United States of America
ORICD: https://orcid.org/0000-0001-8398-8234
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
[/RAW_REF_TEXT]
1. Batinovic S, Wassef F, Knowler SA, Rice DTF, Stanton CR, Rose J, et al. Bacteriophages in Natural and Artificial Environments. Pathogens. 2019 Sep;8(3):100. pmid:31336985
2. Suttle CA. Marine viruses ‐ Major players in the global ecosystem. Nat Rev Microbiol. 2007 Oct;5(10):801–12. pmid:17853907
3. Fuhrman JA. Marine viruses and their biogeochemical and ecological effects. Nature. 1999 Jun;399(6736):541–8. pmid:10376593
4. Howard-Varona C, Lindback MM, Bastien GE, Solonenko N, Zayed AA, Jang HB, et al. Phage-specific metabolic reprogramming of virocells. ISME J. 2020 Jan 2;14(4):881–95. pmid:31896786
5. Sullivan MB, Lindell D, Lee JA, Thompson LR, Bielawski JP, Chisholm SW. Prevalence and Evolution of Core Photosystem II Genes in Marine Cyanobacterial Viruses and Their Hosts. PLOS Biol. 2006 Jul 4;4(8):e234. pmid:16802857
6. Łusiak-Szelachowska M, Weber-Dąbrowska B, Żaczek M, Borysowski J, Górski A. The Presence of Bacteriophages in the Human Body: Good, Bad or Neutral? Microorganisms. 2020 Dec 16;8(12):2012. pmid:33339331
7. Townsend EM, Kelly L, Gannon L, Muscatt G, Dunstan R, Michniewski S, et al. Isolation and Characterization of Klebsiella Phages for Phage Therapy. Phage. 2021;2(1):26–42. pmid:33796863
8. Emerson JB, Roux S, Brum JR, Bolduc B, Woodcroft BJ, Jang HB, et al. Host-linked soil viral ecology along a permafrost thaw gradient. Nat Microbiol. 2018;3(8):870–80. pmid:30013236
9. Trubl G, Jang HB, Roux S, Emerson JB, Solonenko N, Vik DR, et al. Soil Viruses Are Underexplored Players in Ecosystem Carbon Processing. mSystems. 2018 Oct 2;3(5):e00076–18.
10. Daly RA, Roux S, Borton MA, Morgan DM, Johnston MD, Booker AE, et al. Viruses control dominant bacteria colonizing the terrestrial deep biosphere after hydraulic fracturing. Nat Microbiol. 2019 Feb;4(2):352–61. pmid:30510171
11. Amundson KK, Borton MA, Daly RA, Hoyt DW, Wong A, Eder E, et al. Microbial colonization and persistence in deep fractured shales is guided by metabolic exchanges and viral predation. Microbiome. 2022 Jan 16;10(1):5. pmid:35034639
12. Zhong ZP, Tian F, Roux S, Gazitúa MC, Solonenko NE, Li YF, et al. Glacier ice archives nearly 15,000-year-old microbes and phages. Microbiome. 2021 Jul 20;9(1):160. pmid:34281625
13. Zhong ZP, Rapp JZ, Wainaina JM, Solonenko NE, Maughan H, Carpenter SD, et al. Viral Ecogenomics of Arctic Cryopeg Brine and Sea Ice. mSystems. 2020 Jun 16;5(3): pmid:32546670
14. Chevallereau A, Pons BJ, van Houte S, Westra ER. Interactions between bacterial and phage communities in natural environments. Nat Rev Microbiol. 2022 Jan;20(1):49–62. pmid:34373631
15. Williamson KE, Fuhrmann JJ, Wommack KE, Radosevich M. Viruses in Soil Ecosystems: An Unknown Quantity Within an Unexplored Territory. Httpsdoiorg101146annurev-Virol-101416-041639. 2017 Sep 29;4:201–19. pmid:28961409
16. Kuzyakov Y, Mason-Jones K. Viruses in soil: Nano-scale undead drivers of microbial life, biogeochemical turnover and ecosystem functions. Soil Biol Biochem. 2018 Dec 1;127:305–17.
17. Villalpando-Aguilar JL, Matos-Pech G, López-Rosas I, Castelán-Sánchez HG, Alatorre-Cobos F. Phage Therapy for Crops: Concepts, Experimental and Bioinformatics Approaches to Direct Its Application. Int J Mol Sci. 2022 Dec 25;24(1):325. pmid:36613768
18. Svircev A, Roach D, Castle A. Framing the Future with Bacteriophages in Agriculture. Viruses. 2018 Apr 25;10(5):218. pmid:29693561
19. Sieiro C, Areal-Hermida L, Pichardo-Gallardo Á, Almuiña-González R, de Miguel T, Sánchez S, et al. A Hundred Years of Bacteriophages: Can Phages Replace Antibiotics in Agriculture and Aquaculture? Antibiotics. 2020 Aug;9(8):493. pmid:32784768
20. Yang K, Wang X, Hou R, Lu C, Fan Z, Li J, et al. Rhizosphere phage communities drive soil suppressiveness to bacterial wilt disease. Microbiome. 2023 Feb 1;11(1):16. pmid:36721270
21. Braga LPP, Spor A, Kot W, Breuil MC, Hansen LH, Setubal JC, et al. Impact of phages on soil bacterial communities and nitrogen availability under different assembly scenarios. Microbiome. 2020 Apr 6;8(1):1–14.
22. Wang Y, Liu Y, Wu Y, Wu N, Liu W, Wang X. Heterogeneity of soil bacterial and bacteriophage communities in three rice agroecosystems and potential impacts of bacteriophage on nutrient cycling. Environ Microbiome. 2022 Apr 6;17(1):17. pmid:35387674
23. Morella NM, Gomez AL, Wang G, Leung MS, Koskella B. The impact of bacteriophages on phyllosphere bacterial abundance and composition. Mol Ecol. 2018;27(8):2025–38. pmid:29457297
24. Santos-Medellin C, Zinke LA, ter Horst AM, Gelardi DL, Parikh SJ, Emerson JB. Viromes outperform total metagenomes in revealing the spatiotemporal patterns of agricultural soil viral communities. ISME J. 2021 Jul;15(7):1956–70. pmid:33612831
25. Thorpe HA, Booton R, Kallonen T, Gibbon MJ, Couto N, Passet V, et al. A large-scale genomic snapshot of Klebsiella spp. isolates in Northern Italy reveals limited transmission between clinical and non-clinical settings. Nat Microbiol. 2022 Dec;7(12):2054–67. pmid:36411354
26. Rocha J, Henriques I, Gomila M, Manaia CM. Common and distinctive genomic features of Klebsiella pneumoniae thriving in the natural environment or in clinical settings. Sci Rep. 2022 Jun 21;12(1):10441. pmid:35729190
27. Podschun R, Pietsch S, Höller C, Ullmann U. Incidence of Klebsiella Species in Surface Waters and Their Expression of Virulence Factors. Appl Environ Microbiol. 2001 Jul;67(7):3325–7. pmid:11425763
28. NCBI Virus [Internet]. [cited 2023 Dec 20]. Available from: https://www.ncbi.nlm.nih.gov/labs/virus/vssi/#/
29. Du C, Zhang Y, Li Y, Cao Z. Novel Redox Potential-Based Screening Strategy for Rapid Isolation of Klebsiella pneumoniae Mutants with Enhanced 1,3-Propanediol-Producing Capability. Appl Environ Microbiol. 2007 Jul;73(14):4515–21. pmid:17513581
30. Du C, Yan H, Zhang Y, Li Y, Cao Z. Use of oxidoreduction potential as an indicator to regulate 1,3-propanediol fermentation by Klebsiella pneumoniae. Appl Microbiol Biotechnol. 2006 Jan;69(5):554–63. pmid:16021488
31. Yu Z, Li S, Li Y, Jiang Z, Zhou J, An Q. Complete genome sequence of N2-fixing model strain Klebsiella sp. nov. M5al, which produces plant cell wall-degrading enzymes and siderophores. Biotechnol Rep. 2018 Mar 1;17:6.
32. Van Twest R, Kropinski AM. Bacteriophage Enrichment from Water and Soil. In: Clokie MRJ, Kropinski AM, editors. Bacteriophages: Methods and Protocols, Volume 1: Isolation, Characterization, and Interactions [Internet]. Totowa, NJ: Humana Press; 2009. p. 15–21. (Methods in Molecular BiologyTM). Available from: https://doi.org/10.1007/978-1-60327-164-6_2
33. MAVERICLab / vcontact3—Bitbucket [Internet]. [cited 2024 Mar 29]. Available from: https://bitbucket.org/MAVERICLab/vcontact3/src/master/
34. Graham EB, Camargo AP, Wu R, Neches RY, Nolan M, Paez-Espino D, et al. A global atlas of soil viruses reveals unexplored biodiversity and potential biogeochemical impacts. Nat Microbiol. 2024 Jul;9(7):1873–83. pmid:38902374
35. Borton MA, McGivern BB, Willi KR, Woodcroft BJ, Mosier AC, Singleton DM, et al. A functional microbiome catalog crowdsourced from North American rivers [Internet]. bioRxiv; 2023 [cited 2024 Sep 10]. p. 2023.07.22.550117. Available from: pmid:37502915
36. Sun CL, Pratama AA, Gazitúa MC, Cronin D, McGivern BB, Wainaina JM, et al. Virus ecology and 7-year temporal dynamics across a permafrost thaw gradient. Environ Microbiol. 2024;26(8):e16665. pmid:39101434
37. Fudyma JD, ter Horst AM, Santos-Medellín C, Sorensen JW, Gogul GG, Hillary LS, et al. Exploring viral particle, soil, and extraction buffer physicochemical characteristics and their impacts on extractable viral communities. Soil Biol Biochem. 2024 Jul 1;194:109419.
38. Ma B, Wang Y, Zhao K, Stirling E, Lv X, Yu Y, et al. Biogeographic patterns and drivers of soil viromes. Nat Ecol Evol. 2024 Apr;8(4):717–28. pmid:38383853
39. Herridge WP, Shibu P, O’Shea J, Brook TC, Hoyles L. Bacteriophages of Klebsiella spp., their diversity and potential therapeutic uses. J Med Microbiol. 2020;69(2):176–94. pmid:31976857
40. Aranaga C, Pantoja LD, Martínez EA, Falco A. Phage Therapy in the Era of Multidrug Resistance in Bacteria: A Systematic Review. Int J Mol Sci. 2022 Jan;23(9):4577. pmid:35562968
41. Turner D, Shkoporov AN, Lood C, Millard AD, Dutilh BE, Alfenas-Zerbini P, et al. Abolishment of morphology-based taxa and change to binomial species names: 2022 taxonomy update of the ICTV bacterial viruses subcommittee. Arch Virol. 2023;168(2):74. pmid:36683075
42. Yang JY, Fang W, Miranda-Sanchez F, Brown JM, Kauffman KM, Acevero CM, et al. Degradation of host translational machinery drives tRNA acquisition in viruses. Cell Syst. 2021 Aug 18;12(8):771–779.e5. pmid:34143976
43. Bailly-Bechet M, Vergassola M, Rocha E. Causes for the intriguing presence of tRNAs in phages. Genome Res. 2007 Oct;17(10):1486–95. pmid:17785533
44. Delesalle VA, Tanke NT, Vill AC, Krukonis GP. Testing hypotheses for the presence of tRNA genes in mycobacteriophage genomes. Bacteriophage. 2016 Aug 5;6(3):e1219441. pmid:27738556
45. Villion M, Chopin MC, Deveau H, Ehrlich SD, Moineau S, Chopin A. P087, a lactococcal phage with a morphogenesis module similar to an Enterococcus faecalis prophage. Virology. 2009 May 25;388(1):49–56. pmid:19349056
46. Limor-Waisberg K, Carmi A, Scherz A, Pilpel Y, Furman I. Specialization versus adaptation: two strategies employed by cyanophages to enhance their translation efficiencies. Nucleic Acids Res. 2011 Aug;39(14):6016–28. pmid:21470965
47. Samson JE, Moineau S. Characterization of Lactococcus lactis Phage 949 and Comparison with Other Lactococcal Phages. Appl Environ Microbiol. 2010 Oct;76(20):6843–52. pmid:20802084
48. Cornelissen A, Hardies SC, Shaburova OV, Krylov VN, Mattheus W, Kropinski AM, et al. Complete Genome Sequence of the Giant Virus OBP and Comparative Genome Analysis of the Diverse ϕKZ-Related Phages. J Virol. 2012 Feb;86(3):1844–52.
49. Prabhakaran R, Chithambaram S, Xia X. Aeromonas phages encode tRNAs for their overused codons. Int J Comput Biol Drug Des. 2014;7(2–3):168–82. pmid:24878728
50. Wittmann J, Dreiseikelmann B, Rohde M, Meier-Kolthoff JP, Bunk B, Rohde C. First genome sequences of Achromobacter phages reveal new members of the N4 family. Virol J. 2014 Jan 27;11(1). pmid:24468270
51. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215(3):403–10. pmid:2231712
52. Nishimura Y, Yoshida T, Kuronishi M, Uehara H, Ogata H, Goto S. ViPTree: the viral proteomic tree server. Bioinforma Oxf Engl. 2017 Aug 1;33(15):2379–80. pmid:28379287
53. Turner D, Kropinski AM, Adriaenssens EM. A Roadmap for Genome-Based Phage Taxonomy. Viruses 2021 Vol 13 Page 506. 2021 Mar 18;13(3):506. pmid:33803862
54. Moraru C, Varsani A, Kropinski AM. VIRIDIC—A Novel Tool to Calculate the Intergenomic Similarities of Prokaryote-Infecting Viruses. Viruses. 2020 Nov;12(11):1268. pmid:33172115
55. Gautreau G, Bazin A, Gachet M, Planel R, Burlot L, Dubois M, et al. PPanGGOLiN: Depicting microbial diversity via a partitioned pangenome graph. PLOS Comput Biol. 2020 Mar 19;16(3):e1007732. pmid:32191703
56. Yahara K, Lehours P, Vale FF. Analysis of genetic recombination and the pan-genome of a highly recombinogenic bacteriophage species. Microb Genomics. 2019 Aug;5(8). pmid:31310202
57. Bukhari SAR, Irfan M, Ahmad I, Chen L. Comparative Genomics and Pan-Genome Driven Prediction of a Reduced Genome of Akkermansia muciniphila. Microorganisms. 2022 Jul;10(7):1350. pmid:35889069
58. Petrov VM, Ratnayaka S, Nolan JM, Miller ES, Karam JD. Genomes of the T4-related bacteriophages as windows on microbial genome evolution. Virol J. 2010 Oct 28;7(1):292. pmid:21029436
59. Sullivan MB, Huang KH, Ignacio-Espinoza JC, Berlin AM, Kelly L, Weigele PR, et al. Genomic analysis of oceanic cyanobacterial myoviruses compared with T4-like myoviruses from diverse hosts and environmentse mi_2280 3035‥3056. Environ Microbiol. 2010;12(11):3035–56.
60. Comeau AM, Bertrand C, Letarov A, Tétart F, Krisch HM. Modular architecture of the T4 phage superfamily: a conserved core genome and a plastic periphery. Virology. 2007 Jun 5;362(2):384–96. pmid:17289101
61. Filée J, Bapteste E, Susko E, Krisch HM. A Selective Barrier to Horizontal Gene Transfer in the T4-Type Bacteriophages That Has Preserved a Core Genome with the Viral Replication and Structural Genes. Mol Biol Evol. 2006 Sep 1;23(9):1688–96. pmid:16782763
62. Petrov VM, Nolan JM, Bertrand C, Levy D, Desplats C, Krisch HM, et al. Plasticity of the Gene Functions for DNA Replication in the T4-like Phages. J Mol Biol. 2006 Aug 4;361(1):46–68. pmid:16828113
63. Miller ES, Kutter E, Mosig G, Arisaka F, Kunisawa T, Rüger W. Bacteriophage T4 Genome. Microbiol Mol Biol Rev. 2003 Mar;67(1):86–156. pmid:12626685
64. Kutter E, Bryan D, Ray G, Brewster E, Blasdel B, Guttman B. From Host to Phage Metabolism: Hot Tales of Phage T4’s Takeover of E. coli. Viruses. 2018 Jul 21;10(7):387. pmid:30037085
65. Roberts MD, Martin NL, Kropinski AM. The genome and proteome of coliphage T1. Virology. 2004 Jan 5;318(1):245–66. pmid:14972552
66. Steinegger M, Söding J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat Biotechnol. 2017 Nov;35(11):1026–8. pmid:29035372
67. Bujak K, Decewicz P, Kitowicz M, Radlinska M. Characterization of Three Novel Virulent Aeromonas Phages Provides Insights into the Diversity of the Autographiviridae Family. Viruses. 2022 May 10;14(5):1016. pmid:35632757
68. Guo H, Li M, Wang T, Wu H, Zhou H, Xu C, et al. Crystal structure and biochemical studies of the bifunctional DNA primase-polymerase from phage NrS-1. Biochem Biophys Res Commun. 2019 Mar 19;510(4):573–9. pmid:30739783
69. Warwick-Dugdale J, Buchholz HH, Allen MJ, Temperton B. Host-hijacking and planktonic piracy: How phages command the microbial high seas. Virol J. 2019 Feb 1;16(1). pmid:30709355
70. Tian F, Wainaina JM, Howard-Varona C, Dominguez-Huerta G, Bolduc B, Gazitúa MC, et al. Virus-encoded auxiliary metabolic genes throughout the global oceans. Rev.
71. Pratama AA, Bolduc B, Zayed AA, Zhong ZP, Guo J, Vik DR, et al. Expanding standards in viromics: in silico evaluation of dsDNA viral genome identification, classification, and auxiliary metabolic gene curation. PeerJ. 2021 Jun 14;9:e11447. pmid:34178438
72. Shaffer M, Borton MA, McGivern BB, Zayed AA, La Rosa SL 0003 3527 8101, Solden LM, et al. DRAM for distilling microbial metabolism to automate the curation of microbiome function. Nucleic Acids Res. 2020 Sep 9;48(16):8883. pmid:32766782
73. Gold M, Hurwitz J. THE ENZYMATIC METHYLATION OF RIBONUCLEIC ACID AND DEOXYRIBONUCLEIC ACID. V. PURIFICATION AND PROPERTIES OF THE DEOXYRIBONUCLEIC ACID-METHYLATING ACTIVITY OF ESCHERICHIA COLI. J Biol Chem. 1964 Nov;239:3858–65. pmid:14257620
74. Kolberg M, Strand KR, Graff P, Kristoffer Andersson K. Structure, function, and mechanism of ribonucleotide reductases. Biochim Biophys Acta BBA ‐ Proteins Proteomics. 2004 Jun 1;1699(1):1–34. pmid:15158709
75. Sapienza PJ, Falk BT, Lee AL. Bacterial Thymidylate Synthase Binds Two Molecules of Substrate and Cofactor without Cooperativity. J Am Chem Soc. 2015 Nov 11;137(45):14260. pmid:26517288
76. Herrmann KM, Weaver LM. The Shikimate Pathway. Annu Rev Plant Physiol Plant Mol Biol. 1999;50(1):473–503. pmid:15012217
77. Shende VV, Bauman KD, Moore BS. The shikimate pathway: gateway to metabolic diversity. Nat Prod Rep. 2024;41:604. pmid:38170905
78. Huang N, Sorci L, Zhang X, Brautigan C, Li X, Raffaelli N, et al. Bifunctional NMN adenylyltransferase/ADP ribose pyrophosphatase: structure and function in bacterial NAD metabolism. Struct Lond Engl 1993. 2008 Feb;16(2):196–209. pmid:18275811
79. Lee JY, Li Z, Miller ES. Vibrio Phage KVP40 Encodes a Functional NAD+ Salvage Pathway. J Bacteriol. 2017 Apr 11;199(9):e00855–16. pmid:28167526
80. Goff CG. Chemical Structure of a Modification of the Escherichia coli Ribonucleic Acid Polymerase α Polypeptides Induced by Bacteriophage T4 Infection. J Biol Chem. 1974 Oct 10;249(19):6181–90.
81. Depping R, Lohaus C, Meyer HE, Rüger W. The mono-ADP-ribosyltransferases Alt and ModB of bacteriophage T4: Target proteins identified. Biochem Biophys Res Commun. 2005 Oct 7;335(4):1217–23. pmid:16112649
82. Pan YJ, Lin TL, Chen CT, Chen YY, Hsieh PF, Hsu CR, et al. Genetic analysis of capsular polysaccharide synthesis gene clusters in 79 capsular types of Klebsiella spp. Sci Rep. 2015 Oct 23;5(1):15573. pmid:26493302
83. Dunstan RA, Bamert RS, Belousoff MJ, Short FL, Barlow CK, Pickard DJ, et al. Mechanistic Insights into the Capsule-Targeting Depolymerase from a Klebsiella pneumoniae Bacteriophage. Microbiol Spectr. 2021 Sep 3;9(1):e0102321. pmid:34431721
84. Beamud B, García-González N, Gómez-Ortega M, González-Candelas F, Domingo-Calap P, Sanjuan R. Genetic determinants of host tropism in Klebsiella phages. Cell Rep. 2023 Feb;42(2):112048. pmid:36753420
85. Lourenço M, Osbelt L, Passet V, Gravey F, Megrian D, Strowig T, et al. Phages against Noncapsulated Klebsiella pneumoniae: Broader Host range, Slower Resistance. Microbiol Spectr. 2023 Jun 20;0(0):e04812–22. pmid:37338376
86. Smith-zaitlik T, Shibu P, Mccartney AL, Foster G, Hoyles L, Hoyles L, et al. Extended genomic analyses of the broad-host-range phages vB _ KmiM-2Di and vB _ KmiM-4Dii reveal slopekviruses have highly conserved genomes. 2022.
87. Maciejewska B, Roszniowski B, Espaillat A, Kęsik-Szeloch A, Majkowska-Skrobek G, Kropinski AM, et al. Klebsiella phages representing a novel clade of viruses with an unknown DNA modification and biotechnologically interesting enzymes. Appl Microbiol Biotechnol. 2017 Jan 1;101(2):673–84. pmid:27766357
88. Guo X, Söderholm A, Kanchugal P S, Isaksen GV, Warsi O, Eckhard U, et al. Structure and mechanism of a phage-encoded SAM lyase revises catalytic function of enzyme family. Cui Q, Wolberger C, Moliner V, Tunon I, editors. eLife. 2021 Feb 10;10:e61818. pmid:33567250
89. Klemm BP, Singh D, Smith CE, Hsu AL, Dillard LB, Krahn JM, et al. Mechanism by which T7 bacteriophage protein Gp1.2 inhibits Escherichia coli dGTPase. Proc Natl Acad Sci U S A. 2022 Sep 13;119(37):e2123092119. pmid:36067314
90. Howard-Varona C, Lindback MM, Fudyma JD, Krongauz A, Solonenko NE, Zayed AA, et al. Environment-specific virocell metabolic reprogramming. ISME J. 2024 Mar 29;wrae055. pmid:38552150
91. Kutter E, Sulakvelidze A, editors. Bacteriophages Biology and Applications. New York: CRC Press; 2005. 483–488 p.
92. Jakočiūnė D, Moodley A. A Rapid Bacteriophage DNA Extraction Method. Methods Protoc. 2018 Jul 27;1(3):27. pmid:31164569
93. Turner D, Adriaenssens EM, Tolstoy I, Kropinski AM. Phage Annotation Guide: Guidelines for Assembly and High-Quality Annotation. https://home.liebertpub.com/phage. 2021 Dec 16;2(4):170–82.
94. Shen A, Millard A. Phage Genome Annotation: Where to Begin and End. https://home.liebertpub.com/phage. 2021 Dec 16;2(4):183–93.
95. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014 Aug 1;30(15):2114–20. pmid:24695404
96. Andrews S. FastQC [Internet]. 2023 [cited 2023 Jul 6]. Available from: https://github.com/s-andrews/FastQC
97. Rognes T, Flouri T, Nichols B, Quince C, Mahé F. VSEARCH: a versatile open source tool for metagenomics. PeerJ. 2016;4:e2584. pmid:27781170
98. Prjibelski A, Antipov D, Meleshko D, Lapidus A, Korobeynikov A. Using SPAdes De Novo Assembler. Curr Protoc Bioinforma. 2020;70(1):e102. pmid:32559359
99. Wick RR, Schultz MB, Zobel J, Holt KE. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics. 2015 Oct 15;31(20):3350–2. pmid:26099265
100. BBMap [Internet]. SourceForge. 2022 [cited 2023 Jul 6]. Available from: https://sourceforge.net/projects/bbmap/
101. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, et al. Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement. PLOS ONE. 2014 Nov 19;9(11):e112963. pmid:25409509
102. Garneau JR, Depardieu F, Fortier LC, Bikard D, Monot M. PhageTerm: A tool for fast and accurate determination of phage termini and packaging mechanism using next-generation sequencing data. Sci Rep. 2017 Dec 1;7(1). pmid:28811656
103. Wick R. rrwick/Porechop [Internet]. 2024 [cited 2024 May 7]. Available from: https://github.com/rrwick/Porechop
104. Salisbury A, Tsourkas PK. A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation. Int J Mol Sci 2019 Vol 20 Page 3391. 2019 Jul 10;20(14):3391. pmid:31295925
105. Lazeroff M, Ryder G, Harris SL, Tsourkas PK. Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs. https://home.liebertpub.com/phage. 2021 Dec 16;2(4):204–13.
106. Besemer J, Borodovsky M. Heuristic approach to deriving models for gene finding. Nucleic Acids Res. 1999 Oct 1;27(19):3911–20. pmid:10481031
107. Besemer J, Lomsadze A, Borodovsky M. GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. Nucleic Acids Res. 2001 Jun 15;29(12):2607–18. pmid:11410670
108. Lomsadze A, Gemayel K, Tang S, Borodovsky M. Modeling leaderless transcription and atypical genes results in more accurate gene prediction in prokaryotes. Genome Res. 2018 Jul;28(7):1079–89. pmid:29773659
109. Zhu W, Lomsadze A, Borodovsky M. Ab initio gene identification in metagenomic sequences. Nucleic Acids Res. 2010 Jul 1;38(12):e132. pmid:20403810
110. Noguchi H, Taniguchi T, Itoh T. MetaGeneAnnotator: Detecting Species-Specific Patterns of Ribosomal Binding Site for Precise Gene Prediction in Anonymous Prokaryotic and Phage Genomes. DNA Res Int J Rapid Publ Rep Genes Genomes. 2008 Dec;15(6):387–96. pmid:18940874
111. Delcher AL, Bratke KA, Powers EC, Salzberg SL. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinforma Oxf Engl. 2007 Mar 15;23(6):673–9. pmid:17237039
112. Hyatt D, Chen GL, LoCascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010 Mar 8;11(1):119. pmid:20211023
113. McNair K, Zhou C, Dinsdale EA, Souza B, Edwards RA. PHANOTATE: a novel approach to gene identification in phage genomes. Bioinformatics. 2019 Nov 15;35(22):4537–42. pmid:31329826
114. Zimmermann L, Stephens A, Nam SZ, Rau D, Kübler J, Lozajic M, et al. A Completely Reimplemented MPI Bioinformatics Toolkit with a New HHpred Server at its Core. J Mol Biol. 2018 Jul 20;430(15):2237–43. pmid:29258817
115. Ramsey J, Rasche H, Maughmer C, Criscione A, Mijalis E, Liu M, et al. Galaxy and Apollo as a biologist-friendly interface for high-quality cooperative phage genome annotation. PLOS Comput Biol. 2020 Nov 2;16(11):e1008214. pmid:33137082
116. Hadley Wickham. ggplot2: Elegant Graphics for Data Analysis [Internet]. Springer-Verlag New York; 2016. Available from: https://ggplot2.tidyverse.org
117. Katoh K, Standley DM. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol Biol Evol. 2013 Apr 1;30(4):772–80. pmid:23329690
118. Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol Biol Evol. 2020 May 1;37(5):1530–4. pmid:32011700
119. Hoang DT, Chernomor O, von Haeseler A, Minh BQ, Vinh LS. UFBoot2: Improving the Ultrafast Bootstrap Approximation. Mol Biol Evol. 2018 Feb 1;35(2):518–22. pmid:29077904
120. Chernomor O, von Haeseler A, Minh BQ. Terrace Aware Data Structure for Phylogenomic Inference from Supermatrices. Syst Biol. 2016 Nov 1;65(6):997–1008. pmid:27121966
121. Letunic I, Bork P. Interactive Tree of Life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. 2024 Apr 13;gkae268.
122. Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009 Aug 1;25(15):1972–3. pmid:19505945
123. Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009 May 1;25(9):1189–91. pmid:19151095
124. Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017 Jun;14(6):587–9. pmid:28481363
125. Camargo AP, Roux S, Schulz F, Babinski M, Xu Y, Hu B, et al. Identification of mobile genetic elements with geNomad. Nat Biotechnol. 2024 Aug;42(8):1303–12. pmid:37735266
126. Nayfach S, Camargo AP, Schulz F, Eloe-Fadrosh E, Roux S, Kyrpides NC. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat Biotechnol. 2021 May;39(5):578–85. pmid:33349699
127. Aroney STN, Newell RJP, Nissen J, Camargo AP, Tyson GW, Woodcroft BJ. CoverM: Read coverage calculator for metagenomics [Internet]. 2024 [cited 2024 Sep 16]. Available from: https://github.com/wwood/CoverM
128. Gilchrist CLM, Chooi YH. Clinker & clustermap.js: Automatic generation of gene cluster comparison figures. Bioinformatics. 2021;37(16):2473–5.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2025 Gittrich et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Bacteriophages largely impact bacterial communities via lysis, gene transfer, and metabolic reprogramming and thus are increasingly thought to alter nutrient and energy cycling across many of Earth’s ecosystems. However, there are few model systems to mechanistically and quantitatively study phage-bacteria interactions, especially in soil systems. Here, we isolated, sequenced, and genomically characterized 24 novel phages infecting Klebsiella sp. M5al, a plant growth-promoting, nonencapsulated rhizosphere-associated bacterium, and compared many of their features against all 565 sequenced, dsDNA Klebsiella phage genomes. Taxonomic analyses revealed that these Klebsiella phages belong to three known phage families (Autographiviridae, Drexlerviridae, and Straboviridae) and two newly proposed phage families (Candidatus Mavericviridae and Ca. Rivulusviridae). At the phage family level, we found that core genes were often phage-centric proteins, such as structural proteins for the phage head and tail and DNA packaging proteins. In contrast, genes involved in transcription, translation, or hypothetical proteins were commonly not shared or flexible genes. Ecologically, we assessed the phages’ ubiquity in recent large-scale metagenomic datasets, which revealed they were not widespread, as well as a possible direct role in reprogramming specific metabolisms during infection by screening their genomes for phage-encoded auxiliary metabolic genes (AMGs). Even though AMGs are common in the environmental literature, only one of our phage families, Straboviridae, contained AMGs, and the types of AMGs were correlated at the genus level. Host range phenotyping revealed the phages had a wide range of infectivity, infecting between 1–14 of our 22 bacterial strain panel that included pathogenic Klebsiella and Raoultella strains. This indicates that not all capsule-independent Klebsiella phages have broad host ranges. Together, these isolates, with corresponding genome, AMG, and host range analyses, help build the Klebsiella model system for studying phage-host interactions of rhizosphere-associated bacteria.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer