1. Introduction
Anelloviruses are ubiquitous and considered a major component of the human virome. Although no disease has been related to anelloviruses, the term anellome is commonly used to describe the composition of anelloviruses infecting an individual [1]. Anelloviruses are nonenveloped, containing a circular, negative-sense, single-stranded DNA genome ranging from 1.6 to 3.9 kb. Their genomic organization is made up of a large open reading frame (ORF), referred to as ORF1 (coding for the capsid protein, the most conserved protein), and a few smaller ORFs. Of the 34 genera of the Anelloviridae family defined by the International Committee on Taxonomy of Viruses (ICTV) (based on the complete ORF1 coding region nucleotide sequences using 69% as a species demarcation threshold) [2], eight have been described in humans, namely the genera Alphatorquevirus (commonly called Torque teno virus (TTV)), Betatorquevirus (commonly called Torque teno mini virus (TTMV)), Gammatorquevirus (commonly called Torque teno midi virus (TTMDV)), Hetorquevirus [3,4,5], and more recently Yodtorquevirus, Lamedtorquevirus, Memtorquevirus, and Samektorquevirus (last ICTV release).
Next-generation sequencing metagenomic studies not only highlight the impressive genomic diversity of anelloviruses but also that the co-detection of different genera, species, and genotypes within a single individual is the rule rather than the exception [6,7,8]. Investigation of the genomic diversity of anelloviruses in humans is becoming increasingly complex and demanding for bioinformatic experts that are not specialists in the Anelloviridae family, requiring them to ensure that the latest classification criteria established by the ICTV are met. Therefore, we previously designed SCANellome, a user-friendly computer tool to evaluate the anellome composition from raw data generated by the Illumina and Nanopore platforms [9]. Indeed, although short-read sequencing platforms (dominated by the Illumina platforms) represent the main contributors to the anellovirus sequences made available in GenBank, more and more investigations now use long-read sequencing technologies such as the Oxford Nanopore platforms, which can be extremely useful for the analysis of circular DNA virus genomes [10].
This SCANellome update includes >900 additional reference sequences from GenBank and has generated a novel FASTA complete ORF1 database using a clustering at 90% identity. The presence of new species based on the ICTV classification criteria were also investigated. In addition, SCANellome V2 now provides the user with a phylogenetic analysis (of ORF1) that includes new species not yet officially classified by the Anelloviridae ICTV committee. Finally, the Anelloviridae taxonomy was updated in SCANellome V2 to rename species names in binomial format as required by the ICTV [11].
2. Materials and Methods
Scanellome Database
The database has been updated with GenBank sequences downloaded up to 8 July 2024 and with the keywords for sequence search updated, including new “metagenome_source” and “host” targets (e.g., metagenome_source=”blood metagenome”, host=“Callithrix penicillata”), using a previously described script from
Sequences below the 69% nucleotide identity threshold have been assigned to new potential species. Then a FASTA database was generated using CD-HIT (v4.7) at 90% identity.
An overall phylogeny analysis of the database has been carried out using an amino acid alignment containing one representative sequence for each species using MAFFT for the alignment (v7.520) and MEGA X (v10.0.5) for the tree reconstruction with the Maximum Likelihood method and LG+G+I+F model [12]. According to this analysis, using the ICTV genera taxonomy, including the novel proposed genus Sadetorquevirus [13], three species have been reassigned to different genera, one Omegatorquevirus to Betatorquevirus and two Hetotorquevirus to novel genus Sadetorquevirus.
Additionally, for each Alpha-, Beta-, and Gammatorquevirus the same phylogeny analysis has been conducted with representative sequences forming a clustering at 70% nucleotide identity using CD-HIT (v4.7). According to this analysis, using the SCANellome group classification, two Betatorquevirus and one Gammatorquevirus have been moved to an unclassified group.
The header of the FASTA database has been modified to include a new field:
“ICTV=<bool>” with a “True” value if the species is currently approved by the ICTV or a “False” value if not, using the “ICTV Master Species List” (
All species without ICTV approval have also been renamed with binomial nomenclature. This includes:
12 Alphatorquevirus using epithet “cero” for Cercopithecidae hosts or “homin” for Hominidae hosts and with number matching former names;
144 Betatorquevirus using epithet “homini” for Hominidae hosts and with numbers incrementing by the hundreds depending on taxonomy group and by the tens according to their provisional name;
50 Gammatorquevirus using epithet “homidi” for Hominidae hosts with numbered increments from 16 to 65;
one Omegatorquevirus using epithet “hominid”;
three Epsilontorquevirus using epithet “cebid” for Cebidae hosts or “calli” for Callitrichidae hosts.
All non-ICTV name changes have been tracked in Table S1.
3. Results
The SCANellome database was updated on 8 July 2024. A total of 916 additional complete ORF1 primate anellovirus sequences were added, generating a database of over 18000 sequences. Using CD-HIT at 90% identity, 134 new representative sequences were generated in SCANellome V2. Furthermore, despite the addition of >900 sequences attributed to viruses with human hosts, there was no change in the number of species infecting humans (neither abolished nor established) in the SCANellome database using the 69% nucleotide identity threshold. Thus, this confirms the provisional species establishment in the SCANellome database (a proposal to the ICTV is in preparation). Table 1 describes the distribution of these additional sequences and potential new species across the 12 different human and nonhuman primate genera. In addition, SCANellome V2 now provides the user with an up-to-date phylogenetic analysis (by ORF1) to show the distribution of these new potential species (Figure 1). Indeed, the user first accesses the “interactive” complete ORF1 phylogenetic tree and can then choose to individually click on each of the three Alpha-, Beta-, and Gammatorquevirus genera to generate a detailed phylogenetic tree highlighting the new species within the selected genus in red. The majority of potential new species are assigned to Betatorquevirus (n = 144) and Gammatorquevirus (n = 50).
4. Conclusions
SCANellome V2 not only includes a significant number of additional anellovirus sequences, but thanks to these, it allows identification of the presence of more than 200 potential new species. Furthermore, now an up-to-date phylogenetic analysis shows the user the distribution of these new species among the 12 human and nonhuman primate genera. In addition, this updated version allows the user to obtain a nomenclature in binomial format as required by the ICTV in 2023.
Conceptualization, F.L. and S.C.; methodology, F.L. and S.C.; sequences analysis, F.L.; formal analysis, F.L. and S.C.; writing—original draft preparation, F.L. and S.C.; writing—review and editing, F.L., L.K. and S.C.; visualization, F.L.; supervision, S.C.; project administration, L.K. and S.C. All authors have read and agreed to the published version of the manuscript.
Not applicable.
Not applicable.
SCANellome V2 is freely available at:
We would like to thank Erik Boehm for revising the manuscript.
The authors declare no conflicts of interest.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Figure 1. Phylogenetic analysis of anellovirus. Interactive phylogenetic analysis across the 12 primate genera. The Maximum Likelihood tree is based on complete ORF1 nucleotide sequences. By clicking on the name of some genus, the user accesses a more detailed representation describing the species assigned or not within that selected genus. Scale is in number of substitutions per site.
Summary of the number of sequences and species for each genus in the updated (V2) complete ORF1 primate anelloviruses database. The distribution of the potential 210 new species within the 12 genera is also reported. NA: not applicable; * Hetorquevirus hominid 7 and 8 in V1 version moved to Sadetorquevirus in V2 version; ** moved to Betatorquevirus.
Genus | Number of Complete ORF1 Sequences | Representatives 90% Identity | Number of Species | Number of Species without ICTV Approved | |||
---|---|---|---|---|---|---|---|
V1 | V2 | V1 | V2 | V1 | V2 | ||
Alphatorquevirus | 5444 | 6096 | 419 | 469 | 36 | 36 | 12 |
Betatorquevirus | 6659 | 6820 | 1856 | 1908 | 178 | 180 | 144 |
Gammatorquevirus | 5534 | 5617 | 1376 | 1393 | 64 | 64 | 50 |
Hetorquevirus | 109 | 69 | 15 | 10 | 7 | 5 | 0 |
Sadetorquevirus | NA | 41 * | NA | 6 * | NA | 2 * | 0 |
Yodtorquevirus | 17 | 17 | 2 | 2 | 2 | 2 | 0 |
Lamedtorquevirus | 33 | 37 | 5 | 6 | 1 | 1 | 0 |
Memtorquevirus | 55 | 58 | 20 | 22 | 3 | 3 | 0 |
Samektorquevirus | 124 | 128 | 32 | 35 | 4 | 4 | 0 |
Omegatorquevirus | 3 | 2 ** | 3 | 2 ** | 3 | 2 | 1 |
Epsilontorquevirus | 1 | 10 | 1 | 10 | 1 | 4 | 3 |
Zetatorquevirus | 1 | 1 | 1 | 1 | 1 | 1 | 0 |
Total | 17,980 | 18,896 | 3730 | 3864 | 300 | 304 | 210 |
Supplementary Materials
The following supporting information can be downloaded at:
References
1. Kaczorowska, J.; Deijs, M.; Klein, M.; Bakker, M.; Jebbink, M.F.; Sparreboom, M.; Kinsella, C.M.; Timmerman, A.L.; van der Hoek, L. Diversity and Long-Term Dynamics of Human Blood Anelloviruses. J. Virol.; 2022; 96, e0010922. [DOI: https://dx.doi.org/10.1128/jvi.00109-22] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35575554]
2. Varsani, A.; Opriessnig, T.; Celer, V.; Maggi, F.; Okamoto, H.; Blomström, A.L.; Cadar, D.; Harrach, B.; Biagini, P.; Kraberger, S. Taxonomic update for mammalian anelloviruses (family Anelloviridae). Arch. Virol.; 2021; 166, pp. 2943-2953. [DOI: https://dx.doi.org/10.1007/s00705-021-05192-x] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34383165]
3. Kaczorowska, J.; van der Hoek, L. Human anelloviruses: Diverse, omnipresent and commensal members of the virome. FEMS Microbiol. Rev.; 2020; 44, pp. 305-313. [DOI: https://dx.doi.org/10.1093/femsre/fuaa007] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32188999]
4. Lu, L.; Robertson, G.; Ashworth, J.; Pham Hong, A.; Shi, T.; Ivens, A.; Thwaites, G.; Baker, S.; Woolhouse, M. Epidemiology and Phylogenetic Analysis of Viral Respiratory Infections in Vietnam. Front. Microbiol.; 2020; 11, 833. [DOI: https://dx.doi.org/10.3389/fmicb.2020.00833] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32499763]
5. Spandole, S.; Cimponeriu, D.; Berca, L.M.; Mihaescu, G. Human anelloviruses: An update of molecular, epidemiological and clinical aspects. Arch. Virol.; 2015; 160, pp. 893-908. [DOI: https://dx.doi.org/10.1007/s00705-015-2363-9] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/25680568]
6. Arze, C.A.; Springer, S.; Dudas, G.; Patel, S.; Bhattacharyya, A.; Swaminathan, H.; Brugnara, C.; Delagrave, S.; Ong, T.; Kahvejian, A. et al. Global genome analysis reveals a vast and dynamic anellovirus landscape within the human virome. Cell Host Microbe; 2021; 29, pp. 1305-1315.e1306. [DOI: https://dx.doi.org/10.1016/j.chom.2021.07.001] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34320399]
7. Bal, A.; Sarkozy, C.; Josset, L.; Cheynet, V.; Oriol, G.; Becker, J.; Vilchez, G.; Sesques, P.; Mallet, F.; Pachot, A. et al. Metagenomic Next-Generation Sequencing Reveals Individual Composition and Dynamics of Anelloviruses during Autologous Stem Cell Transplant Recipient Management. Viruses; 2018; 10, 633. [DOI: https://dx.doi.org/10.3390/v10110633] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30441786]
8. Laubscher, F.; Hartley, M.A.; Kaiser, L.; Cordey, S. Genomic Diversity of Torque Teno Virus in Blood Samples from Febrile Paediatric Outpatients in Tanzania: A Descriptive Cohort Study. Viruses; 2022; 14, 1612. [DOI: https://dx.doi.org/10.3390/v14081612] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35893678]
9. Laubscher, F.; Kaiser, L.; Cordey, S. SCANellome: Analysis of the Genomic Diversity of Human and Non-Human Primate Anelloviruses from Metagenomics Data. Viruses; 2023; 15, 1575. [DOI: https://dx.doi.org/10.3390/v15071575] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37515261]
10. Anantharam, R.; Duchen, D.; Cox, A.L.; Timp, W.; Thomas, D.L.; Clipman, S.J.; Kandathil, A.J. Long-Read Nanopore-Based Sequencing of Anelloviruses. Viruses; 2024; 16, 723. [DOI: https://dx.doi.org/10.3390/v16050723] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/38793605]
11. Varsani, A.; Kraberger, S.; Opriessnig, T.; Maggi, F.; Celer, V.; Okamoto, H.; Biagini, P. Anelloviridae taxonomy update 2023. Arch. Virol.; 2023; 168, 277. [DOI: https://dx.doi.org/10.1007/s00705-023-05903-6] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37864606]
12. Le, S.Q.; Gascuel, O. An improved general amino acid replacement matrix. Mol. Biol. Evol.; 2008; 25, pp. 1307-1320. [DOI: https://dx.doi.org/10.1093/molbev/msn067] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/18367465]
13. Available online: https://ictv.global/filebrowser/download/17325 (accessed on 23 July 2024).
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Anelloviruses are ubiquitous in humans and represent a major component of the human virome. Its best-known representative is Torque teno virus (i.e., the Alphatorquevirus genus), which is considered a potential immunity biomarker. Recent metagenomic investigations revealed not only the extraordinary genomic diversity of anellovirus sequences, but also that co-detection of genera, genotypes, or species seems to be the rule in humans. SCANellome was developed to represent a user-friendly tool to analyze the primate (both human and non-human) anellovirus composition at the genus, species, and genotype level from metagenomics data based on an up-to-date database. This SCANellome update includes >900 additional reference sequences from GenBank. Using a clustering at 90% identity, the FASTA database was updated and generated 134 new representative sequences. Based on ORF1, the analysis of these new sequences indicates the presence of 206 potential new species, including four nonhuman primates, and adds four new non-human primate species which will be the subject of a proposal to the International Committee on Taxonomy of Viruses (ICTV). In addition, SCANellome V2 provides now the user with an interactive up-to-date phylogenetic analysis (of ORF1) to show the distribution among the 12 human and nonhuman primate genera of these new potential species. Finally, the Anelloviridae taxonomy was updated to rename species names in binomial format as required by the ICTV.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details


1 Laboratory of Virology, Department of Diagnostics, Geneva University Hospitals & Faculty of Medicine, University of Geneva, 1205 Geneva, Switzerland;
2 Laboratory of Virology, Department of Diagnostics, Geneva University Hospitals & Faculty of Medicine, University of Geneva, 1205 Geneva, Switzerland;