Abstract

Mobile elements and highly repetitive genomic regions are potent sources of lineage-specific genomic innovation and fingerprint individual genomes. Comprehensive analyses of large, composite or arrayed repeat elements and those found in more complex regions of the genome require a complete, linear genome assembly. Here we present the first de novo repeat discovery and annotation of a complete human reference genome, T2T-CHM13v1.0. We identified novel satellite arrays, expanded the catalog of variants and families for known repeats and mobile elements, characterized new classes of complex, composite repeats, and provided comprehensive annotations of retroelement transduction events. Utilizing PRO-seq to detect nascent transcription and nanopore sequencing to delineate CpG methylation profiles, we defined the structure of transcriptionally active retroelements in humans, including for the first time those found in centromeres. Together, these data provide expanded insight into the diversity, distribution and evolution of repetitive regions that have shaped the human genome.

Competing Interest Statement

KHM has received travel funds to speak at symposia organized by Oxford Nanopore. WT has two patents (8,748,091 and 8,394,584) licensed to Oxford Nanopore Technologies. All other authors declare that they have no competing interests.

Footnotes

* https://github.com/marbl/CHM13

* https://www.ncbi.nlm.nih.gov/bioproject/559484

* https://github.com/marbl/CHM13-issues

* http://genome.ucsc.edu/cgi-bin/hgTracks?genome=t2t-chm13-v1.0&hubUrl=http://t2t.gi.ucsc.edu/chm13/hub/hub.txt

* https://resgen.io/paper-data/T2T-Nurk-et-al-2021/views/t2t-identity

* https://gitlab.com/SJHoyt/t2t_transposable-elements/Repeat_annotations/Repeatmasker_and_polishing/RepeatLibrary_NewRepeatEntries.embl

* https://gitlab.com/SJHoyt/t2t_transposable-elements

Details

Title
From telomere to telomere: the transcriptional and epigenetic state of human repeat elements
Author
Hoyt, Savannah J; Storer, Jessica M; Hartley, Gabrielle A; Grady, Patrick Gs; Gershman, Ariel; De Lima, Leonardo G; Limouse, Charles; Halabian, Reza; Wojenski, Luke; Rodriguez, Matias; Altemose, Nicolas; Core, Leighton; Gerton, Jennifer L; Makalowski, Wojciech; Olson, Daniel; Rosen, Jeb; Smit, Arian Fa; Straight, Aaron F; Vollger, Mitchell R; Wheeler, Travis; Schatz, Michael; Eichler, Evan; Phillippy, Adam M; Timp, Winston; Miga, Karen H; O'neill, Rachel J
University/institution
Cold Spring Harbor Laboratory Press
Section
New Results
Publication year
2021
Publication date
Jul 12, 2021
Publisher
Cold Spring Harbor Laboratory Press
ISSN
2692-8205
Source type
Working Paper
Language of publication
English
ProQuest document ID
2550567249
Copyright
© 2021. This article is published under http://creativecommons.org/licenses/by/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.