Abstract

Abstract

Evolutionary annotation of genome maintenance (GM) proteins has conventionally been established by remote relationships within protein sequence databases. However, often no significant relationship can be established. Highly sensitive approaches to attain remote homologies based on iterative profile-to-profile methods have been developed. Still, these methods have not been systematically applied in the evolutionary annotation of GM proteins. Here, by applying profile-to-profile models, we systematically survey the repertoire of GM proteins from bacteria to man. We identify multiple GM protein candidates and annotate domains in numerous established GM proteins, among other PARP, OB-fold, Macro, TUDOR, SAP, BRCT, KU, MYB (SANT), and nuclease domains. We experimentally validate OB-fold and MIS18 (Yippee) domains in SPIDR and FAM72 protein families, respectively. Our results indicate that, surprisingly, despite the immense interest and long-term research efforts, the repertoire of genome stability caretakers is still not fully appreciated.

Here the authors use computational methods for sequence and structure analysis of genome maintenance proteins to catalog and identify genome maintenance families across species. This allows to identify several genome maintenance family members.

Details

Title
Exploring the structural landscape of DNA maintenance proteins
Author
Schou, Kenneth Bødkter 1   VIAFID ORCID Logo  ; Mandacaru, Samuel 2 ; Tahir, Muhammad 2 ; Tom, Nikola 3 ; Nilsson, Ann-Sofie 4 ; Andersen, Jens S. 2   VIAFID ORCID Logo  ; Tiberti, Matteo 5   VIAFID ORCID Logo  ; Papaleo, Elena 6   VIAFID ORCID Logo  ; Bartek, Jiri 7   VIAFID ORCID Logo 

 Danish Cancer Society, Genome Integrity, Danish Cancer Institute, Copenhagen, Denmark (GRID:grid.417390.8) (ISNI:0000 0001 2175 6024); University of Southern Denmark, Department of Biochemistry and Molecular Biology, Odense M, Denmark (GRID:grid.10825.3e) (ISNI:0000 0001 0728 0170); Karolinska Institute, Division of Genome Biology, Department of Medical Biochemistry and Biophysics, Science for Laboratory, Solna, Sweden (GRID:grid.465198.7) 
 University of Southern Denmark, Department of Biochemistry and Molecular Biology, Odense M, Denmark (GRID:grid.10825.3e) (ISNI:0000 0001 0728 0170) 
 Danish Cancer Institute (DCI), Lipidomics Core Facility, Copenhagen, Denmark (GRID:grid.10825.3e) 
 Karolinska Institute, Division of Genome Biology, Department of Medical Biochemistry and Biophysics, Science for Laboratory, Solna, Sweden (GRID:grid.465198.7) 
 Danish Cancer Society Research Center, Cancer Structural Biology, Copenhagen, Denmark (GRID:grid.417390.8) (ISNI:0000 0001 2175 6024) 
 Danish Cancer Society Research Center, Cancer Structural Biology, Copenhagen, Denmark (GRID:grid.417390.8) (ISNI:0000 0001 2175 6024); Technical University of Denmark, Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Lyngby, Denmark (GRID:grid.5170.3) (ISNI:0000 0001 2181 8870) 
 Danish Cancer Society, Genome Integrity, Danish Cancer Institute, Copenhagen, Denmark (GRID:grid.417390.8) (ISNI:0000 0001 2175 6024); Karolinska Institute, Division of Genome Biology, Department of Medical Biochemistry and Biophysics, Science for Laboratory, Solna, Sweden (GRID:grid.465198.7) 
Pages
7748
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3101005430
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.