Abstract

RNA secondary structures play essential roles in the formation of the tertiary structure and function of a transcript. Recent genome-wide studies highlight significant potential for RNA structures in the mammalian genome. However, a major challenge is assigning functional roles to these structured RNAs. In this study, we conduct a guilt-by-association analysis of clusters of computationally predicted conserved RNA structure (CRSs) in human untranslated regions (UTRs) to associate them with gene functions. We filtered a broad pool of ∼500 000 human CRSs for UTR overlap, resulting in 4734 and 24 754 CRSs from the 5′ and 3′ UTR of protein-coding genes, respectively. We separately clustered these CRSs for both sets using RNAscClust, obtaining 793 and 2403 clusters, each containing an average of five CRSs per cluster. We identified overrepresented binding sites for 60 and 43 RNA-binding proteins co-localizing with the clustered CRSs. Furthermore, 104 and 441 clusters from the 5′ and 3′ UTRs, respectively, showed enrichment for various Gene Ontologies, including biological processes such as ‘signal transduction’, ‘nervous system development’, molecular functions like ‘transferase activity’ and the cellular components such as ‘synapse’ among others. Our study shows that significant functional insights can be gained by clustering RNA structures based on their structural characteristics.

Details

Title
Clusters of mammalian conserved RNA structures in UTRs associate with RBP binding sites
Author
Gadekar, Veerendra P 1   VIAFID ORCID Logo  ; Alexander Welford Munk 1 ; Miladi, Milad 2   VIAFID ORCID Logo  ; Junge, Alexander 1 ; Backofen, Rolf 2   VIAFID ORCID Logo  ; Seemann, Stefan E 1   VIAFID ORCID Logo  ; Gorodkin, Jan 1   VIAFID ORCID Logo 

 Center for non-coding RNA in Technology and Health, University of Copenhagen , Ridebanevej 9, 1870 Frederiksberg, Denmark 
 Bioinformatics Group, Department of Computer Science, University of Freiburg , Freiburg im Breisgau, Germany 
Publication year
2024
Publication date
Sep 2024
Publisher
Oxford University Press
e-ISSN
26319268
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3168786544
Copyright
© The Author(s) 2024. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. This work is published under https://creativecommons.org/licenses/by-nc/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.