Abstract

In this study, we propose iDNA-ABF, a multi-scale deep biological language learning model that enables the interpretable prediction of DNA methylations based on genomic sequences only. Benchmarking comparisons show that our iDNA-ABF outperforms state-of-the-art methods for different methylation predictions. Importantly, we show the power of deep language learning in capturing both sequential and functional semantics information from background genomes. Moreover, by integrating the interpretable analysis mechanism, we well explain what the model learns, helping us build the mapping from the discovery of important sequential determinants to the in-depth analysis of their biological functions.

Details

Title
iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations
Author
Jin, Junru; Yu, Yingying; Wang, Ruheng; Zeng, Xin; Pang, Chao; Jiang, Yi; Li, Zhongshen; Dai, Yutong; Su, Ran; Zou, Quan; Nakai, Kenta; Leyi Wei
Pages
1-23
Section
Method
Publication year
2022
Publication date
2022
Publisher
Springer Nature B.V.
ISSN
14747596
e-ISSN
1474760X
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2726070942
Copyright
© 2022. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.