Cargando…

iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations

In this study, we propose iDNA-ABF, a multi-scale deep biological language learning model that enables the interpretable prediction of DNA methylations based on genomic sequences only. Benchmarking comparisons show that our iDNA-ABF outperforms state-of-the-art methods for different methylation pred...

Descripción completa

Detalles Bibliográficos
Autores principales: Jin, Junru, Yu, Yingying, Wang, Ruheng, Zeng, Xin, Pang, Chao, Jiang, Yi, Li, Zhongshen, Dai, Yutong, Su, Ran, Zou, Quan, Nakai, Kenta, Wei, Leyi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575223/
https://www.ncbi.nlm.nih.gov/pubmed/36253864
http://dx.doi.org/10.1186/s13059-022-02780-1
Descripción
Sumario:In this study, we propose iDNA-ABF, a multi-scale deep biological language learning model that enables the interpretable prediction of DNA methylations based on genomic sequences only. Benchmarking comparisons show that our iDNA-ABF outperforms state-of-the-art methods for different methylation predictions. Importantly, we show the power of deep language learning in capturing both sequential and functional semantics information from background genomes. Moreover, by integrating the interpretable analysis mechanism, we well explain what the model learns, helping us build the mapping from the discovery of important sequential determinants to the in-depth analysis of their biological functions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02780-1.