Cargando…

iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations

In this study, we propose iDNA-ABF, a multi-scale deep biological language learning model that enables the interpretable prediction of DNA methylations based on genomic sequences only. Benchmarking comparisons show that our iDNA-ABF outperforms state-of-the-art methods for different methylation pred...

Descripción completa

Detalles Bibliográficos
Autores principales: Jin, Junru, Yu, Yingying, Wang, Ruheng, Zeng, Xin, Pang, Chao, Jiang, Yi, Li, Zhongshen, Dai, Yutong, Su, Ran, Zou, Quan, Nakai, Kenta, Wei, Leyi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575223/
https://www.ncbi.nlm.nih.gov/pubmed/36253864
http://dx.doi.org/10.1186/s13059-022-02780-1
_version_ 1784811271489585152
author Jin, Junru
Yu, Yingying
Wang, Ruheng
Zeng, Xin
Pang, Chao
Jiang, Yi
Li, Zhongshen
Dai, Yutong
Su, Ran
Zou, Quan
Nakai, Kenta
Wei, Leyi
author_facet Jin, Junru
Yu, Yingying
Wang, Ruheng
Zeng, Xin
Pang, Chao
Jiang, Yi
Li, Zhongshen
Dai, Yutong
Su, Ran
Zou, Quan
Nakai, Kenta
Wei, Leyi
author_sort Jin, Junru
collection PubMed
description In this study, we propose iDNA-ABF, a multi-scale deep biological language learning model that enables the interpretable prediction of DNA methylations based on genomic sequences only. Benchmarking comparisons show that our iDNA-ABF outperforms state-of-the-art methods for different methylation predictions. Importantly, we show the power of deep language learning in capturing both sequential and functional semantics information from background genomes. Moreover, by integrating the interpretable analysis mechanism, we well explain what the model learns, helping us build the mapping from the discovery of important sequential determinants to the in-depth analysis of their biological functions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02780-1.
format Online
Article
Text
id pubmed-9575223
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-95752232022-10-18 iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations Jin, Junru Yu, Yingying Wang, Ruheng Zeng, Xin Pang, Chao Jiang, Yi Li, Zhongshen Dai, Yutong Su, Ran Zou, Quan Nakai, Kenta Wei, Leyi Genome Biol Method In this study, we propose iDNA-ABF, a multi-scale deep biological language learning model that enables the interpretable prediction of DNA methylations based on genomic sequences only. Benchmarking comparisons show that our iDNA-ABF outperforms state-of-the-art methods for different methylation predictions. Importantly, we show the power of deep language learning in capturing both sequential and functional semantics information from background genomes. Moreover, by integrating the interpretable analysis mechanism, we well explain what the model learns, helping us build the mapping from the discovery of important sequential determinants to the in-depth analysis of their biological functions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02780-1. BioMed Central 2022-10-17 /pmc/articles/PMC9575223/ /pubmed/36253864 http://dx.doi.org/10.1186/s13059-022-02780-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Method
Jin, Junru
Yu, Yingying
Wang, Ruheng
Zeng, Xin
Pang, Chao
Jiang, Yi
Li, Zhongshen
Dai, Yutong
Su, Ran
Zou, Quan
Nakai, Kenta
Wei, Leyi
iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations
title iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations
title_full iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations
title_fullStr iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations
title_full_unstemmed iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations
title_short iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations
title_sort idna-abf: multi-scale deep biological language learning model for the interpretable prediction of dna methylations
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575223/
https://www.ncbi.nlm.nih.gov/pubmed/36253864
http://dx.doi.org/10.1186/s13059-022-02780-1
work_keys_str_mv AT jinjunru idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations
AT yuyingying idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations
AT wangruheng idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations
AT zengxin idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations
AT pangchao idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations
AT jiangyi idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations
AT lizhongshen idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations
AT daiyutong idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations
AT suran idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations
AT zouquan idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations
AT nakaikenta idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations
AT weileyi idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations