Cargando…
iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations
In this study, we propose iDNA-ABF, a multi-scale deep biological language learning model that enables the interpretable prediction of DNA methylations based on genomic sequences only. Benchmarking comparisons show that our iDNA-ABF outperforms state-of-the-art methods for different methylation pred...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575223/ https://www.ncbi.nlm.nih.gov/pubmed/36253864 http://dx.doi.org/10.1186/s13059-022-02780-1 |
_version_ | 1784811271489585152 |
---|---|
author | Jin, Junru Yu, Yingying Wang, Ruheng Zeng, Xin Pang, Chao Jiang, Yi Li, Zhongshen Dai, Yutong Su, Ran Zou, Quan Nakai, Kenta Wei, Leyi |
author_facet | Jin, Junru Yu, Yingying Wang, Ruheng Zeng, Xin Pang, Chao Jiang, Yi Li, Zhongshen Dai, Yutong Su, Ran Zou, Quan Nakai, Kenta Wei, Leyi |
author_sort | Jin, Junru |
collection | PubMed |
description | In this study, we propose iDNA-ABF, a multi-scale deep biological language learning model that enables the interpretable prediction of DNA methylations based on genomic sequences only. Benchmarking comparisons show that our iDNA-ABF outperforms state-of-the-art methods for different methylation predictions. Importantly, we show the power of deep language learning in capturing both sequential and functional semantics information from background genomes. Moreover, by integrating the interpretable analysis mechanism, we well explain what the model learns, helping us build the mapping from the discovery of important sequential determinants to the in-depth analysis of their biological functions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02780-1. |
format | Online Article Text |
id | pubmed-9575223 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-95752232022-10-18 iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations Jin, Junru Yu, Yingying Wang, Ruheng Zeng, Xin Pang, Chao Jiang, Yi Li, Zhongshen Dai, Yutong Su, Ran Zou, Quan Nakai, Kenta Wei, Leyi Genome Biol Method In this study, we propose iDNA-ABF, a multi-scale deep biological language learning model that enables the interpretable prediction of DNA methylations based on genomic sequences only. Benchmarking comparisons show that our iDNA-ABF outperforms state-of-the-art methods for different methylation predictions. Importantly, we show the power of deep language learning in capturing both sequential and functional semantics information from background genomes. Moreover, by integrating the interpretable analysis mechanism, we well explain what the model learns, helping us build the mapping from the discovery of important sequential determinants to the in-depth analysis of their biological functions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02780-1. BioMed Central 2022-10-17 /pmc/articles/PMC9575223/ /pubmed/36253864 http://dx.doi.org/10.1186/s13059-022-02780-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Method Jin, Junru Yu, Yingying Wang, Ruheng Zeng, Xin Pang, Chao Jiang, Yi Li, Zhongshen Dai, Yutong Su, Ran Zou, Quan Nakai, Kenta Wei, Leyi iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations |
title | iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations |
title_full | iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations |
title_fullStr | iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations |
title_full_unstemmed | iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations |
title_short | iDNA-ABF: multi-scale deep biological language learning model for the interpretable prediction of DNA methylations |
title_sort | idna-abf: multi-scale deep biological language learning model for the interpretable prediction of dna methylations |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575223/ https://www.ncbi.nlm.nih.gov/pubmed/36253864 http://dx.doi.org/10.1186/s13059-022-02780-1 |
work_keys_str_mv | AT jinjunru idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations AT yuyingying idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations AT wangruheng idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations AT zengxin idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations AT pangchao idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations AT jiangyi idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations AT lizhongshen idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations AT daiyutong idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations AT suran idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations AT zouquan idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations AT nakaikenta idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations AT weileyi idnaabfmultiscaledeepbiologicallanguagelearningmodelfortheinterpretablepredictionofdnamethylations |