Cargando…
Base-resolution prediction of transcription factor binding signals by a deep learning framework
Transcription factors (TFs) play an important role in regulating gene expression, thus the identification of the sites bound by them has become a fundamental step for molecular and cellular biology. In this paper, we developed a deep learning framework leveraging existing fully convolutional neural...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982852/ https://www.ncbi.nlm.nih.gov/pubmed/35263332 http://dx.doi.org/10.1371/journal.pcbi.1009941 |
_version_ | 1784681872141320192 |
---|---|
author | Zhang, Qinhu He, Ying Wang, Siguo Chen, Zhanheng Guo, Zhenhao Cui, Zhen Liu, Qi Huang, De-Shuang |
author_facet | Zhang, Qinhu He, Ying Wang, Siguo Chen, Zhanheng Guo, Zhenhao Cui, Zhen Liu, Qi Huang, De-Shuang |
author_sort | Zhang, Qinhu |
collection | PubMed |
description | Transcription factors (TFs) play an important role in regulating gene expression, thus the identification of the sites bound by them has become a fundamental step for molecular and cellular biology. In this paper, we developed a deep learning framework leveraging existing fully convolutional neural networks (FCN) to predict TF-DNA binding signals at the base-resolution level (named as FCNsignal). The proposed FCNsignal can simultaneously achieve the following tasks: (i) modeling the base-resolution signals of binding regions; (ii) discriminating binding or non-binding regions; (iii) locating TF-DNA binding regions; (iv) predicting binding motifs. Besides, FCNsignal can also be used to predict opening regions across the whole genome. The experimental results on 53 TF ChIP-seq datasets and 6 chromatin accessibility ATAC-seq datasets show that our proposed framework outperforms some existing state-of-the-art methods. In addition, we explored to use the trained FCNsignal to locate all potential TF-DNA binding regions on a whole chromosome and predict DNA sequences of arbitrary length, and the results show that our framework can find most of the known binding regions and accept sequences of arbitrary length. Furthermore, we demonstrated the potential ability of our framework in discovering causal disease-associated single-nucleotide polymorphisms (SNPs) through a series of experiments. |
format | Online Article Text |
id | pubmed-8982852 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-89828522022-04-06 Base-resolution prediction of transcription factor binding signals by a deep learning framework Zhang, Qinhu He, Ying Wang, Siguo Chen, Zhanheng Guo, Zhenhao Cui, Zhen Liu, Qi Huang, De-Shuang PLoS Comput Biol Research Article Transcription factors (TFs) play an important role in regulating gene expression, thus the identification of the sites bound by them has become a fundamental step for molecular and cellular biology. In this paper, we developed a deep learning framework leveraging existing fully convolutional neural networks (FCN) to predict TF-DNA binding signals at the base-resolution level (named as FCNsignal). The proposed FCNsignal can simultaneously achieve the following tasks: (i) modeling the base-resolution signals of binding regions; (ii) discriminating binding or non-binding regions; (iii) locating TF-DNA binding regions; (iv) predicting binding motifs. Besides, FCNsignal can also be used to predict opening regions across the whole genome. The experimental results on 53 TF ChIP-seq datasets and 6 chromatin accessibility ATAC-seq datasets show that our proposed framework outperforms some existing state-of-the-art methods. In addition, we explored to use the trained FCNsignal to locate all potential TF-DNA binding regions on a whole chromosome and predict DNA sequences of arbitrary length, and the results show that our framework can find most of the known binding regions and accept sequences of arbitrary length. Furthermore, we demonstrated the potential ability of our framework in discovering causal disease-associated single-nucleotide polymorphisms (SNPs) through a series of experiments. Public Library of Science 2022-03-09 /pmc/articles/PMC8982852/ /pubmed/35263332 http://dx.doi.org/10.1371/journal.pcbi.1009941 Text en © 2022 Zhang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Zhang, Qinhu He, Ying Wang, Siguo Chen, Zhanheng Guo, Zhenhao Cui, Zhen Liu, Qi Huang, De-Shuang Base-resolution prediction of transcription factor binding signals by a deep learning framework |
title | Base-resolution prediction of transcription factor binding signals by a deep learning framework |
title_full | Base-resolution prediction of transcription factor binding signals by a deep learning framework |
title_fullStr | Base-resolution prediction of transcription factor binding signals by a deep learning framework |
title_full_unstemmed | Base-resolution prediction of transcription factor binding signals by a deep learning framework |
title_short | Base-resolution prediction of transcription factor binding signals by a deep learning framework |
title_sort | base-resolution prediction of transcription factor binding signals by a deep learning framework |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982852/ https://www.ncbi.nlm.nih.gov/pubmed/35263332 http://dx.doi.org/10.1371/journal.pcbi.1009941 |
work_keys_str_mv | AT zhangqinhu baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework AT heying baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework AT wangsiguo baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework AT chenzhanheng baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework AT guozhenhao baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework AT cuizhen baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework AT liuqi baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework AT huangdeshuang baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework |