Cargando…

Base-resolution prediction of transcription factor binding signals by a deep learning framework

Transcription factors (TFs) play an important role in regulating gene expression, thus the identification of the sites bound by them has become a fundamental step for molecular and cellular biology. In this paper, we developed a deep learning framework leveraging existing fully convolutional neural...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Qinhu, He, Ying, Wang, Siguo, Chen, Zhanheng, Guo, Zhenhao, Cui, Zhen, Liu, Qi, Huang, De-Shuang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982852/
https://www.ncbi.nlm.nih.gov/pubmed/35263332
http://dx.doi.org/10.1371/journal.pcbi.1009941
_version_ 1784681872141320192
author Zhang, Qinhu
He, Ying
Wang, Siguo
Chen, Zhanheng
Guo, Zhenhao
Cui, Zhen
Liu, Qi
Huang, De-Shuang
author_facet Zhang, Qinhu
He, Ying
Wang, Siguo
Chen, Zhanheng
Guo, Zhenhao
Cui, Zhen
Liu, Qi
Huang, De-Shuang
author_sort Zhang, Qinhu
collection PubMed
description Transcription factors (TFs) play an important role in regulating gene expression, thus the identification of the sites bound by them has become a fundamental step for molecular and cellular biology. In this paper, we developed a deep learning framework leveraging existing fully convolutional neural networks (FCN) to predict TF-DNA binding signals at the base-resolution level (named as FCNsignal). The proposed FCNsignal can simultaneously achieve the following tasks: (i) modeling the base-resolution signals of binding regions; (ii) discriminating binding or non-binding regions; (iii) locating TF-DNA binding regions; (iv) predicting binding motifs. Besides, FCNsignal can also be used to predict opening regions across the whole genome. The experimental results on 53 TF ChIP-seq datasets and 6 chromatin accessibility ATAC-seq datasets show that our proposed framework outperforms some existing state-of-the-art methods. In addition, we explored to use the trained FCNsignal to locate all potential TF-DNA binding regions on a whole chromosome and predict DNA sequences of arbitrary length, and the results show that our framework can find most of the known binding regions and accept sequences of arbitrary length. Furthermore, we demonstrated the potential ability of our framework in discovering causal disease-associated single-nucleotide polymorphisms (SNPs) through a series of experiments.
format Online
Article
Text
id pubmed-8982852
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-89828522022-04-06 Base-resolution prediction of transcription factor binding signals by a deep learning framework Zhang, Qinhu He, Ying Wang, Siguo Chen, Zhanheng Guo, Zhenhao Cui, Zhen Liu, Qi Huang, De-Shuang PLoS Comput Biol Research Article Transcription factors (TFs) play an important role in regulating gene expression, thus the identification of the sites bound by them has become a fundamental step for molecular and cellular biology. In this paper, we developed a deep learning framework leveraging existing fully convolutional neural networks (FCN) to predict TF-DNA binding signals at the base-resolution level (named as FCNsignal). The proposed FCNsignal can simultaneously achieve the following tasks: (i) modeling the base-resolution signals of binding regions; (ii) discriminating binding or non-binding regions; (iii) locating TF-DNA binding regions; (iv) predicting binding motifs. Besides, FCNsignal can also be used to predict opening regions across the whole genome. The experimental results on 53 TF ChIP-seq datasets and 6 chromatin accessibility ATAC-seq datasets show that our proposed framework outperforms some existing state-of-the-art methods. In addition, we explored to use the trained FCNsignal to locate all potential TF-DNA binding regions on a whole chromosome and predict DNA sequences of arbitrary length, and the results show that our framework can find most of the known binding regions and accept sequences of arbitrary length. Furthermore, we demonstrated the potential ability of our framework in discovering causal disease-associated single-nucleotide polymorphisms (SNPs) through a series of experiments. Public Library of Science 2022-03-09 /pmc/articles/PMC8982852/ /pubmed/35263332 http://dx.doi.org/10.1371/journal.pcbi.1009941 Text en © 2022 Zhang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhang, Qinhu
He, Ying
Wang, Siguo
Chen, Zhanheng
Guo, Zhenhao
Cui, Zhen
Liu, Qi
Huang, De-Shuang
Base-resolution prediction of transcription factor binding signals by a deep learning framework
title Base-resolution prediction of transcription factor binding signals by a deep learning framework
title_full Base-resolution prediction of transcription factor binding signals by a deep learning framework
title_fullStr Base-resolution prediction of transcription factor binding signals by a deep learning framework
title_full_unstemmed Base-resolution prediction of transcription factor binding signals by a deep learning framework
title_short Base-resolution prediction of transcription factor binding signals by a deep learning framework
title_sort base-resolution prediction of transcription factor binding signals by a deep learning framework
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982852/
https://www.ncbi.nlm.nih.gov/pubmed/35263332
http://dx.doi.org/10.1371/journal.pcbi.1009941
work_keys_str_mv AT zhangqinhu baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework
AT heying baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework
AT wangsiguo baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework
AT chenzhanheng baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework
AT guozhenhao baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework
AT cuizhen baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework
AT liuqi baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework
AT huangdeshuang baseresolutionpredictionoftranscriptionfactorbindingsignalsbyadeeplearningframework