Cargando…
Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture
The study of transcriptional regulation is still difficult yet fundamental in molecular biology research. Recent research has shown that the double helix structure of nucleotides plays an important role in improving the accuracy and interpretability of transcription factor binding sites (TFBSs). Alt...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society of Gene & Cell Therapy
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7972936/ https://www.ncbi.nlm.nih.gov/pubmed/33767912 http://dx.doi.org/10.1016/j.omtn.2021.02.014 |
_version_ | 1783666741219426304 |
---|---|
author | Wang, Siguo Zhang, Qinhu Shen, Zhen He, Ying Chen, Zhen-Heng Li, Jianqiang Huang, De-Shuang |
author_facet | Wang, Siguo Zhang, Qinhu Shen, Zhen He, Ying Chen, Zhen-Heng Li, Jianqiang Huang, De-Shuang |
author_sort | Wang, Siguo |
collection | PubMed |
description | The study of transcriptional regulation is still difficult yet fundamental in molecular biology research. Recent research has shown that the double helix structure of nucleotides plays an important role in improving the accuracy and interpretability of transcription factor binding sites (TFBSs). Although several computational methods have been designed to take both DNA sequence and DNA shape features into consideration simultaneously, how to design an efficient model is still an intractable topic. In this paper, we proposed a hybrid convolutional recurrent neural network (CNN/RNN) architecture, CRPTS, to predict TFBSs by combining DNA sequence and DNA shape features. The novelty of our proposed method relies on three critical aspects: (1) the application of a shared hybrid CNN and RNN has the ability to efficiently extract features from large-scale genomic sequences obtained by high-throughput technology; (2) the common patterns were found from DNA sequences and their corresponding DNA shape features; (3) our proposed CRPTS can capture local structural information of DNA sequences without completely relying on DNA shape data. A series of comprehensive experiments on 66 in vitro datasets derived from universal protein binding microarrays (uPBMs) shows that our proposed method CRPTS obviously outperforms the state-of-the-art methods. |
format | Online Article Text |
id | pubmed-7972936 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | American Society of Gene & Cell Therapy |
record_format | MEDLINE/PubMed |
spelling | pubmed-79729362021-03-24 Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture Wang, Siguo Zhang, Qinhu Shen, Zhen He, Ying Chen, Zhen-Heng Li, Jianqiang Huang, De-Shuang Mol Ther Nucleic Acids Original Article The study of transcriptional regulation is still difficult yet fundamental in molecular biology research. Recent research has shown that the double helix structure of nucleotides plays an important role in improving the accuracy and interpretability of transcription factor binding sites (TFBSs). Although several computational methods have been designed to take both DNA sequence and DNA shape features into consideration simultaneously, how to design an efficient model is still an intractable topic. In this paper, we proposed a hybrid convolutional recurrent neural network (CNN/RNN) architecture, CRPTS, to predict TFBSs by combining DNA sequence and DNA shape features. The novelty of our proposed method relies on three critical aspects: (1) the application of a shared hybrid CNN and RNN has the ability to efficiently extract features from large-scale genomic sequences obtained by high-throughput technology; (2) the common patterns were found from DNA sequences and their corresponding DNA shape features; (3) our proposed CRPTS can capture local structural information of DNA sequences without completely relying on DNA shape data. A series of comprehensive experiments on 66 in vitro datasets derived from universal protein binding microarrays (uPBMs) shows that our proposed method CRPTS obviously outperforms the state-of-the-art methods. American Society of Gene & Cell Therapy 2021-02-18 /pmc/articles/PMC7972936/ /pubmed/33767912 http://dx.doi.org/10.1016/j.omtn.2021.02.014 Text en © 2021 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Original Article Wang, Siguo Zhang, Qinhu Shen, Zhen He, Ying Chen, Zhen-Heng Li, Jianqiang Huang, De-Shuang Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture |
title | Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture |
title_full | Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture |
title_fullStr | Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture |
title_full_unstemmed | Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture |
title_short | Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture |
title_sort | predicting transcription factor binding sites using dna shape features based on shared hybrid deep learning architecture |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7972936/ https://www.ncbi.nlm.nih.gov/pubmed/33767912 http://dx.doi.org/10.1016/j.omtn.2021.02.014 |
work_keys_str_mv | AT wangsiguo predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture AT zhangqinhu predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture AT shenzhen predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture AT heying predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture AT chenzhenheng predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture AT lijianqiang predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture AT huangdeshuang predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture |