Cargando…

Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture

The study of transcriptional regulation is still difficult yet fundamental in molecular biology research. Recent research has shown that the double helix structure of nucleotides plays an important role in improving the accuracy and interpretability of transcription factor binding sites (TFBSs). Alt...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Siguo, Zhang, Qinhu, Shen, Zhen, He, Ying, Chen, Zhen-Heng, Li, Jianqiang, Huang, De-Shuang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society of Gene & Cell Therapy 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7972936/
https://www.ncbi.nlm.nih.gov/pubmed/33767912
http://dx.doi.org/10.1016/j.omtn.2021.02.014
_version_ 1783666741219426304
author Wang, Siguo
Zhang, Qinhu
Shen, Zhen
He, Ying
Chen, Zhen-Heng
Li, Jianqiang
Huang, De-Shuang
author_facet Wang, Siguo
Zhang, Qinhu
Shen, Zhen
He, Ying
Chen, Zhen-Heng
Li, Jianqiang
Huang, De-Shuang
author_sort Wang, Siguo
collection PubMed
description The study of transcriptional regulation is still difficult yet fundamental in molecular biology research. Recent research has shown that the double helix structure of nucleotides plays an important role in improving the accuracy and interpretability of transcription factor binding sites (TFBSs). Although several computational methods have been designed to take both DNA sequence and DNA shape features into consideration simultaneously, how to design an efficient model is still an intractable topic. In this paper, we proposed a hybrid convolutional recurrent neural network (CNN/RNN) architecture, CRPTS, to predict TFBSs by combining DNA sequence and DNA shape features. The novelty of our proposed method relies on three critical aspects: (1) the application of a shared hybrid CNN and RNN has the ability to efficiently extract features from large-scale genomic sequences obtained by high-throughput technology; (2) the common patterns were found from DNA sequences and their corresponding DNA shape features; (3) our proposed CRPTS can capture local structural information of DNA sequences without completely relying on DNA shape data. A series of comprehensive experiments on 66 in vitro datasets derived from universal protein binding microarrays (uPBMs) shows that our proposed method CRPTS obviously outperforms the state-of-the-art methods.
format Online
Article
Text
id pubmed-7972936
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society of Gene & Cell Therapy
record_format MEDLINE/PubMed
spelling pubmed-79729362021-03-24 Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture Wang, Siguo Zhang, Qinhu Shen, Zhen He, Ying Chen, Zhen-Heng Li, Jianqiang Huang, De-Shuang Mol Ther Nucleic Acids Original Article The study of transcriptional regulation is still difficult yet fundamental in molecular biology research. Recent research has shown that the double helix structure of nucleotides plays an important role in improving the accuracy and interpretability of transcription factor binding sites (TFBSs). Although several computational methods have been designed to take both DNA sequence and DNA shape features into consideration simultaneously, how to design an efficient model is still an intractable topic. In this paper, we proposed a hybrid convolutional recurrent neural network (CNN/RNN) architecture, CRPTS, to predict TFBSs by combining DNA sequence and DNA shape features. The novelty of our proposed method relies on three critical aspects: (1) the application of a shared hybrid CNN and RNN has the ability to efficiently extract features from large-scale genomic sequences obtained by high-throughput technology; (2) the common patterns were found from DNA sequences and their corresponding DNA shape features; (3) our proposed CRPTS can capture local structural information of DNA sequences without completely relying on DNA shape data. A series of comprehensive experiments on 66 in vitro datasets derived from universal protein binding microarrays (uPBMs) shows that our proposed method CRPTS obviously outperforms the state-of-the-art methods. American Society of Gene & Cell Therapy 2021-02-18 /pmc/articles/PMC7972936/ /pubmed/33767912 http://dx.doi.org/10.1016/j.omtn.2021.02.014 Text en © 2021 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Article
Wang, Siguo
Zhang, Qinhu
Shen, Zhen
He, Ying
Chen, Zhen-Heng
Li, Jianqiang
Huang, De-Shuang
Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture
title Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture
title_full Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture
title_fullStr Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture
title_full_unstemmed Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture
title_short Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture
title_sort predicting transcription factor binding sites using dna shape features based on shared hybrid deep learning architecture
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7972936/
https://www.ncbi.nlm.nih.gov/pubmed/33767912
http://dx.doi.org/10.1016/j.omtn.2021.02.014
work_keys_str_mv AT wangsiguo predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture
AT zhangqinhu predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture
AT shenzhen predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture
AT heying predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture
AT chenzhenheng predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture
AT lijianqiang predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture
AT huangdeshuang predictingtranscriptionfactorbindingsitesusingdnashapefeaturesbasedonsharedhybriddeeplearningarchitecture