Cargando…

Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures

DNA-binding proteins such as transcription factors use DNA-binding domains (DBDs) to bind to specific sequences in the genome to initiate many important biological functions. Accurate prediction of such target sequences, often represented by position weight matrices (PWMs), is an important step to u...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chen, Chien-Yu, Chien, Ting-Ying, Lin, Chih-Kang, Lin, Chih-Wei, Weng, Yi-Zhong, Chang, Darby Tien-Hao
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2012
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3270014/ https://www.ncbi.nlm.nih.gov/pubmed/22312425 http://dx.doi.org/10.1371/journal.pone.0030446

_version_	1782222534327926784
author	Chen, Chien-Yu Chien, Ting-Ying Lin, Chih-Kang Lin, Chih-Wei Weng, Yi-Zhong Chang, Darby Tien-Hao
author_facet	Chen, Chien-Yu Chien, Ting-Ying Lin, Chih-Kang Lin, Chih-Wei Weng, Yi-Zhong Chang, Darby Tien-Hao
author_sort	Chen, Chien-Yu
collection	PubMed
description	DNA-binding proteins such as transcription factors use DNA-binding domains (DBDs) to bind to specific sequences in the genome to initiate many important biological functions. Accurate prediction of such target sequences, often represented by position weight matrices (PWMs), is an important step to understand many biological processes. Recent studies have shown that knowledge-based potential functions can be applied on protein-DNA co-crystallized structures to generate PWMs that are considerably consistent with experimental data. However, this success has not been extended to DNA-binding proteins lacking co-crystallized structures. This study aims at investigating the possibility of predicting the DNA sequences bound by DNA-binding proteins from the proteins' unbound structures (structures of the unbound state). Given an unbound query protein and a template complex, the proposed method first employs structure alignment to generate synthetic protein-DNA complexes for the query protein. Once a complex is available, an atomic-level knowledge-based potential function is employed to predict PWMs characterizing the sequences to which the query protein can bind. The evaluation of the proposed method is based on seven DNA-binding proteins, which have structures of both DNA-bound and unbound forms for prediction as well as annotated PWMs for validation. Since this work is the first attempt to predict target sequences of DNA-binding proteins from their unbound structures, three types of structural variations that presumably influence the prediction accuracy were examined and discussed. Based on the analyses conducted in this study, the conformational change of proteins upon binding DNA was shown to be the key factor. This study sheds light on the challenge of predicting the target DNA sequences of a protein lacking co-crystallized structures, which encourages more efforts on the structure alignment-based approaches in addition to docking- and homology modeling-based approaches for generating synthetic complexes.
format	Online Article Text
id	pubmed-3270014
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-32700142012-02-06 Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures Chen, Chien-Yu Chien, Ting-Ying Lin, Chih-Kang Lin, Chih-Wei Weng, Yi-Zhong Chang, Darby Tien-Hao PLoS One Research Article DNA-binding proteins such as transcription factors use DNA-binding domains (DBDs) to bind to specific sequences in the genome to initiate many important biological functions. Accurate prediction of such target sequences, often represented by position weight matrices (PWMs), is an important step to understand many biological processes. Recent studies have shown that knowledge-based potential functions can be applied on protein-DNA co-crystallized structures to generate PWMs that are considerably consistent with experimental data. However, this success has not been extended to DNA-binding proteins lacking co-crystallized structures. This study aims at investigating the possibility of predicting the DNA sequences bound by DNA-binding proteins from the proteins' unbound structures (structures of the unbound state). Given an unbound query protein and a template complex, the proposed method first employs structure alignment to generate synthetic protein-DNA complexes for the query protein. Once a complex is available, an atomic-level knowledge-based potential function is employed to predict PWMs characterizing the sequences to which the query protein can bind. The evaluation of the proposed method is based on seven DNA-binding proteins, which have structures of both DNA-bound and unbound forms for prediction as well as annotated PWMs for validation. Since this work is the first attempt to predict target sequences of DNA-binding proteins from their unbound structures, three types of structural variations that presumably influence the prediction accuracy were examined and discussed. Based on the analyses conducted in this study, the conformational change of proteins upon binding DNA was shown to be the key factor. This study sheds light on the challenge of predicting the target DNA sequences of a protein lacking co-crystallized structures, which encourages more efforts on the structure alignment-based approaches in addition to docking- and homology modeling-based approaches for generating synthetic complexes. Public Library of Science 2012-02-01 /pmc/articles/PMC3270014/ /pubmed/22312425 http://dx.doi.org/10.1371/journal.pone.0030446 Text en Chen et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Chen, Chien-Yu Chien, Ting-Ying Lin, Chih-Kang Lin, Chih-Wei Weng, Yi-Zhong Chang, Darby Tien-Hao Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures
title	Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures
title_full	Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures
title_fullStr	Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures
title_full_unstemmed	Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures
title_short	Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures
title_sort	predicting target dna sequences of dna-binding proteins based on unbound structures
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3270014/ https://www.ncbi.nlm.nih.gov/pubmed/22312425 http://dx.doi.org/10.1371/journal.pone.0030446
work_keys_str_mv	AT chenchienyu predictingtargetdnasequencesofdnabindingproteinsbasedonunboundstructures AT chientingying predictingtargetdnasequencesofdnabindingproteinsbasedonunboundstructures AT linchihkang predictingtargetdnasequencesofdnabindingproteinsbasedonunboundstructures AT linchihwei predictingtargetdnasequencesofdnabindingproteinsbasedonunboundstructures AT wengyizhong predictingtargetdnasequencesofdnabindingproteinsbasedonunboundstructures AT changdarbytienhao predictingtargetdnasequencesofdnabindingproteinsbasedonunboundstructures

Predicting Target DNA Sequences of DNA-Binding Proteins Based on Unbound Structures

Ejemplares similares