Cargando…

Predicting DNA-Binding Proteins and Binding Residues by Complex Structure Prediction and Application to Human Proteome

As more and more protein sequences are uncovered from increasingly inexpensive sequencing techniques, an urgent task is to find their functions. This work presents a highly reliable computational technique for predicting DNA-binding function at the level of protein-DNA complex structures, rather tha...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Huiying, Wang, Jihua, Zhou, Yaoqi, Yang, Yuedong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4008587/
https://www.ncbi.nlm.nih.gov/pubmed/24792350
http://dx.doi.org/10.1371/journal.pone.0096694
_version_ 1782314482796593152
author Zhao, Huiying
Wang, Jihua
Zhou, Yaoqi
Yang, Yuedong
author_facet Zhao, Huiying
Wang, Jihua
Zhou, Yaoqi
Yang, Yuedong
author_sort Zhao, Huiying
collection PubMed
description As more and more protein sequences are uncovered from increasingly inexpensive sequencing techniques, an urgent task is to find their functions. This work presents a highly reliable computational technique for predicting DNA-binding function at the level of protein-DNA complex structures, rather than low-resolution two-state prediction of DNA-binding as most existing techniques do. The method first predicts protein-DNA complex structure by utilizing the template-based structure prediction technique HHblits, followed by binding affinity prediction based on a knowledge-based energy function (Distance-scaled finite ideal-gas reference state for protein-DNA interactions). A leave-one-out cross validation of the method based on 179 DNA-binding and 3797 non-binding protein domains achieves a Matthews correlation coefficient (MCC) of 0.77 with high precision (94%) and high sensitivity (65%). We further found 51% sensitivity for 82 newly determined structures of DNA-binding proteins and 56% sensitivity for the human proteome. In addition, the method provides a reasonably accurate prediction of DNA-binding residues in proteins based on predicted DNA-binding complex structures. Its application to human proteome leads to more than 300 novel DNA-binding proteins; some of these predicted structures were validated by known structures of homologous proteins in APO forms. The method [SPOT-Seq (DNA)] is available as an on-line server at http://sparks-lab.org.
format Online
Article
Text
id pubmed-4008587
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40085872014-05-09 Predicting DNA-Binding Proteins and Binding Residues by Complex Structure Prediction and Application to Human Proteome Zhao, Huiying Wang, Jihua Zhou, Yaoqi Yang, Yuedong PLoS One Research Article As more and more protein sequences are uncovered from increasingly inexpensive sequencing techniques, an urgent task is to find their functions. This work presents a highly reliable computational technique for predicting DNA-binding function at the level of protein-DNA complex structures, rather than low-resolution two-state prediction of DNA-binding as most existing techniques do. The method first predicts protein-DNA complex structure by utilizing the template-based structure prediction technique HHblits, followed by binding affinity prediction based on a knowledge-based energy function (Distance-scaled finite ideal-gas reference state for protein-DNA interactions). A leave-one-out cross validation of the method based on 179 DNA-binding and 3797 non-binding protein domains achieves a Matthews correlation coefficient (MCC) of 0.77 with high precision (94%) and high sensitivity (65%). We further found 51% sensitivity for 82 newly determined structures of DNA-binding proteins and 56% sensitivity for the human proteome. In addition, the method provides a reasonably accurate prediction of DNA-binding residues in proteins based on predicted DNA-binding complex structures. Its application to human proteome leads to more than 300 novel DNA-binding proteins; some of these predicted structures were validated by known structures of homologous proteins in APO forms. The method [SPOT-Seq (DNA)] is available as an on-line server at http://sparks-lab.org. Public Library of Science 2014-05-02 /pmc/articles/PMC4008587/ /pubmed/24792350 http://dx.doi.org/10.1371/journal.pone.0096694 Text en © 2014 Zhao et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zhao, Huiying
Wang, Jihua
Zhou, Yaoqi
Yang, Yuedong
Predicting DNA-Binding Proteins and Binding Residues by Complex Structure Prediction and Application to Human Proteome
title Predicting DNA-Binding Proteins and Binding Residues by Complex Structure Prediction and Application to Human Proteome
title_full Predicting DNA-Binding Proteins and Binding Residues by Complex Structure Prediction and Application to Human Proteome
title_fullStr Predicting DNA-Binding Proteins and Binding Residues by Complex Structure Prediction and Application to Human Proteome
title_full_unstemmed Predicting DNA-Binding Proteins and Binding Residues by Complex Structure Prediction and Application to Human Proteome
title_short Predicting DNA-Binding Proteins and Binding Residues by Complex Structure Prediction and Application to Human Proteome
title_sort predicting dna-binding proteins and binding residues by complex structure prediction and application to human proteome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4008587/
https://www.ncbi.nlm.nih.gov/pubmed/24792350
http://dx.doi.org/10.1371/journal.pone.0096694
work_keys_str_mv AT zhaohuiying predictingdnabindingproteinsandbindingresiduesbycomplexstructurepredictionandapplicationtohumanproteome
AT wangjihua predictingdnabindingproteinsandbindingresiduesbycomplexstructurepredictionandapplicationtohumanproteome
AT zhouyaoqi predictingdnabindingproteinsandbindingresiduesbycomplexstructurepredictionandapplicationtohumanproteome
AT yangyuedong predictingdnabindingproteinsandbindingresiduesbycomplexstructurepredictionandapplicationtohumanproteome