Cargando…
De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins
Proteins with sequence-specific DNA binding function are important for a wide range of biological activities. De novo prediction of their DNA-binding specificities from sequence alone would be a great aid in inferring cellular networks. Here we introduce a method for predicting DNA-binding specifici...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3874201/ https://www.ncbi.nlm.nih.gov/pubmed/24097433 http://dx.doi.org/10.1093/nar/gkt890 |
_version_ | 1782297205404598272 |
---|---|
author | Persikov, Anton V. Singh, Mona |
author_facet | Persikov, Anton V. Singh, Mona |
author_sort | Persikov, Anton V. |
collection | PubMed |
description | Proteins with sequence-specific DNA binding function are important for a wide range of biological activities. De novo prediction of their DNA-binding specificities from sequence alone would be a great aid in inferring cellular networks. Here we introduce a method for predicting DNA-binding specificities for Cys(2)His(2) zinc fingers (C2H2-ZFs), the largest family of DNA-binding proteins in metazoans. We develop a general approach, based on empirical calculations of pairwise amino acid–nucleotide interaction energies, for predicting position weight matrices (PWMs) representing DNA-binding specificities for C2H2-ZF proteins. We predict DNA-binding specificities on a per-finger basis and merge predictions for C2H2-ZF domains that are arrayed within sequences. We test our approach on a diverse set of natural C2H2-ZF proteins with known binding specificities and demonstrate that for >85% of the proteins, their predicted PWMs are accurate in 50% of their nucleotide positions. For proteins with several zinc finger isoforms, we show via case studies that this level of accuracy enables us to match isoforms with their known DNA-binding specificities. A web server for predicting a PWM given a protein containing C2H2-ZF domains is available online at http://zf.princeton.edu and can be used to aid in protein engineering applications and in genome-wide searches for transcription factor targets. |
format | Online Article Text |
id | pubmed-3874201 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-38742012013-12-28 De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins Persikov, Anton V. Singh, Mona Nucleic Acids Res Computational Biology Proteins with sequence-specific DNA binding function are important for a wide range of biological activities. De novo prediction of their DNA-binding specificities from sequence alone would be a great aid in inferring cellular networks. Here we introduce a method for predicting DNA-binding specificities for Cys(2)His(2) zinc fingers (C2H2-ZFs), the largest family of DNA-binding proteins in metazoans. We develop a general approach, based on empirical calculations of pairwise amino acid–nucleotide interaction energies, for predicting position weight matrices (PWMs) representing DNA-binding specificities for C2H2-ZF proteins. We predict DNA-binding specificities on a per-finger basis and merge predictions for C2H2-ZF domains that are arrayed within sequences. We test our approach on a diverse set of natural C2H2-ZF proteins with known binding specificities and demonstrate that for >85% of the proteins, their predicted PWMs are accurate in 50% of their nucleotide positions. For proteins with several zinc finger isoforms, we show via case studies that this level of accuracy enables us to match isoforms with their known DNA-binding specificities. A web server for predicting a PWM given a protein containing C2H2-ZF domains is available online at http://zf.princeton.edu and can be used to aid in protein engineering applications and in genome-wide searches for transcription factor targets. Oxford University Press 2014-01-01 2013-10-03 /pmc/articles/PMC3874201/ /pubmed/24097433 http://dx.doi.org/10.1093/nar/gkt890 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Biology Persikov, Anton V. Singh, Mona De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins |
title | De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins |
title_full | De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins |
title_fullStr | De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins |
title_full_unstemmed | De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins |
title_short | De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins |
title_sort | de novo prediction of dna-binding specificities for cys(2)his(2) zinc finger proteins |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3874201/ https://www.ncbi.nlm.nih.gov/pubmed/24097433 http://dx.doi.org/10.1093/nar/gkt890 |
work_keys_str_mv | AT persikovantonv denovopredictionofdnabindingspecificitiesforcys2his2zincfingerproteins AT singhmona denovopredictionofdnabindingspecificitiesforcys2his2zincfingerproteins |