Cargando…

De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins

Proteins with sequence-specific DNA binding function are important for a wide range of biological activities. De novo prediction of their DNA-binding specificities from sequence alone would be a great aid in inferring cellular networks. Here we introduce a method for predicting DNA-binding specifici...

Descripción completa

Detalles Bibliográficos
Autores principales: Persikov, Anton V., Singh, Mona
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3874201/
https://www.ncbi.nlm.nih.gov/pubmed/24097433
http://dx.doi.org/10.1093/nar/gkt890
_version_ 1782297205404598272
author Persikov, Anton V.
Singh, Mona
author_facet Persikov, Anton V.
Singh, Mona
author_sort Persikov, Anton V.
collection PubMed
description Proteins with sequence-specific DNA binding function are important for a wide range of biological activities. De novo prediction of their DNA-binding specificities from sequence alone would be a great aid in inferring cellular networks. Here we introduce a method for predicting DNA-binding specificities for Cys(2)His(2) zinc fingers (C2H2-ZFs), the largest family of DNA-binding proteins in metazoans. We develop a general approach, based on empirical calculations of pairwise amino acid–nucleotide interaction energies, for predicting position weight matrices (PWMs) representing DNA-binding specificities for C2H2-ZF proteins. We predict DNA-binding specificities on a per-finger basis and merge predictions for C2H2-ZF domains that are arrayed within sequences. We test our approach on a diverse set of natural C2H2-ZF proteins with known binding specificities and demonstrate that for >85% of the proteins, their predicted PWMs are accurate in 50% of their nucleotide positions. For proteins with several zinc finger isoforms, we show via case studies that this level of accuracy enables us to match isoforms with their known DNA-binding specificities. A web server for predicting a PWM given a protein containing C2H2-ZF domains is available online at http://zf.princeton.edu and can be used to aid in protein engineering applications and in genome-wide searches for transcription factor targets.
format Online
Article
Text
id pubmed-3874201
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-38742012013-12-28 De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins Persikov, Anton V. Singh, Mona Nucleic Acids Res Computational Biology Proteins with sequence-specific DNA binding function are important for a wide range of biological activities. De novo prediction of their DNA-binding specificities from sequence alone would be a great aid in inferring cellular networks. Here we introduce a method for predicting DNA-binding specificities for Cys(2)His(2) zinc fingers (C2H2-ZFs), the largest family of DNA-binding proteins in metazoans. We develop a general approach, based on empirical calculations of pairwise amino acid–nucleotide interaction energies, for predicting position weight matrices (PWMs) representing DNA-binding specificities for C2H2-ZF proteins. We predict DNA-binding specificities on a per-finger basis and merge predictions for C2H2-ZF domains that are arrayed within sequences. We test our approach on a diverse set of natural C2H2-ZF proteins with known binding specificities and demonstrate that for >85% of the proteins, their predicted PWMs are accurate in 50% of their nucleotide positions. For proteins with several zinc finger isoforms, we show via case studies that this level of accuracy enables us to match isoforms with their known DNA-binding specificities. A web server for predicting a PWM given a protein containing C2H2-ZF domains is available online at http://zf.princeton.edu and can be used to aid in protein engineering applications and in genome-wide searches for transcription factor targets. Oxford University Press 2014-01-01 2013-10-03 /pmc/articles/PMC3874201/ /pubmed/24097433 http://dx.doi.org/10.1093/nar/gkt890 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Persikov, Anton V.
Singh, Mona
De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins
title De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins
title_full De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins
title_fullStr De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins
title_full_unstemmed De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins
title_short De novo prediction of DNA-binding specificities for Cys(2)His(2) zinc finger proteins
title_sort de novo prediction of dna-binding specificities for cys(2)his(2) zinc finger proteins
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3874201/
https://www.ncbi.nlm.nih.gov/pubmed/24097433
http://dx.doi.org/10.1093/nar/gkt890
work_keys_str_mv AT persikovantonv denovopredictionofdnabindingspecificitiesforcys2his2zincfingerproteins
AT singhmona denovopredictionofdnabindingspecificitiesforcys2his2zincfingerproteins