Cargando…
An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins
Cys(2)-His(2) zinc finger proteins (ZFPs) are the largest family of transcription factors in higher metazoans. They also represent the most diverse family with regards to the composition of their recognition sequences. Although there are a number of ZFPs with characterized DNA-binding preferences, t...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005693/ https://www.ncbi.nlm.nih.gov/pubmed/24523353 http://dx.doi.org/10.1093/nar/gku132 |
_version_ | 1782314143397707776 |
---|---|
author | Gupta, Ankit Christensen, Ryan G. Bell, Heather A. Goodwin, Mathew Patel, Ronak Y. Pandey, Manishi Enuameh, Metewo Selase Rayla, Amy L. Zhu, Cong Thibodeau-Beganny, Stacey Brodsky, Michael H. Joung, J. Keith Wolfe, Scot A. Stormo, Gary D. |
author_facet | Gupta, Ankit Christensen, Ryan G. Bell, Heather A. Goodwin, Mathew Patel, Ronak Y. Pandey, Manishi Enuameh, Metewo Selase Rayla, Amy L. Zhu, Cong Thibodeau-Beganny, Stacey Brodsky, Michael H. Joung, J. Keith Wolfe, Scot A. Stormo, Gary D. |
author_sort | Gupta, Ankit |
collection | PubMed |
description | Cys(2)-His(2) zinc finger proteins (ZFPs) are the largest family of transcription factors in higher metazoans. They also represent the most diverse family with regards to the composition of their recognition sequences. Although there are a number of ZFPs with characterized DNA-binding preferences, the specificity of the vast majority of ZFPs is unknown and cannot be directly inferred by homology due to the diversity of recognition residues present within individual fingers. Given the large number of unique zinc fingers and assemblies present across eukaryotes, a comprehensive predictive recognition model that could accurately estimate the DNA-binding specificity of any ZFP based on its amino acid sequence would have great utility. Toward this goal, we have used the DNA-binding specificities of 678 two-finger modules from both natural and artificial sources to construct a random forest-based predictive model for ZFP recognition. We find that our recognition model outperforms previously described determinant-based recognition models for ZFPs, and can successfully estimate the specificity of naturally occurring ZFPs with previously defined specificities. |
format | Online Article Text |
id | pubmed-4005693 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-40056932014-05-01 An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins Gupta, Ankit Christensen, Ryan G. Bell, Heather A. Goodwin, Mathew Patel, Ronak Y. Pandey, Manishi Enuameh, Metewo Selase Rayla, Amy L. Zhu, Cong Thibodeau-Beganny, Stacey Brodsky, Michael H. Joung, J. Keith Wolfe, Scot A. Stormo, Gary D. Nucleic Acids Res Computational Biology Cys(2)-His(2) zinc finger proteins (ZFPs) are the largest family of transcription factors in higher metazoans. They also represent the most diverse family with regards to the composition of their recognition sequences. Although there are a number of ZFPs with characterized DNA-binding preferences, the specificity of the vast majority of ZFPs is unknown and cannot be directly inferred by homology due to the diversity of recognition residues present within individual fingers. Given the large number of unique zinc fingers and assemblies present across eukaryotes, a comprehensive predictive recognition model that could accurately estimate the DNA-binding specificity of any ZFP based on its amino acid sequence would have great utility. Toward this goal, we have used the DNA-binding specificities of 678 two-finger modules from both natural and artificial sources to construct a random forest-based predictive model for ZFP recognition. We find that our recognition model outperforms previously described determinant-based recognition models for ZFPs, and can successfully estimate the specificity of naturally occurring ZFPs with previously defined specificities. Oxford University Press 2014-04 2014-02-12 /pmc/articles/PMC4005693/ /pubmed/24523353 http://dx.doi.org/10.1093/nar/gku132 Text en © The Author(s) 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Biology Gupta, Ankit Christensen, Ryan G. Bell, Heather A. Goodwin, Mathew Patel, Ronak Y. Pandey, Manishi Enuameh, Metewo Selase Rayla, Amy L. Zhu, Cong Thibodeau-Beganny, Stacey Brodsky, Michael H. Joung, J. Keith Wolfe, Scot A. Stormo, Gary D. An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins |
title | An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins |
title_full | An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins |
title_fullStr | An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins |
title_full_unstemmed | An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins |
title_short | An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins |
title_sort | improved predictive recognition model for cys(2)-his(2) zinc finger proteins |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005693/ https://www.ncbi.nlm.nih.gov/pubmed/24523353 http://dx.doi.org/10.1093/nar/gku132 |
work_keys_str_mv | AT guptaankit animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT christensenryang animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT bellheathera animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT goodwinmathew animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT patelronaky animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT pandeymanishi animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT enuamehmetewoselase animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT raylaamyl animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT zhucong animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT thibodeaubegannystacey animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT brodskymichaelh animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT joungjkeith animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT wolfescota animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT stormogaryd animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins AT guptaankit improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT christensenryang improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT bellheathera improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT goodwinmathew improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT patelronaky improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT pandeymanishi improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT enuamehmetewoselase improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT raylaamyl improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT zhucong improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT thibodeaubegannystacey improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT brodskymichaelh improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT joungjkeith improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT wolfescota improvedpredictiverecognitionmodelforcys2his2zincfingerproteins AT stormogaryd improvedpredictiverecognitionmodelforcys2his2zincfingerproteins |