Cargando…

An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins

Cys(2)-His(2) zinc finger proteins (ZFPs) are the largest family of transcription factors in higher metazoans. They also represent the most diverse family with regards to the composition of their recognition sequences. Although there are a number of ZFPs with characterized DNA-binding preferences, t...

Descripción completa

Detalles Bibliográficos
Autores principales: Gupta, Ankit, Christensen, Ryan G., Bell, Heather A., Goodwin, Mathew, Patel, Ronak Y., Pandey, Manishi, Enuameh, Metewo Selase, Rayla, Amy L., Zhu, Cong, Thibodeau-Beganny, Stacey, Brodsky, Michael H., Joung, J. Keith, Wolfe, Scot A., Stormo, Gary D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005693/
https://www.ncbi.nlm.nih.gov/pubmed/24523353
http://dx.doi.org/10.1093/nar/gku132
_version_ 1782314143397707776
author Gupta, Ankit
Christensen, Ryan G.
Bell, Heather A.
Goodwin, Mathew
Patel, Ronak Y.
Pandey, Manishi
Enuameh, Metewo Selase
Rayla, Amy L.
Zhu, Cong
Thibodeau-Beganny, Stacey
Brodsky, Michael H.
Joung, J. Keith
Wolfe, Scot A.
Stormo, Gary D.
author_facet Gupta, Ankit
Christensen, Ryan G.
Bell, Heather A.
Goodwin, Mathew
Patel, Ronak Y.
Pandey, Manishi
Enuameh, Metewo Selase
Rayla, Amy L.
Zhu, Cong
Thibodeau-Beganny, Stacey
Brodsky, Michael H.
Joung, J. Keith
Wolfe, Scot A.
Stormo, Gary D.
author_sort Gupta, Ankit
collection PubMed
description Cys(2)-His(2) zinc finger proteins (ZFPs) are the largest family of transcription factors in higher metazoans. They also represent the most diverse family with regards to the composition of their recognition sequences. Although there are a number of ZFPs with characterized DNA-binding preferences, the specificity of the vast majority of ZFPs is unknown and cannot be directly inferred by homology due to the diversity of recognition residues present within individual fingers. Given the large number of unique zinc fingers and assemblies present across eukaryotes, a comprehensive predictive recognition model that could accurately estimate the DNA-binding specificity of any ZFP based on its amino acid sequence would have great utility. Toward this goal, we have used the DNA-binding specificities of 678 two-finger modules from both natural and artificial sources to construct a random forest-based predictive model for ZFP recognition. We find that our recognition model outperforms previously described determinant-based recognition models for ZFPs, and can successfully estimate the specificity of naturally occurring ZFPs with previously defined specificities.
format Online
Article
Text
id pubmed-4005693
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40056932014-05-01 An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins Gupta, Ankit Christensen, Ryan G. Bell, Heather A. Goodwin, Mathew Patel, Ronak Y. Pandey, Manishi Enuameh, Metewo Selase Rayla, Amy L. Zhu, Cong Thibodeau-Beganny, Stacey Brodsky, Michael H. Joung, J. Keith Wolfe, Scot A. Stormo, Gary D. Nucleic Acids Res Computational Biology Cys(2)-His(2) zinc finger proteins (ZFPs) are the largest family of transcription factors in higher metazoans. They also represent the most diverse family with regards to the composition of their recognition sequences. Although there are a number of ZFPs with characterized DNA-binding preferences, the specificity of the vast majority of ZFPs is unknown and cannot be directly inferred by homology due to the diversity of recognition residues present within individual fingers. Given the large number of unique zinc fingers and assemblies present across eukaryotes, a comprehensive predictive recognition model that could accurately estimate the DNA-binding specificity of any ZFP based on its amino acid sequence would have great utility. Toward this goal, we have used the DNA-binding specificities of 678 two-finger modules from both natural and artificial sources to construct a random forest-based predictive model for ZFP recognition. We find that our recognition model outperforms previously described determinant-based recognition models for ZFPs, and can successfully estimate the specificity of naturally occurring ZFPs with previously defined specificities. Oxford University Press 2014-04 2014-02-12 /pmc/articles/PMC4005693/ /pubmed/24523353 http://dx.doi.org/10.1093/nar/gku132 Text en © The Author(s) 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Gupta, Ankit
Christensen, Ryan G.
Bell, Heather A.
Goodwin, Mathew
Patel, Ronak Y.
Pandey, Manishi
Enuameh, Metewo Selase
Rayla, Amy L.
Zhu, Cong
Thibodeau-Beganny, Stacey
Brodsky, Michael H.
Joung, J. Keith
Wolfe, Scot A.
Stormo, Gary D.
An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins
title An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins
title_full An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins
title_fullStr An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins
title_full_unstemmed An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins
title_short An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins
title_sort improved predictive recognition model for cys(2)-his(2) zinc finger proteins
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4005693/
https://www.ncbi.nlm.nih.gov/pubmed/24523353
http://dx.doi.org/10.1093/nar/gku132
work_keys_str_mv AT guptaankit animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT christensenryang animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT bellheathera animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT goodwinmathew animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT patelronaky animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT pandeymanishi animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT enuamehmetewoselase animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT raylaamyl animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT zhucong animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT thibodeaubegannystacey animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT brodskymichaelh animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT joungjkeith animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT wolfescota animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT stormogaryd animprovedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT guptaankit improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT christensenryang improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT bellheathera improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT goodwinmathew improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT patelronaky improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT pandeymanishi improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT enuamehmetewoselase improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT raylaamyl improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT zhucong improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT thibodeaubegannystacey improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT brodskymichaelh improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT joungjkeith improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT wolfescota improvedpredictiverecognitionmodelforcys2his2zincfingerproteins
AT stormogaryd improvedpredictiverecognitionmodelforcys2his2zincfingerproteins