Cargando…
FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1)
The speed at which new genomes are being sequenced highlights the need for genome-wide methods capable of predicting protein–DNA interactions. Here, we present PADA1, a generic algorithm that accurately models structural complexes and predicts the DNA-binding regions of resolved protein structures....
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5934639/ https://www.ncbi.nlm.nih.gov/pubmed/29608705 http://dx.doi.org/10.1093/nar/gky228 |
_version_ | 1783320151470374912 |
---|---|
author | Blanco, Javier Delgado Radusky, Leandro Climente-González, Héctor Serrano, Luis |
author_facet | Blanco, Javier Delgado Radusky, Leandro Climente-González, Héctor Serrano, Luis |
author_sort | Blanco, Javier Delgado |
collection | PubMed |
description | The speed at which new genomes are being sequenced highlights the need for genome-wide methods capable of predicting protein–DNA interactions. Here, we present PADA1, a generic algorithm that accurately models structural complexes and predicts the DNA-binding regions of resolved protein structures. PADA1 relies on a library of protein and double-stranded DNA fragment pairs obtained from a training set of 2103 DNA–protein complexes. It includes a fast statistical force field computed from atom-atom distances, to evaluate and filter the 3D docking models. Using published benchmark validation sets and 212 DNA–protein structures published after 2016 we predicted the DNA-binding regions with an RMSD of <1.8 Å per residue in >95% of the cases. We show that the quality of the docked templates is compatible with FoldX protein design tool suite to identify the crystallized DNA molecule sequence as the most energetically favorable in 80% of the cases. We highlighted the biological potential of PADA1 by reconstituting DNA and protein conformational changes upon protein mutagenesis of a meganuclease and its variants, and by predicting DNA-binding regions and nucleotide sequences in proteins crystallized without DNA. These results opens up new perspectives for the engineering of DNA–protein interfaces. |
format | Online Article Text |
id | pubmed-5934639 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-59346392018-05-09 FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1) Blanco, Javier Delgado Radusky, Leandro Climente-González, Héctor Serrano, Luis Nucleic Acids Res Computational Biology The speed at which new genomes are being sequenced highlights the need for genome-wide methods capable of predicting protein–DNA interactions. Here, we present PADA1, a generic algorithm that accurately models structural complexes and predicts the DNA-binding regions of resolved protein structures. PADA1 relies on a library of protein and double-stranded DNA fragment pairs obtained from a training set of 2103 DNA–protein complexes. It includes a fast statistical force field computed from atom-atom distances, to evaluate and filter the 3D docking models. Using published benchmark validation sets and 212 DNA–protein structures published after 2016 we predicted the DNA-binding regions with an RMSD of <1.8 Å per residue in >95% of the cases. We show that the quality of the docked templates is compatible with FoldX protein design tool suite to identify the crystallized DNA molecule sequence as the most energetically favorable in 80% of the cases. We highlighted the biological potential of PADA1 by reconstituting DNA and protein conformational changes upon protein mutagenesis of a meganuclease and its variants, and by predicting DNA-binding regions and nucleotide sequences in proteins crystallized without DNA. These results opens up new perspectives for the engineering of DNA–protein interfaces. Oxford University Press 2018-05-04 2018-03-28 /pmc/articles/PMC5934639/ /pubmed/29608705 http://dx.doi.org/10.1093/nar/gky228 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Biology Blanco, Javier Delgado Radusky, Leandro Climente-González, Héctor Serrano, Luis FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1) |
title | FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1) |
title_full | FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1) |
title_fullStr | FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1) |
title_full_unstemmed | FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1) |
title_short | FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1) |
title_sort | foldx accurate structural protein–dna binding prediction using pada1 (protein assisted dna assembly 1) |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5934639/ https://www.ncbi.nlm.nih.gov/pubmed/29608705 http://dx.doi.org/10.1093/nar/gky228 |
work_keys_str_mv | AT blancojavierdelgado foldxaccuratestructuralproteindnabindingpredictionusingpada1proteinassisteddnaassembly1 AT raduskyleandro foldxaccuratestructuralproteindnabindingpredictionusingpada1proteinassisteddnaassembly1 AT climentegonzalezhector foldxaccuratestructuralproteindnabindingpredictionusingpada1proteinassisteddnaassembly1 AT serranoluis foldxaccuratestructuralproteindnabindingpredictionusingpada1proteinassisteddnaassembly1 |