Cargando…

FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1)

The speed at which new genomes are being sequenced highlights the need for genome-wide methods capable of predicting protein–DNA interactions. Here, we present PADA1, a generic algorithm that accurately models structural complexes and predicts the DNA-binding regions of resolved protein structures....

Descripción completa

Detalles Bibliográficos
Autores principales: Blanco, Javier Delgado, Radusky, Leandro, Climente-González, Héctor, Serrano, Luis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5934639/
https://www.ncbi.nlm.nih.gov/pubmed/29608705
http://dx.doi.org/10.1093/nar/gky228
_version_ 1783320151470374912
author Blanco, Javier Delgado
Radusky, Leandro
Climente-González, Héctor
Serrano, Luis
author_facet Blanco, Javier Delgado
Radusky, Leandro
Climente-González, Héctor
Serrano, Luis
author_sort Blanco, Javier Delgado
collection PubMed
description The speed at which new genomes are being sequenced highlights the need for genome-wide methods capable of predicting protein–DNA interactions. Here, we present PADA1, a generic algorithm that accurately models structural complexes and predicts the DNA-binding regions of resolved protein structures. PADA1 relies on a library of protein and double-stranded DNA fragment pairs obtained from a training set of 2103 DNA–protein complexes. It includes a fast statistical force field computed from atom-atom distances, to evaluate and filter the 3D docking models. Using published benchmark validation sets and 212 DNA–protein structures published after 2016 we predicted the DNA-binding regions with an RMSD of <1.8 Å per residue in >95% of the cases. We show that the quality of the docked templates is compatible with FoldX protein design tool suite to identify the crystallized DNA molecule sequence as the most energetically favorable in 80% of the cases. We highlighted the biological potential of PADA1 by reconstituting DNA and protein conformational changes upon protein mutagenesis of a meganuclease and its variants, and by predicting DNA-binding regions and nucleotide sequences in proteins crystallized without DNA. These results opens up new perspectives for the engineering of DNA–protein interfaces.
format Online
Article
Text
id pubmed-5934639
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-59346392018-05-09 FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1) Blanco, Javier Delgado Radusky, Leandro Climente-González, Héctor Serrano, Luis Nucleic Acids Res Computational Biology The speed at which new genomes are being sequenced highlights the need for genome-wide methods capable of predicting protein–DNA interactions. Here, we present PADA1, a generic algorithm that accurately models structural complexes and predicts the DNA-binding regions of resolved protein structures. PADA1 relies on a library of protein and double-stranded DNA fragment pairs obtained from a training set of 2103 DNA–protein complexes. It includes a fast statistical force field computed from atom-atom distances, to evaluate and filter the 3D docking models. Using published benchmark validation sets and 212 DNA–protein structures published after 2016 we predicted the DNA-binding regions with an RMSD of <1.8 Å per residue in >95% of the cases. We show that the quality of the docked templates is compatible with FoldX protein design tool suite to identify the crystallized DNA molecule sequence as the most energetically favorable in 80% of the cases. We highlighted the biological potential of PADA1 by reconstituting DNA and protein conformational changes upon protein mutagenesis of a meganuclease and its variants, and by predicting DNA-binding regions and nucleotide sequences in proteins crystallized without DNA. These results opens up new perspectives for the engineering of DNA–protein interfaces. Oxford University Press 2018-05-04 2018-03-28 /pmc/articles/PMC5934639/ /pubmed/29608705 http://dx.doi.org/10.1093/nar/gky228 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Blanco, Javier Delgado
Radusky, Leandro
Climente-González, Héctor
Serrano, Luis
FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1)
title FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1)
title_full FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1)
title_fullStr FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1)
title_full_unstemmed FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1)
title_short FoldX accurate structural protein–DNA binding prediction using PADA1 (Protein Assisted DNA Assembly 1)
title_sort foldx accurate structural protein–dna binding prediction using pada1 (protein assisted dna assembly 1)
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5934639/
https://www.ncbi.nlm.nih.gov/pubmed/29608705
http://dx.doi.org/10.1093/nar/gky228
work_keys_str_mv AT blancojavierdelgado foldxaccuratestructuralproteindnabindingpredictionusingpada1proteinassisteddnaassembly1
AT raduskyleandro foldxaccuratestructuralproteindnabindingpredictionusingpada1proteinassisteddnaassembly1
AT climentegonzalezhector foldxaccuratestructuralproteindnabindingpredictionusingpada1proteinassisteddnaassembly1
AT serranoluis foldxaccuratestructuralproteindnabindingpredictionusingpada1proteinassisteddnaassembly1