Cargando…

Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties

Interactions between proteins and nucleic acids are at the heart of many essential biological processes. Despite increasing structural information about how these interactions may take place, our understanding of the usage made of protein surfaces by nucleic acids is still very limited. This is in p...

Descripción completa

Detalles Bibliográficos
Autores principales: Corsi, Flavia, Lavery, Richard, Laine, Elodie, Carbone, Alessandra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7018136/
https://www.ncbi.nlm.nih.gov/pubmed/32012150
http://dx.doi.org/10.1371/journal.pcbi.1007624
_version_ 1783497307943075840
author Corsi, Flavia
Lavery, Richard
Laine, Elodie
Carbone, Alessandra
author_facet Corsi, Flavia
Lavery, Richard
Laine, Elodie
Carbone, Alessandra
author_sort Corsi, Flavia
collection PubMed
description Interactions between proteins and nucleic acids are at the heart of many essential biological processes. Despite increasing structural information about how these interactions may take place, our understanding of the usage made of protein surfaces by nucleic acids is still very limited. This is in part due to the inherent complexity associated to protein surface deformability and evolution. In this work, we present a method that contributes to decipher such complexity by predicting protein-DNA interfaces and characterizing their properties. It relies on three biologically and physically meaningful descriptors, namely evolutionary conservation, physico-chemical properties and surface geometry. We carefully assessed its performance on several hundreds of protein structures and compared it to several machine-learning state-of-the-art methods. Our approach achieves a higher sensitivity compared to the other methods, with a similar precision. Importantly, we show that it is able to unravel ‘hidden’ binding sites by applying it to unbound protein structures and to proteins binding to DNA via multiple sites and in different conformations. It is also applicable to the detection of RNA-binding sites, without significant loss of performance. This confirms that DNA and RNA-binding sites share similar properties. Our method is implemented as a fully automated tool, [Image: see text] , freely accessible at: http://www.lcqb.upmc.fr/JET2DNA. We also provide a new dataset of 187 protein-DNA complex structures, along with a subset of 82 associated unbound structures. The set represents the largest body of high-resolution crystallographic structures of protein-DNA complexes, use biological protein assemblies as DNA-binding units, and covers all major types of protein-DNA interactions. It is available at: http://www.lcqb.upmc.fr/PDNAbenchmarks.
format Online
Article
Text
id pubmed-7018136
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-70181362020-02-26 Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties Corsi, Flavia Lavery, Richard Laine, Elodie Carbone, Alessandra PLoS Comput Biol Research Article Interactions between proteins and nucleic acids are at the heart of many essential biological processes. Despite increasing structural information about how these interactions may take place, our understanding of the usage made of protein surfaces by nucleic acids is still very limited. This is in part due to the inherent complexity associated to protein surface deformability and evolution. In this work, we present a method that contributes to decipher such complexity by predicting protein-DNA interfaces and characterizing their properties. It relies on three biologically and physically meaningful descriptors, namely evolutionary conservation, physico-chemical properties and surface geometry. We carefully assessed its performance on several hundreds of protein structures and compared it to several machine-learning state-of-the-art methods. Our approach achieves a higher sensitivity compared to the other methods, with a similar precision. Importantly, we show that it is able to unravel ‘hidden’ binding sites by applying it to unbound protein structures and to proteins binding to DNA via multiple sites and in different conformations. It is also applicable to the detection of RNA-binding sites, without significant loss of performance. This confirms that DNA and RNA-binding sites share similar properties. Our method is implemented as a fully automated tool, [Image: see text] , freely accessible at: http://www.lcqb.upmc.fr/JET2DNA. We also provide a new dataset of 187 protein-DNA complex structures, along with a subset of 82 associated unbound structures. The set represents the largest body of high-resolution crystallographic structures of protein-DNA complexes, use biological protein assemblies as DNA-binding units, and covers all major types of protein-DNA interactions. It is available at: http://www.lcqb.upmc.fr/PDNAbenchmarks. Public Library of Science 2020-02-03 /pmc/articles/PMC7018136/ /pubmed/32012150 http://dx.doi.org/10.1371/journal.pcbi.1007624 Text en © 2020 Corsi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Corsi, Flavia
Lavery, Richard
Laine, Elodie
Carbone, Alessandra
Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties
title Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties
title_full Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties
title_fullStr Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties
title_full_unstemmed Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties
title_short Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties
title_sort multiple protein-dna interfaces unravelled by evolutionary information, physico-chemical and geometrical properties
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7018136/
https://www.ncbi.nlm.nih.gov/pubmed/32012150
http://dx.doi.org/10.1371/journal.pcbi.1007624
work_keys_str_mv AT corsiflavia multipleproteindnainterfacesunravelledbyevolutionaryinformationphysicochemicalandgeometricalproperties
AT laveryrichard multipleproteindnainterfacesunravelledbyevolutionaryinformationphysicochemicalandgeometricalproperties
AT laineelodie multipleproteindnainterfacesunravelledbyevolutionaryinformationphysicochemicalandgeometricalproperties
AT carbonealessandra multipleproteindnainterfacesunravelledbyevolutionaryinformationphysicochemicalandgeometricalproperties