Cargando…

On Docking, Scoring and Assessing Protein-DNA Complexes in a Rigid-Body Framework

We consider the identification of interacting protein-nucleic acid partners using the rigid body docking method FTdock, which is systematic and exhaustive in the exploration of docking conformations. The accuracy of rigid body docking methods is tested using known protein-DNA complexes for which the...

Descripción completa

Detalles Bibliográficos
Autores principales: Parisien, Marc, Freed, Karl F., Sosnick, Tobin R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3290582/
https://www.ncbi.nlm.nih.gov/pubmed/22393431
http://dx.doi.org/10.1371/journal.pone.0032647
_version_ 1782225021627793408
author Parisien, Marc
Freed, Karl F.
Sosnick, Tobin R.
author_facet Parisien, Marc
Freed, Karl F.
Sosnick, Tobin R.
author_sort Parisien, Marc
collection PubMed
description We consider the identification of interacting protein-nucleic acid partners using the rigid body docking method FTdock, which is systematic and exhaustive in the exploration of docking conformations. The accuracy of rigid body docking methods is tested using known protein-DNA complexes for which the docked and undocked structures are both available. Additional tests with large decoy sets probe the efficacy of two published statistically derived scoring functions that contain a huge number of parameters. In contrast, we demonstrate that state-of-the-art machine learning techniques can enormously reduce the number of parameters required, thereby identifying the relevant docking features using a miniscule fraction of the number of parameters in the prior works. The present machine learning study considers a 300 dimensional vector (dependent on only 15 parameters), termed the Chemical Context Profile (CCP), where each dimension reflects a specific type of protein amino acid-nucleic acid base interaction. The CCP is designed to capture the chemical complementarities of the interface and is well suited for machine learning techniques. Our objective function is the Chemical Context Discrepancy (CCD), which is defined as the angle between the native system's CCP vector and the decoy's vector and which serves as a substitute for the more commonly used root mean squared deviation (RMSD). We demonstrate that the CCP provides a useful scoring function when certain dimensions are properly weighted. Finally, we explore how the amino acids on a protein's surface can help guide DNA binding, first through long-range interactions, followed by direct contacts, according to specific preferences for either the major or minor grooves of the DNA.
format Online
Article
Text
id pubmed-3290582
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32905822012-03-05 On Docking, Scoring and Assessing Protein-DNA Complexes in a Rigid-Body Framework Parisien, Marc Freed, Karl F. Sosnick, Tobin R. PLoS One Research Article We consider the identification of interacting protein-nucleic acid partners using the rigid body docking method FTdock, which is systematic and exhaustive in the exploration of docking conformations. The accuracy of rigid body docking methods is tested using known protein-DNA complexes for which the docked and undocked structures are both available. Additional tests with large decoy sets probe the efficacy of two published statistically derived scoring functions that contain a huge number of parameters. In contrast, we demonstrate that state-of-the-art machine learning techniques can enormously reduce the number of parameters required, thereby identifying the relevant docking features using a miniscule fraction of the number of parameters in the prior works. The present machine learning study considers a 300 dimensional vector (dependent on only 15 parameters), termed the Chemical Context Profile (CCP), where each dimension reflects a specific type of protein amino acid-nucleic acid base interaction. The CCP is designed to capture the chemical complementarities of the interface and is well suited for machine learning techniques. Our objective function is the Chemical Context Discrepancy (CCD), which is defined as the angle between the native system's CCP vector and the decoy's vector and which serves as a substitute for the more commonly used root mean squared deviation (RMSD). We demonstrate that the CCP provides a useful scoring function when certain dimensions are properly weighted. Finally, we explore how the amino acids on a protein's surface can help guide DNA binding, first through long-range interactions, followed by direct contacts, according to specific preferences for either the major or minor grooves of the DNA. Public Library of Science 2012-02-29 /pmc/articles/PMC3290582/ /pubmed/22393431 http://dx.doi.org/10.1371/journal.pone.0032647 Text en Parisien et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Parisien, Marc
Freed, Karl F.
Sosnick, Tobin R.
On Docking, Scoring and Assessing Protein-DNA Complexes in a Rigid-Body Framework
title On Docking, Scoring and Assessing Protein-DNA Complexes in a Rigid-Body Framework
title_full On Docking, Scoring and Assessing Protein-DNA Complexes in a Rigid-Body Framework
title_fullStr On Docking, Scoring and Assessing Protein-DNA Complexes in a Rigid-Body Framework
title_full_unstemmed On Docking, Scoring and Assessing Protein-DNA Complexes in a Rigid-Body Framework
title_short On Docking, Scoring and Assessing Protein-DNA Complexes in a Rigid-Body Framework
title_sort on docking, scoring and assessing protein-dna complexes in a rigid-body framework
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3290582/
https://www.ncbi.nlm.nih.gov/pubmed/22393431
http://dx.doi.org/10.1371/journal.pone.0032647
work_keys_str_mv AT parisienmarc ondockingscoringandassessingproteindnacomplexesinarigidbodyframework
AT freedkarlf ondockingscoringandassessingproteindnacomplexesinarigidbodyframework
AT sosnicktobinr ondockingscoringandassessingproteindnacomplexesinarigidbodyframework