Cargando…

Sequence-Specific Recognition of DNA by Proteins: Binding Motifs Discovered Using a Novel Statistical/Computational Analysis

Decades of intensive experimental studies of the recognition of DNA sequences by proteins have provided us with a view of a diverse and complicated world in which few to no features are shared between individual DNA-binding protein families. The originally conceived direct readout of DNA residue seq...

Descripción completa

Detalles Bibliográficos
Autores principales: Jakubec, David, Laskowski, Roman A., Vondrasek, Jiri
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4934765/
https://www.ncbi.nlm.nih.gov/pubmed/27384774
http://dx.doi.org/10.1371/journal.pone.0158704
_version_ 1782441384688484352
author Jakubec, David
Laskowski, Roman A.
Vondrasek, Jiri
author_facet Jakubec, David
Laskowski, Roman A.
Vondrasek, Jiri
author_sort Jakubec, David
collection PubMed
description Decades of intensive experimental studies of the recognition of DNA sequences by proteins have provided us with a view of a diverse and complicated world in which few to no features are shared between individual DNA-binding protein families. The originally conceived direct readout of DNA residue sequences by amino acid side chains offers very limited capacity for sequence recognition, while the effects of the dynamic properties of the interacting partners remain difficult to quantify and almost impossible to generalise. In this work we investigated the energetic characteristics of all DNA residue—amino acid side chain combinations in the conformations found at the interaction interface in a very large set of protein—DNA complexes by the means of empirical potential-based calculations. General specificity-defining criteria were derived and utilised to look beyond the binding motifs considered in previous studies. Linking energetic favourability to the observed geometrical preferences, our approach reveals several additional amino acid motifs which can distinguish between individual DNA bases. Our results remained valid in environments with various dielectric properties.
format Online
Article
Text
id pubmed-4934765
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-49347652016-07-18 Sequence-Specific Recognition of DNA by Proteins: Binding Motifs Discovered Using a Novel Statistical/Computational Analysis Jakubec, David Laskowski, Roman A. Vondrasek, Jiri PLoS One Research Article Decades of intensive experimental studies of the recognition of DNA sequences by proteins have provided us with a view of a diverse and complicated world in which few to no features are shared between individual DNA-binding protein families. The originally conceived direct readout of DNA residue sequences by amino acid side chains offers very limited capacity for sequence recognition, while the effects of the dynamic properties of the interacting partners remain difficult to quantify and almost impossible to generalise. In this work we investigated the energetic characteristics of all DNA residue—amino acid side chain combinations in the conformations found at the interaction interface in a very large set of protein—DNA complexes by the means of empirical potential-based calculations. General specificity-defining criteria were derived and utilised to look beyond the binding motifs considered in previous studies. Linking energetic favourability to the observed geometrical preferences, our approach reveals several additional amino acid motifs which can distinguish between individual DNA bases. Our results remained valid in environments with various dielectric properties. Public Library of Science 2016-07-06 /pmc/articles/PMC4934765/ /pubmed/27384774 http://dx.doi.org/10.1371/journal.pone.0158704 Text en © 2016 Jakubec et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Jakubec, David
Laskowski, Roman A.
Vondrasek, Jiri
Sequence-Specific Recognition of DNA by Proteins: Binding Motifs Discovered Using a Novel Statistical/Computational Analysis
title Sequence-Specific Recognition of DNA by Proteins: Binding Motifs Discovered Using a Novel Statistical/Computational Analysis
title_full Sequence-Specific Recognition of DNA by Proteins: Binding Motifs Discovered Using a Novel Statistical/Computational Analysis
title_fullStr Sequence-Specific Recognition of DNA by Proteins: Binding Motifs Discovered Using a Novel Statistical/Computational Analysis
title_full_unstemmed Sequence-Specific Recognition of DNA by Proteins: Binding Motifs Discovered Using a Novel Statistical/Computational Analysis
title_short Sequence-Specific Recognition of DNA by Proteins: Binding Motifs Discovered Using a Novel Statistical/Computational Analysis
title_sort sequence-specific recognition of dna by proteins: binding motifs discovered using a novel statistical/computational analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4934765/
https://www.ncbi.nlm.nih.gov/pubmed/27384774
http://dx.doi.org/10.1371/journal.pone.0158704
work_keys_str_mv AT jakubecdavid sequencespecificrecognitionofdnabyproteinsbindingmotifsdiscoveredusinganovelstatisticalcomputationalanalysis
AT laskowskiromana sequencespecificrecognitionofdnabyproteinsbindingmotifsdiscoveredusinganovelstatisticalcomputationalanalysis
AT vondrasekjiri sequencespecificrecognitionofdnabyproteinsbindingmotifsdiscoveredusinganovelstatisticalcomputationalanalysis