Cargando…

Genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein–DNA binding

Protein–DNA binding is a fundamental component of gene regulatory processes, but it is still not completely understood how proteins recognize their target sites in the genome. Besides hydrogen bonding in the major groove (base readout), proteins recognize minor-groove geometry using positively charg...

Descripción completa

Detalles Bibliográficos
Autores principales: Chiu, Tsu-Pei, Rao, Satyanarayan, Mann, Richard S., Honig, Barry, Rohs, Remo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5716191/
https://www.ncbi.nlm.nih.gov/pubmed/29040720
http://dx.doi.org/10.1093/nar/gkx915
_version_ 1783283896436129792
author Chiu, Tsu-Pei
Rao, Satyanarayan
Mann, Richard S.
Honig, Barry
Rohs, Remo
author_facet Chiu, Tsu-Pei
Rao, Satyanarayan
Mann, Richard S.
Honig, Barry
Rohs, Remo
author_sort Chiu, Tsu-Pei
collection PubMed
description Protein–DNA binding is a fundamental component of gene regulatory processes, but it is still not completely understood how proteins recognize their target sites in the genome. Besides hydrogen bonding in the major groove (base readout), proteins recognize minor-groove geometry using positively charged amino acids (shape readout). The underlying mechanism of DNA shape readout involves the correlation between minor-groove width and electrostatic potential (EP). To probe this biophysical effect directly, rather than using minor-groove width as an indirect measure for shape readout, we developed a methodology, DNAphi, for predicting EP in the minor groove and confirmed the direct role of EP in protein–DNA binding using massive sequencing data. The DNAphi method uses a sliding-window approach to mine results from non-linear Poisson–Boltzmann (NLPB) calculations on DNA structures derived from all-atom Monte Carlo simulations. We validated this approach, which only requires nucleotide sequence as input, based on direct comparison with NLPB calculations for available crystal structures. Using statistical machine-learning approaches, we showed that adding EP as a biophysical feature can improve the predictive power of quantitative binding specificity models across 27 transcription factor families. High-throughput prediction of EP offers a novel way to integrate biophysical and genomic studies of protein–DNA binding.
format Online
Article
Text
id pubmed-5716191
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57161912017-12-08 Genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein–DNA binding Chiu, Tsu-Pei Rao, Satyanarayan Mann, Richard S. Honig, Barry Rohs, Remo Nucleic Acids Res Structural Biology Protein–DNA binding is a fundamental component of gene regulatory processes, but it is still not completely understood how proteins recognize their target sites in the genome. Besides hydrogen bonding in the major groove (base readout), proteins recognize minor-groove geometry using positively charged amino acids (shape readout). The underlying mechanism of DNA shape readout involves the correlation between minor-groove width and electrostatic potential (EP). To probe this biophysical effect directly, rather than using minor-groove width as an indirect measure for shape readout, we developed a methodology, DNAphi, for predicting EP in the minor groove and confirmed the direct role of EP in protein–DNA binding using massive sequencing data. The DNAphi method uses a sliding-window approach to mine results from non-linear Poisson–Boltzmann (NLPB) calculations on DNA structures derived from all-atom Monte Carlo simulations. We validated this approach, which only requires nucleotide sequence as input, based on direct comparison with NLPB calculations for available crystal structures. Using statistical machine-learning approaches, we showed that adding EP as a biophysical feature can improve the predictive power of quantitative binding specificity models across 27 transcription factor families. High-throughput prediction of EP offers a novel way to integrate biophysical and genomic studies of protein–DNA binding. Oxford University Press 2017-12-01 2017-10-11 /pmc/articles/PMC5716191/ /pubmed/29040720 http://dx.doi.org/10.1093/nar/gkx915 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Structural Biology
Chiu, Tsu-Pei
Rao, Satyanarayan
Mann, Richard S.
Honig, Barry
Rohs, Remo
Genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein–DNA binding
title Genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein–DNA binding
title_full Genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein–DNA binding
title_fullStr Genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein–DNA binding
title_full_unstemmed Genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein–DNA binding
title_short Genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein–DNA binding
title_sort genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein–dna binding
topic Structural Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5716191/
https://www.ncbi.nlm.nih.gov/pubmed/29040720
http://dx.doi.org/10.1093/nar/gkx915
work_keys_str_mv AT chiutsupei genomewidepredictionofminorgrooveelectrostaticpotentialenablesbiophysicalmodelingofproteindnabinding
AT raosatyanarayan genomewidepredictionofminorgrooveelectrostaticpotentialenablesbiophysicalmodelingofproteindnabinding
AT mannrichards genomewidepredictionofminorgrooveelectrostaticpotentialenablesbiophysicalmodelingofproteindnabinding
AT honigbarry genomewidepredictionofminorgrooveelectrostaticpotentialenablesbiophysicalmodelingofproteindnabinding
AT rohsremo genomewidepredictionofminorgrooveelectrostaticpotentialenablesbiophysicalmodelingofproteindnabinding