Cargando…

Physicochemical models of protein–DNA binding with standard and modified base pairs

DNA-binding proteins play important roles in various cellular processes, but the mechanisms by which proteins recognize genomic target sites remain incompletely understood. Functional groups at the edges of the base pairs (bp) exposed in the DNA grooves represent physicochemical signatures. As these...

Descripción completa

Detalles Bibliográficos
Autores principales: Chiu, Tsu-Pei, Rao, Satyanarayan, Rohs, Remo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9942898/
https://www.ncbi.nlm.nih.gov/pubmed/36656856
http://dx.doi.org/10.1073/pnas.2205796120
_version_ 1784891596701958144
author Chiu, Tsu-Pei
Rao, Satyanarayan
Rohs, Remo
author_facet Chiu, Tsu-Pei
Rao, Satyanarayan
Rohs, Remo
author_sort Chiu, Tsu-Pei
collection PubMed
description DNA-binding proteins play important roles in various cellular processes, but the mechanisms by which proteins recognize genomic target sites remain incompletely understood. Functional groups at the edges of the base pairs (bp) exposed in the DNA grooves represent physicochemical signatures. As these signatures enable proteins to form specific contacts between protein residues and bp, their study can provide mechanistic insights into protein–DNA binding. Existing experimental methods, such as X-ray crystallography, can reveal such mechanisms based on physicochemical interactions between proteins and their DNA target sites. However, the low throughput of structural biology methods limits mechanistic insights for selection of many genomic sites. High-throughput binding assays enable prediction of potential target sites by determining relative binding affinities of a protein to massive numbers of DNA sequences. Many currently available computational methods are based on the sequence of standard Watson–Crick bp. They assume that the contribution of overall binding affinity is independent for each base pair, or alternatively include dinucleotides or short k-mers. These methods cannot directly expand to physicochemical contacts, and they are not suitable to apply to DNA modifications or non-Watson–Crick bp. These variations include DNA methylation, and synthetic or mismatched bp. The proposed method, DeepRec, can predict relative binding affinities as function of physicochemical signatures and the effect of DNA methylation or other chemical modifications on binding. Sequence-based modeling methods are in comparison a coarse-grain description and cannot achieve such insights. Our chemistry-based modeling framework provides a path towards understanding genome function at a mechanistic level.
format Online
Article
Text
id pubmed-9942898
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-99428982023-02-22 Physicochemical models of protein–DNA binding with standard and modified base pairs Chiu, Tsu-Pei Rao, Satyanarayan Rohs, Remo Proc Natl Acad Sci U S A Biological Sciences DNA-binding proteins play important roles in various cellular processes, but the mechanisms by which proteins recognize genomic target sites remain incompletely understood. Functional groups at the edges of the base pairs (bp) exposed in the DNA grooves represent physicochemical signatures. As these signatures enable proteins to form specific contacts between protein residues and bp, their study can provide mechanistic insights into protein–DNA binding. Existing experimental methods, such as X-ray crystallography, can reveal such mechanisms based on physicochemical interactions between proteins and their DNA target sites. However, the low throughput of structural biology methods limits mechanistic insights for selection of many genomic sites. High-throughput binding assays enable prediction of potential target sites by determining relative binding affinities of a protein to massive numbers of DNA sequences. Many currently available computational methods are based on the sequence of standard Watson–Crick bp. They assume that the contribution of overall binding affinity is independent for each base pair, or alternatively include dinucleotides or short k-mers. These methods cannot directly expand to physicochemical contacts, and they are not suitable to apply to DNA modifications or non-Watson–Crick bp. These variations include DNA methylation, and synthetic or mismatched bp. The proposed method, DeepRec, can predict relative binding affinities as function of physicochemical signatures and the effect of DNA methylation or other chemical modifications on binding. Sequence-based modeling methods are in comparison a coarse-grain description and cannot achieve such insights. Our chemistry-based modeling framework provides a path towards understanding genome function at a mechanistic level. National Academy of Sciences 2023-01-19 2023-01-24 /pmc/articles/PMC9942898/ /pubmed/36656856 http://dx.doi.org/10.1073/pnas.2205796120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Chiu, Tsu-Pei
Rao, Satyanarayan
Rohs, Remo
Physicochemical models of protein–DNA binding with standard and modified base pairs
title Physicochemical models of protein–DNA binding with standard and modified base pairs
title_full Physicochemical models of protein–DNA binding with standard and modified base pairs
title_fullStr Physicochemical models of protein–DNA binding with standard and modified base pairs
title_full_unstemmed Physicochemical models of protein–DNA binding with standard and modified base pairs
title_short Physicochemical models of protein–DNA binding with standard and modified base pairs
title_sort physicochemical models of protein–dna binding with standard and modified base pairs
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9942898/
https://www.ncbi.nlm.nih.gov/pubmed/36656856
http://dx.doi.org/10.1073/pnas.2205796120
work_keys_str_mv AT chiutsupei physicochemicalmodelsofproteindnabindingwithstandardandmodifiedbasepairs
AT raosatyanarayan physicochemicalmodelsofproteindnabindingwithstandardandmodifiedbasepairs
AT rohsremo physicochemicalmodelsofproteindnabindingwithstandardandmodifiedbasepairs