Cargando…

Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein

Protein–DNA binding is of a great interest due to its importance in many biological processes. Previous studies have presented many factors responsible for the recognition and specificity, but understanding the minimal informational requirements for proteins that bind to multiple DNA-sites is still...

Descripción completa

Detalles Bibliográficos
Autores principales: Sedhom, Jacklin, Kinser, Jason, Solomon, Lee A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9803871/
https://www.ncbi.nlm.nih.gov/pubmed/36601576
http://dx.doi.org/10.1093/nargab/lqac101
_version_ 1784861979894087680
author Sedhom, Jacklin
Kinser, Jason
Solomon, Lee A
author_facet Sedhom, Jacklin
Kinser, Jason
Solomon, Lee A
author_sort Sedhom, Jacklin
collection PubMed
description Protein–DNA binding is of a great interest due to its importance in many biological processes. Previous studies have presented many factors responsible for the recognition and specificity, but understanding the minimal informational requirements for proteins that bind to multiple DNA-sites is still an understudied area of bioinformatics. Here we focus on the hydrogen bonds displayed by the target DNA in the major groove that take part in protein-binding. We show that analyses focused on the base pair identity may overlook key hydrogen bonds. We have developed an algorithm that converts a nucleotide sequence into an array of hydrogen bond donors and acceptors and methyl groups. It then aligns these non-covalent interaction arrays to identify what information is being maintained among multiple DNA sequences. For three different DNA-binding proteins, Lactose repressor, controller protein and λ-CI repressor, we uncovered the minimal pattern of hydrogen bonds that are common amongst all the binding sequences. Notably in the three proteins, key interacting hydrogen bonds are maintained despite nucleobase mutations in the corresponding binding sites. We believe this work will be useful for developing new DNA binding proteins and shed new light on evolutionary relationships.
format Online
Article
Text
id pubmed-9803871
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98038712023-01-03 Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein Sedhom, Jacklin Kinser, Jason Solomon, Lee A NAR Genom Bioinform Standard Article Protein–DNA binding is of a great interest due to its importance in many biological processes. Previous studies have presented many factors responsible for the recognition and specificity, but understanding the minimal informational requirements for proteins that bind to multiple DNA-sites is still an understudied area of bioinformatics. Here we focus on the hydrogen bonds displayed by the target DNA in the major groove that take part in protein-binding. We show that analyses focused on the base pair identity may overlook key hydrogen bonds. We have developed an algorithm that converts a nucleotide sequence into an array of hydrogen bond donors and acceptors and methyl groups. It then aligns these non-covalent interaction arrays to identify what information is being maintained among multiple DNA sequences. For three different DNA-binding proteins, Lactose repressor, controller protein and λ-CI repressor, we uncovered the minimal pattern of hydrogen bonds that are common amongst all the binding sequences. Notably in the three proteins, key interacting hydrogen bonds are maintained despite nucleobase mutations in the corresponding binding sites. We believe this work will be useful for developing new DNA binding proteins and shed new light on evolutionary relationships. Oxford University Press 2022-12-31 /pmc/articles/PMC9803871/ /pubmed/36601576 http://dx.doi.org/10.1093/nargab/lqac101 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Standard Article
Sedhom, Jacklin
Kinser, Jason
Solomon, Lee A
Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein
title Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein
title_full Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein
title_fullStr Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein
title_full_unstemmed Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein
title_short Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein
title_sort alignment of major-groove hydrogen bond arrays uncovers shared information between different dna sequences that bind the same protein
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9803871/
https://www.ncbi.nlm.nih.gov/pubmed/36601576
http://dx.doi.org/10.1093/nargab/lqac101
work_keys_str_mv AT sedhomjacklin alignmentofmajorgroovehydrogenbondarraysuncoverssharedinformationbetweendifferentdnasequencesthatbindthesameprotein
AT kinserjason alignmentofmajorgroovehydrogenbondarraysuncoverssharedinformationbetweendifferentdnasequencesthatbindthesameprotein
AT solomonleea alignmentofmajorgroovehydrogenbondarraysuncoverssharedinformationbetweendifferentdnasequencesthatbindthesameprotein