Cargando…
Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein
Protein–DNA binding is of a great interest due to its importance in many biological processes. Previous studies have presented many factors responsible for the recognition and specificity, but understanding the minimal informational requirements for proteins that bind to multiple DNA-sites is still...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9803871/ https://www.ncbi.nlm.nih.gov/pubmed/36601576 http://dx.doi.org/10.1093/nargab/lqac101 |
_version_ | 1784861979894087680 |
---|---|
author | Sedhom, Jacklin Kinser, Jason Solomon, Lee A |
author_facet | Sedhom, Jacklin Kinser, Jason Solomon, Lee A |
author_sort | Sedhom, Jacklin |
collection | PubMed |
description | Protein–DNA binding is of a great interest due to its importance in many biological processes. Previous studies have presented many factors responsible for the recognition and specificity, but understanding the minimal informational requirements for proteins that bind to multiple DNA-sites is still an understudied area of bioinformatics. Here we focus on the hydrogen bonds displayed by the target DNA in the major groove that take part in protein-binding. We show that analyses focused on the base pair identity may overlook key hydrogen bonds. We have developed an algorithm that converts a nucleotide sequence into an array of hydrogen bond donors and acceptors and methyl groups. It then aligns these non-covalent interaction arrays to identify what information is being maintained among multiple DNA sequences. For three different DNA-binding proteins, Lactose repressor, controller protein and λ-CI repressor, we uncovered the minimal pattern of hydrogen bonds that are common amongst all the binding sequences. Notably in the three proteins, key interacting hydrogen bonds are maintained despite nucleobase mutations in the corresponding binding sites. We believe this work will be useful for developing new DNA binding proteins and shed new light on evolutionary relationships. |
format | Online Article Text |
id | pubmed-9803871 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98038712023-01-03 Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein Sedhom, Jacklin Kinser, Jason Solomon, Lee A NAR Genom Bioinform Standard Article Protein–DNA binding is of a great interest due to its importance in many biological processes. Previous studies have presented many factors responsible for the recognition and specificity, but understanding the minimal informational requirements for proteins that bind to multiple DNA-sites is still an understudied area of bioinformatics. Here we focus on the hydrogen bonds displayed by the target DNA in the major groove that take part in protein-binding. We show that analyses focused on the base pair identity may overlook key hydrogen bonds. We have developed an algorithm that converts a nucleotide sequence into an array of hydrogen bond donors and acceptors and methyl groups. It then aligns these non-covalent interaction arrays to identify what information is being maintained among multiple DNA sequences. For three different DNA-binding proteins, Lactose repressor, controller protein and λ-CI repressor, we uncovered the minimal pattern of hydrogen bonds that are common amongst all the binding sequences. Notably in the three proteins, key interacting hydrogen bonds are maintained despite nucleobase mutations in the corresponding binding sites. We believe this work will be useful for developing new DNA binding proteins and shed new light on evolutionary relationships. Oxford University Press 2022-12-31 /pmc/articles/PMC9803871/ /pubmed/36601576 http://dx.doi.org/10.1093/nargab/lqac101 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Standard Article Sedhom, Jacklin Kinser, Jason Solomon, Lee A Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein |
title | Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein |
title_full | Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein |
title_fullStr | Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein |
title_full_unstemmed | Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein |
title_short | Alignment of major-groove hydrogen bond arrays uncovers shared information between different DNA sequences that bind the same protein |
title_sort | alignment of major-groove hydrogen bond arrays uncovers shared information between different dna sequences that bind the same protein |
topic | Standard Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9803871/ https://www.ncbi.nlm.nih.gov/pubmed/36601576 http://dx.doi.org/10.1093/nargab/lqac101 |
work_keys_str_mv | AT sedhomjacklin alignmentofmajorgroovehydrogenbondarraysuncoverssharedinformationbetweendifferentdnasequencesthatbindthesameprotein AT kinserjason alignmentofmajorgroovehydrogenbondarraysuncoverssharedinformationbetweendifferentdnasequencesthatbindthesameprotein AT solomonleea alignmentofmajorgroovehydrogenbondarraysuncoverssharedinformationbetweendifferentdnasequencesthatbindthesameprotein |