Cargando…

Investigating the sequence landscape in the Drosophila initiator core promoter element using an enhanced MARZ algorithm

The core promoter elements are important DNA sequences for the regulation of RNA polymerase II transcription in eukaryotic cells. Despite the broad evolutionary conservation of these elements, there is extensive variation in the nucleotide composition of the actual sequences. In this study, we aim t...

Descripción completa

Detalles Bibliográficos
Autores principales: Dresch, Jacqueline M., Conrad, Regan D., Klonaros, Daniel, Drewell, Robert A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10290830/
https://www.ncbi.nlm.nih.gov/pubmed/37366427
http://dx.doi.org/10.7717/peerj.15597
_version_ 1785062573180190720
author Dresch, Jacqueline M.
Conrad, Regan D.
Klonaros, Daniel
Drewell, Robert A.
author_facet Dresch, Jacqueline M.
Conrad, Regan D.
Klonaros, Daniel
Drewell, Robert A.
author_sort Dresch, Jacqueline M.
collection PubMed
description The core promoter elements are important DNA sequences for the regulation of RNA polymerase II transcription in eukaryotic cells. Despite the broad evolutionary conservation of these elements, there is extensive variation in the nucleotide composition of the actual sequences. In this study, we aim to improve our understanding of the complexity of this sequence variation in the TATA box and initiator core promoter elements in Drosophila melanogaster. Using computational approaches, including an enhanced version of our previously developed MARZ algorithm that utilizes gapped nucleotide matrices, several sequence landscape features are uncovered, including an interdependency between the nucleotides in position 2 and 5 in the initiator. Incorporating this information in an expanded MARZ algorithm improves predictive performance for the identification of the initiator element. Overall our results demonstrate the need to carefully consider detailed sequence composition features in core promoter elements in order to make more robust and accurate bioinformatic predictions.
format Online
Article
Text
id pubmed-10290830
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-102908302023-06-26 Investigating the sequence landscape in the Drosophila initiator core promoter element using an enhanced MARZ algorithm Dresch, Jacqueline M. Conrad, Regan D. Klonaros, Daniel Drewell, Robert A. PeerJ Entomology The core promoter elements are important DNA sequences for the regulation of RNA polymerase II transcription in eukaryotic cells. Despite the broad evolutionary conservation of these elements, there is extensive variation in the nucleotide composition of the actual sequences. In this study, we aim to improve our understanding of the complexity of this sequence variation in the TATA box and initiator core promoter elements in Drosophila melanogaster. Using computational approaches, including an enhanced version of our previously developed MARZ algorithm that utilizes gapped nucleotide matrices, several sequence landscape features are uncovered, including an interdependency between the nucleotides in position 2 and 5 in the initiator. Incorporating this information in an expanded MARZ algorithm improves predictive performance for the identification of the initiator element. Overall our results demonstrate the need to carefully consider detailed sequence composition features in core promoter elements in order to make more robust and accurate bioinformatic predictions. PeerJ Inc. 2023-06-22 /pmc/articles/PMC10290830/ /pubmed/37366427 http://dx.doi.org/10.7717/peerj.15597 Text en © 2023 Dresch et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Entomology
Dresch, Jacqueline M.
Conrad, Regan D.
Klonaros, Daniel
Drewell, Robert A.
Investigating the sequence landscape in the Drosophila initiator core promoter element using an enhanced MARZ algorithm
title Investigating the sequence landscape in the Drosophila initiator core promoter element using an enhanced MARZ algorithm
title_full Investigating the sequence landscape in the Drosophila initiator core promoter element using an enhanced MARZ algorithm
title_fullStr Investigating the sequence landscape in the Drosophila initiator core promoter element using an enhanced MARZ algorithm
title_full_unstemmed Investigating the sequence landscape in the Drosophila initiator core promoter element using an enhanced MARZ algorithm
title_short Investigating the sequence landscape in the Drosophila initiator core promoter element using an enhanced MARZ algorithm
title_sort investigating the sequence landscape in the drosophila initiator core promoter element using an enhanced marz algorithm
topic Entomology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10290830/
https://www.ncbi.nlm.nih.gov/pubmed/37366427
http://dx.doi.org/10.7717/peerj.15597
work_keys_str_mv AT dreschjacquelinem investigatingthesequencelandscapeinthedrosophilainitiatorcorepromoterelementusinganenhancedmarzalgorithm
AT conradregand investigatingthesequencelandscapeinthedrosophilainitiatorcorepromoterelementusinganenhancedmarzalgorithm
AT klonarosdaniel investigatingthesequencelandscapeinthedrosophilainitiatorcorepromoterelementusinganenhancedmarzalgorithm
AT drewellroberta investigatingthesequencelandscapeinthedrosophilainitiatorcorepromoterelementusinganenhancedmarzalgorithm