Cargando…

TSS seq based core promoter architecture in blood feeding Tsetse fly (Glossina morsitans morsitans) vector of Trypanosomiasis

BACKGROUND: Transcription initiation regulation is mediated by sequence-specific interactions between DNA-binding proteins (transcription factors) and cis-elements, where BRE, TATA, INR, DPE and MTE motifs constitute canonical core motifs for basal transcription initiation of genes. Accurate identif...

Descripción completa

Detalles Bibliográficos
Autores principales: Mwangi, Sarah, Attardo, Geoffrey, Suzuki, Yutaka, Aksoy, Serap, Christoffels, Alan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4578606/
https://www.ncbi.nlm.nih.gov/pubmed/26394619
http://dx.doi.org/10.1186/s12864-015-1921-6
_version_ 1782391142013206528
author Mwangi, Sarah
Attardo, Geoffrey
Suzuki, Yutaka
Aksoy, Serap
Christoffels, Alan
author_facet Mwangi, Sarah
Attardo, Geoffrey
Suzuki, Yutaka
Aksoy, Serap
Christoffels, Alan
author_sort Mwangi, Sarah
collection PubMed
description BACKGROUND: Transcription initiation regulation is mediated by sequence-specific interactions between DNA-binding proteins (transcription factors) and cis-elements, where BRE, TATA, INR, DPE and MTE motifs constitute canonical core motifs for basal transcription initiation of genes. Accurate identification of transcription start site (TSS) and their corresponding promoter regions is critical for delineation of these motifs. To this end, the genome scale analysis of core promoter architecture in insects has been confined to Drosophila. The recently sequenced Tsetse fly genome provides a unique opportunity to analyze transcription initiation regulation machinery in blood-feeding insects. RESULTS: A computational method for identification of TSS in newly sequenced Tsetse fly genome was evaluated, using TSS seq tags sampled from two developmental stages namely; larvae and pupae. There were 3134 tag clusters among which 45.4 % (1424) of the tag clusters mapped to first coding exons or their proximal predicted 5′UTR regions and 1.0 % (31) tag clusters mapping to transposons, within a threshold of 100 tags per cluster. These 1393 non transposon-derived core promoters had propensity for AT nucleotides. The −1/+1 and 1/+1 positions in D. melanogaster, and G. m. morsitans had propensity for CA and AA dinucleotides respectively. The 1393 tag clusters comprised narrow promoters (5 %), broad with peak promoters (23 %) and broad without peak promoters (72 %). Two-way motif co-occurrence analysis showed that the MTE-DPE pair is over-represented in broad core promoters. The frequently occurring triplet motifs in all promoter classes are the INR-MTE-DPE, TATA-MTE-DPE and TATA-INR-DPE. Promoters without the TATA motif had higher frequency of the MTE and INR motifs than those observed in Drosophila, where the DPE motif occur more frequently in promoters without TATA motif. Gene ontology terms associated with developmental processes were overrepresented in the narrow and broad with peak promoters. CONCLUSIONS: The study has identified different motif combinations associated with broad promoters in a blood-feeding insect. In the case of TATA-less core promoters, G.m. morsitans uses the MTE to compensate for the lack of a TATA motif. The increasing availability of TSS seq data allows for revision of existing gene annotation datasets with the potential of identifying new transcriptional units. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1921-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4578606
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45786062015-09-23 TSS seq based core promoter architecture in blood feeding Tsetse fly (Glossina morsitans morsitans) vector of Trypanosomiasis Mwangi, Sarah Attardo, Geoffrey Suzuki, Yutaka Aksoy, Serap Christoffels, Alan BMC Genomics Research Article BACKGROUND: Transcription initiation regulation is mediated by sequence-specific interactions between DNA-binding proteins (transcription factors) and cis-elements, where BRE, TATA, INR, DPE and MTE motifs constitute canonical core motifs for basal transcription initiation of genes. Accurate identification of transcription start site (TSS) and their corresponding promoter regions is critical for delineation of these motifs. To this end, the genome scale analysis of core promoter architecture in insects has been confined to Drosophila. The recently sequenced Tsetse fly genome provides a unique opportunity to analyze transcription initiation regulation machinery in blood-feeding insects. RESULTS: A computational method for identification of TSS in newly sequenced Tsetse fly genome was evaluated, using TSS seq tags sampled from two developmental stages namely; larvae and pupae. There were 3134 tag clusters among which 45.4 % (1424) of the tag clusters mapped to first coding exons or their proximal predicted 5′UTR regions and 1.0 % (31) tag clusters mapping to transposons, within a threshold of 100 tags per cluster. These 1393 non transposon-derived core promoters had propensity for AT nucleotides. The −1/+1 and 1/+1 positions in D. melanogaster, and G. m. morsitans had propensity for CA and AA dinucleotides respectively. The 1393 tag clusters comprised narrow promoters (5 %), broad with peak promoters (23 %) and broad without peak promoters (72 %). Two-way motif co-occurrence analysis showed that the MTE-DPE pair is over-represented in broad core promoters. The frequently occurring triplet motifs in all promoter classes are the INR-MTE-DPE, TATA-MTE-DPE and TATA-INR-DPE. Promoters without the TATA motif had higher frequency of the MTE and INR motifs than those observed in Drosophila, where the DPE motif occur more frequently in promoters without TATA motif. Gene ontology terms associated with developmental processes were overrepresented in the narrow and broad with peak promoters. CONCLUSIONS: The study has identified different motif combinations associated with broad promoters in a blood-feeding insect. In the case of TATA-less core promoters, G.m. morsitans uses the MTE to compensate for the lack of a TATA motif. The increasing availability of TSS seq data allows for revision of existing gene annotation datasets with the potential of identifying new transcriptional units. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1921-6) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-22 /pmc/articles/PMC4578606/ /pubmed/26394619 http://dx.doi.org/10.1186/s12864-015-1921-6 Text en © Mwangi et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Mwangi, Sarah
Attardo, Geoffrey
Suzuki, Yutaka
Aksoy, Serap
Christoffels, Alan
TSS seq based core promoter architecture in blood feeding Tsetse fly (Glossina morsitans morsitans) vector of Trypanosomiasis
title TSS seq based core promoter architecture in blood feeding Tsetse fly (Glossina morsitans morsitans) vector of Trypanosomiasis
title_full TSS seq based core promoter architecture in blood feeding Tsetse fly (Glossina morsitans morsitans) vector of Trypanosomiasis
title_fullStr TSS seq based core promoter architecture in blood feeding Tsetse fly (Glossina morsitans morsitans) vector of Trypanosomiasis
title_full_unstemmed TSS seq based core promoter architecture in blood feeding Tsetse fly (Glossina morsitans morsitans) vector of Trypanosomiasis
title_short TSS seq based core promoter architecture in blood feeding Tsetse fly (Glossina morsitans morsitans) vector of Trypanosomiasis
title_sort tss seq based core promoter architecture in blood feeding tsetse fly (glossina morsitans morsitans) vector of trypanosomiasis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4578606/
https://www.ncbi.nlm.nih.gov/pubmed/26394619
http://dx.doi.org/10.1186/s12864-015-1921-6
work_keys_str_mv AT mwangisarah tssseqbasedcorepromoterarchitectureinbloodfeedingtsetseflyglossinamorsitansmorsitansvectoroftrypanosomiasis
AT attardogeoffrey tssseqbasedcorepromoterarchitectureinbloodfeedingtsetseflyglossinamorsitansmorsitansvectoroftrypanosomiasis
AT suzukiyutaka tssseqbasedcorepromoterarchitectureinbloodfeedingtsetseflyglossinamorsitansmorsitansvectoroftrypanosomiasis
AT aksoyserap tssseqbasedcorepromoterarchitectureinbloodfeedingtsetseflyglossinamorsitansmorsitansvectoroftrypanosomiasis
AT christoffelsalan tssseqbasedcorepromoterarchitectureinbloodfeedingtsetseflyglossinamorsitansmorsitansvectoroftrypanosomiasis