Cargando…

Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences

The 3′ UTRs of eukaryotic genes participate in a variety of post-transcriptional (and some transcriptional) regulatory interactions. Some of these interactions are well characterised, but an undetermined number remain to be discovered. While some regulatory sequences in 3′ UTRs may be conserved over...

Descripción completa

Detalles Bibliográficos
Autores principales: Algama, Manjula, Oldmeadow, Christopher, Tasker, Edward, Mengersen, Kerrie, Keith, Jonathan M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4019593/
https://www.ncbi.nlm.nih.gov/pubmed/24824035
http://dx.doi.org/10.1371/journal.pone.0097336
_version_ 1782480192932937728
author Algama, Manjula
Oldmeadow, Christopher
Tasker, Edward
Mengersen, Kerrie
Keith, Jonathan M.
author_facet Algama, Manjula
Oldmeadow, Christopher
Tasker, Edward
Mengersen, Kerrie
Keith, Jonathan M.
author_sort Algama, Manjula
collection PubMed
description The 3′ UTRs of eukaryotic genes participate in a variety of post-transcriptional (and some transcriptional) regulatory interactions. Some of these interactions are well characterised, but an undetermined number remain to be discovered. While some regulatory sequences in 3′ UTRs may be conserved over long evolutionary time scales, others may have only ephemeral functional significance as regulatory profiles respond to changing selective pressures. Here we propose a sensitive segmentation methodology for investigating patterns of composition and conservation in 3′ UTRs based on comparison of closely related species. We describe encodings of pairwise and three-way alignments integrating information about conservation, GC content and transition/transversion ratios and apply the method to three closely related Drosophila species: D. melanogaster, D. simulans and D. yakuba. Incorporating multiple data types greatly increased the number of segment classes identified compared to similar methods based on conservation or GC content alone. We propose that the number of segments and number of types of segment identified by the method can be used as proxies for functional complexity. Our main finding is that the number of segments and segment classes identified in 3′ UTRs is greater than in the same length of protein-coding sequence, suggesting greater functional complexity in 3′ UTRs. There is thus a need for sustained and extensive efforts by bioinformaticians to delineate functional elements in this important genomic fraction. C code, data and results are available upon request.
format Online
Article
Text
id pubmed-4019593
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40195932014-05-16 Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences Algama, Manjula Oldmeadow, Christopher Tasker, Edward Mengersen, Kerrie Keith, Jonathan M. PLoS One Research Article The 3′ UTRs of eukaryotic genes participate in a variety of post-transcriptional (and some transcriptional) regulatory interactions. Some of these interactions are well characterised, but an undetermined number remain to be discovered. While some regulatory sequences in 3′ UTRs may be conserved over long evolutionary time scales, others may have only ephemeral functional significance as regulatory profiles respond to changing selective pressures. Here we propose a sensitive segmentation methodology for investigating patterns of composition and conservation in 3′ UTRs based on comparison of closely related species. We describe encodings of pairwise and three-way alignments integrating information about conservation, GC content and transition/transversion ratios and apply the method to three closely related Drosophila species: D. melanogaster, D. simulans and D. yakuba. Incorporating multiple data types greatly increased the number of segment classes identified compared to similar methods based on conservation or GC content alone. We propose that the number of segments and number of types of segment identified by the method can be used as proxies for functional complexity. Our main finding is that the number of segments and segment classes identified in 3′ UTRs is greater than in the same length of protein-coding sequence, suggesting greater functional complexity in 3′ UTRs. There is thus a need for sustained and extensive efforts by bioinformaticians to delineate functional elements in this important genomic fraction. C code, data and results are available upon request. Public Library of Science 2014-05-13 /pmc/articles/PMC4019593/ /pubmed/24824035 http://dx.doi.org/10.1371/journal.pone.0097336 Text en © 2014 Algama et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Algama, Manjula
Oldmeadow, Christopher
Tasker, Edward
Mengersen, Kerrie
Keith, Jonathan M.
Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences
title Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences
title_full Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences
title_fullStr Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences
title_full_unstemmed Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences
title_short Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences
title_sort drosophila 3′ utrs are more complex than protein-coding sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4019593/
https://www.ncbi.nlm.nih.gov/pubmed/24824035
http://dx.doi.org/10.1371/journal.pone.0097336
work_keys_str_mv AT algamamanjula drosophila3utrsaremorecomplexthanproteincodingsequences
AT oldmeadowchristopher drosophila3utrsaremorecomplexthanproteincodingsequences
AT taskeredward drosophila3utrsaremorecomplexthanproteincodingsequences
AT mengersenkerrie drosophila3utrsaremorecomplexthanproteincodingsequences
AT keithjonathanm drosophila3utrsaremorecomplexthanproteincodingsequences