Cargando…
Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences
The 3′ UTRs of eukaryotic genes participate in a variety of post-transcriptional (and some transcriptional) regulatory interactions. Some of these interactions are well characterised, but an undetermined number remain to be discovered. While some regulatory sequences in 3′ UTRs may be conserved over...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4019593/ https://www.ncbi.nlm.nih.gov/pubmed/24824035 http://dx.doi.org/10.1371/journal.pone.0097336 |
_version_ | 1782480192932937728 |
---|---|
author | Algama, Manjula Oldmeadow, Christopher Tasker, Edward Mengersen, Kerrie Keith, Jonathan M. |
author_facet | Algama, Manjula Oldmeadow, Christopher Tasker, Edward Mengersen, Kerrie Keith, Jonathan M. |
author_sort | Algama, Manjula |
collection | PubMed |
description | The 3′ UTRs of eukaryotic genes participate in a variety of post-transcriptional (and some transcriptional) regulatory interactions. Some of these interactions are well characterised, but an undetermined number remain to be discovered. While some regulatory sequences in 3′ UTRs may be conserved over long evolutionary time scales, others may have only ephemeral functional significance as regulatory profiles respond to changing selective pressures. Here we propose a sensitive segmentation methodology for investigating patterns of composition and conservation in 3′ UTRs based on comparison of closely related species. We describe encodings of pairwise and three-way alignments integrating information about conservation, GC content and transition/transversion ratios and apply the method to three closely related Drosophila species: D. melanogaster, D. simulans and D. yakuba. Incorporating multiple data types greatly increased the number of segment classes identified compared to similar methods based on conservation or GC content alone. We propose that the number of segments and number of types of segment identified by the method can be used as proxies for functional complexity. Our main finding is that the number of segments and segment classes identified in 3′ UTRs is greater than in the same length of protein-coding sequence, suggesting greater functional complexity in 3′ UTRs. There is thus a need for sustained and extensive efforts by bioinformaticians to delineate functional elements in this important genomic fraction. C code, data and results are available upon request. |
format | Online Article Text |
id | pubmed-4019593 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-40195932014-05-16 Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences Algama, Manjula Oldmeadow, Christopher Tasker, Edward Mengersen, Kerrie Keith, Jonathan M. PLoS One Research Article The 3′ UTRs of eukaryotic genes participate in a variety of post-transcriptional (and some transcriptional) regulatory interactions. Some of these interactions are well characterised, but an undetermined number remain to be discovered. While some regulatory sequences in 3′ UTRs may be conserved over long evolutionary time scales, others may have only ephemeral functional significance as regulatory profiles respond to changing selective pressures. Here we propose a sensitive segmentation methodology for investigating patterns of composition and conservation in 3′ UTRs based on comparison of closely related species. We describe encodings of pairwise and three-way alignments integrating information about conservation, GC content and transition/transversion ratios and apply the method to three closely related Drosophila species: D. melanogaster, D. simulans and D. yakuba. Incorporating multiple data types greatly increased the number of segment classes identified compared to similar methods based on conservation or GC content alone. We propose that the number of segments and number of types of segment identified by the method can be used as proxies for functional complexity. Our main finding is that the number of segments and segment classes identified in 3′ UTRs is greater than in the same length of protein-coding sequence, suggesting greater functional complexity in 3′ UTRs. There is thus a need for sustained and extensive efforts by bioinformaticians to delineate functional elements in this important genomic fraction. C code, data and results are available upon request. Public Library of Science 2014-05-13 /pmc/articles/PMC4019593/ /pubmed/24824035 http://dx.doi.org/10.1371/journal.pone.0097336 Text en © 2014 Algama et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Algama, Manjula Oldmeadow, Christopher Tasker, Edward Mengersen, Kerrie Keith, Jonathan M. Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences |
title | Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences |
title_full | Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences |
title_fullStr | Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences |
title_full_unstemmed | Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences |
title_short | Drosophila 3′ UTRs Are More Complex than Protein-Coding Sequences |
title_sort | drosophila 3′ utrs are more complex than protein-coding sequences |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4019593/ https://www.ncbi.nlm.nih.gov/pubmed/24824035 http://dx.doi.org/10.1371/journal.pone.0097336 |
work_keys_str_mv | AT algamamanjula drosophila3utrsaremorecomplexthanproteincodingsequences AT oldmeadowchristopher drosophila3utrsaremorecomplexthanproteincodingsequences AT taskeredward drosophila3utrsaremorecomplexthanproteincodingsequences AT mengersenkerrie drosophila3utrsaremorecomplexthanproteincodingsequences AT keithjonathanm drosophila3utrsaremorecomplexthanproteincodingsequences |