Tying Down Loose Ends in the Chlamydomonas Genome: Functional Significance of Abundant Upstream Open Reading Frames
The Chlamydomonas genome has been sequenced, assembled, and annotated to produce a rich resource for genetics and molecular biology in this well-studied model organism. The annotated genome is very rich in open reading frames upstream of the annotated coding sequence (‘uORFs’): almost three quarters...
Main author: | Cross, Frederick R. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Genetics Society of America 2015 |
Subjects: | Investigations |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4751561/ https://www.ncbi.nlm.nih.gov/pubmed/26701783 http://dx.doi.org/10.1534/g3.115.023119 |
_version_ | 1782415607232200704 |
---|---|
author | Cross, Frederick R. |
author_facet | Cross, Frederick R. |
author_sort | Cross, Frederick R. |
collection | PubMed |
description | The Chlamydomonas genome has been sequenced, assembled, and annotated to produce a rich resource for genetics and molecular biology in this well-studied model organism. The annotated genome is very rich in open reading frames upstream of the annotated coding sequence (‘uORFs’): almost three quarters of the assigned transcripts have at least one uORF, and frequently more than one. This is problematic with respect to the standard ‘scanning’ model for eukaryotic translation initiation. These uORFs can be grouped into three classes: class 1, initiating in-frame with the coding sequence (CDS) (thus providing a potential in-frame N-terminal extension); class 2, initiating in the 5′ untranslated sequences (5UT) and terminating out-of-frame in the CDS; and class 3, initiating and terminating within the 5UT. Multiple bioinformatics criteria (including analysis of Kozak consensus sequence agreement and BLASTP comparisons to the closely related Volvox genome, and statistical comparison to CDS and to random sequence controls) indicate that of ∼4000 class 1 uORFs, approximately half are likely in vivo translation initiation sites. The proposed resulting N-terminal extensions in many cases will sharply alter the predicted biochemical properties of the encoded proteins. These results suggest significant modifications in ∼2000 of the ∼20,000 transcript models with respect to translation initiation and encoded peptides. In contrast, class 2 uORFs may be subject to purifying selection, and the existent ones (surviving selection) are likely inefficiently translated. Class 3 uORFs are found in more than half of transcripts, frequently multiple times per transcript; however, they are remarkably similar to random sequence expectations with respect to size, number, and composition, and therefore may in most cases be selectively neutral. |
format | Online Article Text |
id | pubmed-4751561 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-4751561 2016-02-12 Tying Down Loose Ends in the Chlamydomonas Genome: Functional Significance of Abundant Upstream Open Reading Frames Cross, Frederick R. G3 (Bethesda) Investigations Genetics Society of America 2015-12-23 /pmc/articles/PMC4751561/ /pubmed/26701783 http://dx.doi.org/10.1534/g3.115.023119 Text en Copyright © 2016 Cross http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigations Cross, Frederick R. Tying Down Loose Ends in the Chlamydomonas Genome: Functional Significance of Abundant Upstream Open Reading Frames |
title | Tying Down Loose Ends in the Chlamydomonas Genome: Functional Significance of Abundant Upstream Open Reading Frames |
title_full | Tying Down Loose Ends in the Chlamydomonas Genome: Functional Significance of Abundant Upstream Open Reading Frames |
title_fullStr | Tying Down Loose Ends in the Chlamydomonas Genome: Functional Significance of Abundant Upstream Open Reading Frames |
title_full_unstemmed | Tying Down Loose Ends in the Chlamydomonas Genome: Functional Significance of Abundant Upstream Open Reading Frames |
title_short | Tying Down Loose Ends in the Chlamydomonas Genome: Functional Significance of Abundant Upstream Open Reading Frames |
title_sort | tying down loose ends in the chlamydomonas genome: functional significance of abundant upstream open reading frames |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4751561/ https://www.ncbi.nlm.nih.gov/pubmed/26701783 http://dx.doi.org/10.1534/g3.115.023119 |
work_keys_str_mv | AT crossfrederickr tyingdownlooseendsinthechlamydomonasgenomefunctionalsignificanceofabundantupstreamopenreadingframes |
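
The three uORF classes defined in the description can be illustrated with a short sketch. This is not the pipeline used in the article; it is a minimal example under stated assumptions: the function name `classify_uorfs` and its inputs are hypothetical, only ATG start codons are considered, the standard stop codons TAA, TAG, and TGA apply, and a uORF counts as class 3 only when its stop codon lies entirely within the 5′ untranslated sequence.

```python
# Hypothetical illustration of the three uORF classes from the abstract.
# Assumptions (not from the article): ATG-only starts, standard stop codons,
# and a stop codon straddling the UTR/CDS boundary counts as ending in the CDS.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def classify_uorfs(utr5: str, cds: str) -> list[tuple[int, int]]:
    """Return (start_index, uorf_class) for every ATG lying in the 5' UTR."""
    utr5, cds = utr5.upper(), cds.upper()
    transcript = utr5 + cds
    cds_start = len(utr5)  # index of the annotated start codon
    results = []

    # The candidate ATG must lie entirely inside the 5' UTR.
    for start in range(cds_start - 2):
        if transcript[start:start + 3] != "ATG":
            continue

        # First stop codon in the same reading frame as this ATG, if any.
        stop = next(
            (pos for pos in range(start + 3, len(transcript) - 2, 3)
             if transcript[pos:pos + 3] in STOP_CODONS),
            None,
        )

        if stop is not None and stop + 3 <= cds_start:
            uorf_class = 3  # initiates and terminates within the 5' UTR
        elif (cds_start - start) % 3 == 0:
            uorf_class = 1  # in-frame with the CDS: N-terminal extension
        else:
            uorf_class = 2  # out-of-frame, runs past the annotated start
        results.append((start, uorf_class))

    return results


if __name__ == "__main__":
    # Toy sequences chosen to contain one uORF of each class.
    print(classify_uorfs("ATGTAACATGGATGCCC", "ATGGCTGACAAATAA"))
```

With these toy sequences, the ATG at index 0 is followed immediately by TAA (class 3), the ATG at index 7 is out of frame and reaches a TGA inside the coding sequence (class 2), and the ATG at index 11 reads in frame into the annotated start codon (class 1). An out-of-frame uORF with no stop codon anywhere in the transcript is also reported as class 2 here, which is a simplification.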