Cargando…

A comprehensive catalog of predicted functional upstream open reading frames in humans

Upstream open reading frames (uORFs) latent in mRNA transcripts are thought to modify translation of coding sequences by altering ribosome activity. Not all uORFs are thought to be active in such a process. To estimate the impact of uORFs on the regulation of translation in humans, we first circumsc...

Descripción completa

Detalles Bibliográficos
Autores principales: McGillivray, Patrick, Ault, Russell, Pawashe, Mayur, Kitchen, Robert, Balasubramanian, Suganthi, Gerstein, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6283423/
https://www.ncbi.nlm.nih.gov/pubmed/29562350
http://dx.doi.org/10.1093/nar/gky188
_version_ 1783379164327313408
author McGillivray, Patrick
Ault, Russell
Pawashe, Mayur
Kitchen, Robert
Balasubramanian, Suganthi
Gerstein, Mark
author_facet McGillivray, Patrick
Ault, Russell
Pawashe, Mayur
Kitchen, Robert
Balasubramanian, Suganthi
Gerstein, Mark
author_sort McGillivray, Patrick
collection PubMed
description Upstream open reading frames (uORFs) latent in mRNA transcripts are thought to modify translation of coding sequences by altering ribosome activity. Not all uORFs are thought to be active in such a process. To estimate the impact of uORFs on the regulation of translation in humans, we first circumscribed the universe of all possible uORFs based on coding gene sequence motifs and identified 1.3 million unique uORFs. To determine which of these are likely to be biologically relevant, we built a simple Bayesian classifier using 89 attributes of uORFs labeled as active in ribosome profiling experiments. This allowed us to extrapolate to a comprehensive catalog of likely functional uORFs. We validated our predictions using in vivo protein levels and ribosome occupancy from 46 individuals. This is a substantially larger catalog of functional uORFs than has previously been reported. Our ranked list of likely active uORFs allows researchers to test their hypotheses regarding the role of uORFs in health and disease. We demonstrate several examples of biological interest through the application of our catalog to somatic mutations in cancer and disease-associated germline variants in humans.
format Online
Article
Text
id pubmed-6283423
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-62834232018-12-11 A comprehensive catalog of predicted functional upstream open reading frames in humans McGillivray, Patrick Ault, Russell Pawashe, Mayur Kitchen, Robert Balasubramanian, Suganthi Gerstein, Mark Nucleic Acids Res Computational Biology Upstream open reading frames (uORFs) latent in mRNA transcripts are thought to modify translation of coding sequences by altering ribosome activity. Not all uORFs are thought to be active in such a process. To estimate the impact of uORFs on the regulation of translation in humans, we first circumscribed the universe of all possible uORFs based on coding gene sequence motifs and identified 1.3 million unique uORFs. To determine which of these are likely to be biologically relevant, we built a simple Bayesian classifier using 89 attributes of uORFs labeled as active in ribosome profiling experiments. This allowed us to extrapolate to a comprehensive catalog of likely functional uORFs. We validated our predictions using in vivo protein levels and ribosome occupancy from 46 individuals. This is a substantially larger catalog of functional uORFs than has previously been reported. Our ranked list of likely active uORFs allows researchers to test their hypotheses regarding the role of uORFs in health and disease. We demonstrate several examples of biological interest through the application of our catalog to somatic mutations in cancer and disease-associated germline variants in humans. Oxford University Press 2018-04-20 2018-03-19 /pmc/articles/PMC6283423/ /pubmed/29562350 http://dx.doi.org/10.1093/nar/gky188 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
McGillivray, Patrick
Ault, Russell
Pawashe, Mayur
Kitchen, Robert
Balasubramanian, Suganthi
Gerstein, Mark
A comprehensive catalog of predicted functional upstream open reading frames in humans
title A comprehensive catalog of predicted functional upstream open reading frames in humans
title_full A comprehensive catalog of predicted functional upstream open reading frames in humans
title_fullStr A comprehensive catalog of predicted functional upstream open reading frames in humans
title_full_unstemmed A comprehensive catalog of predicted functional upstream open reading frames in humans
title_short A comprehensive catalog of predicted functional upstream open reading frames in humans
title_sort comprehensive catalog of predicted functional upstream open reading frames in humans
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6283423/
https://www.ncbi.nlm.nih.gov/pubmed/29562350
http://dx.doi.org/10.1093/nar/gky188
work_keys_str_mv AT mcgillivraypatrick acomprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans
AT aultrussell acomprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans
AT pawashemayur acomprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans
AT kitchenrobert acomprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans
AT balasubramaniansuganthi acomprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans
AT gersteinmark acomprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans
AT mcgillivraypatrick comprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans
AT aultrussell comprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans
AT pawashemayur comprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans
AT kitchenrobert comprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans
AT balasubramaniansuganthi comprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans
AT gersteinmark comprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans