Cargando…
A comprehensive catalog of predicted functional upstream open reading frames in humans
Upstream open reading frames (uORFs) latent in mRNA transcripts are thought to modify translation of coding sequences by altering ribosome activity. Not all uORFs are thought to be active in such a process. To estimate the impact of uORFs on the regulation of translation in humans, we first circumsc...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6283423/ https://www.ncbi.nlm.nih.gov/pubmed/29562350 http://dx.doi.org/10.1093/nar/gky188 |
_version_ | 1783379164327313408 |
---|---|
author | McGillivray, Patrick Ault, Russell Pawashe, Mayur Kitchen, Robert Balasubramanian, Suganthi Gerstein, Mark |
author_facet | McGillivray, Patrick Ault, Russell Pawashe, Mayur Kitchen, Robert Balasubramanian, Suganthi Gerstein, Mark |
author_sort | McGillivray, Patrick |
collection | PubMed |
description | Upstream open reading frames (uORFs) latent in mRNA transcripts are thought to modify translation of coding sequences by altering ribosome activity. Not all uORFs are thought to be active in such a process. To estimate the impact of uORFs on the regulation of translation in humans, we first circumscribed the universe of all possible uORFs based on coding gene sequence motifs and identified 1.3 million unique uORFs. To determine which of these are likely to be biologically relevant, we built a simple Bayesian classifier using 89 attributes of uORFs labeled as active in ribosome profiling experiments. This allowed us to extrapolate to a comprehensive catalog of likely functional uORFs. We validated our predictions using in vivo protein levels and ribosome occupancy from 46 individuals. This is a substantially larger catalog of functional uORFs than has previously been reported. Our ranked list of likely active uORFs allows researchers to test their hypotheses regarding the role of uORFs in health and disease. We demonstrate several examples of biological interest through the application of our catalog to somatic mutations in cancer and disease-associated germline variants in humans. |
format | Online Article Text |
id | pubmed-6283423 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-62834232018-12-11 A comprehensive catalog of predicted functional upstream open reading frames in humans McGillivray, Patrick Ault, Russell Pawashe, Mayur Kitchen, Robert Balasubramanian, Suganthi Gerstein, Mark Nucleic Acids Res Computational Biology Upstream open reading frames (uORFs) latent in mRNA transcripts are thought to modify translation of coding sequences by altering ribosome activity. Not all uORFs are thought to be active in such a process. To estimate the impact of uORFs on the regulation of translation in humans, we first circumscribed the universe of all possible uORFs based on coding gene sequence motifs and identified 1.3 million unique uORFs. To determine which of these are likely to be biologically relevant, we built a simple Bayesian classifier using 89 attributes of uORFs labeled as active in ribosome profiling experiments. This allowed us to extrapolate to a comprehensive catalog of likely functional uORFs. We validated our predictions using in vivo protein levels and ribosome occupancy from 46 individuals. This is a substantially larger catalog of functional uORFs than has previously been reported. Our ranked list of likely active uORFs allows researchers to test their hypotheses regarding the role of uORFs in health and disease. We demonstrate several examples of biological interest through the application of our catalog to somatic mutations in cancer and disease-associated germline variants in humans. Oxford University Press 2018-04-20 2018-03-19 /pmc/articles/PMC6283423/ /pubmed/29562350 http://dx.doi.org/10.1093/nar/gky188 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology McGillivray, Patrick Ault, Russell Pawashe, Mayur Kitchen, Robert Balasubramanian, Suganthi Gerstein, Mark A comprehensive catalog of predicted functional upstream open reading frames in humans |
title | A comprehensive catalog of predicted functional upstream open reading frames in humans |
title_full | A comprehensive catalog of predicted functional upstream open reading frames in humans |
title_fullStr | A comprehensive catalog of predicted functional upstream open reading frames in humans |
title_full_unstemmed | A comprehensive catalog of predicted functional upstream open reading frames in humans |
title_short | A comprehensive catalog of predicted functional upstream open reading frames in humans |
title_sort | comprehensive catalog of predicted functional upstream open reading frames in humans |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6283423/ https://www.ncbi.nlm.nih.gov/pubmed/29562350 http://dx.doi.org/10.1093/nar/gky188 |
work_keys_str_mv | AT mcgillivraypatrick acomprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans AT aultrussell acomprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans AT pawashemayur acomprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans AT kitchenrobert acomprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans AT balasubramaniansuganthi acomprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans AT gersteinmark acomprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans AT mcgillivraypatrick comprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans AT aultrussell comprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans AT pawashemayur comprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans AT kitchenrobert comprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans AT balasubramaniansuganthi comprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans AT gersteinmark comprehensivecatalogofpredictedfunctionalupstreamopenreadingframesinhumans |