Cargando…

Thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling

Accurate annotation of protein coding regions is essential for understanding how genetic information is translated into function. We describe riboHMM, a new method that uses ribosome footprint data to accurately infer translated sequences. Applying riboHMM to human lymphoblastoid cell lines, we iden...

Descripción completa

Detalles Bibliográficos
Autores principales: Raj, Anil, Wang, Sidney H, Shim, Heejung, Harpak, Arbel, Li, Yang I, Engelmann, Brett, Stephens, Matthew, Gilad, Yoav, Pritchard, Jonathan K
Formato: Online Artículo Texto
Lenguaje:English
Publicado: eLife Sciences Publications, Ltd 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4940163/
https://www.ncbi.nlm.nih.gov/pubmed/27232982
http://dx.doi.org/10.7554/eLife.13328
_version_ 1782442110336630784
author Raj, Anil
Wang, Sidney H
Shim, Heejung
Harpak, Arbel
Li, Yang I
Engelmann, Brett
Stephens, Matthew
Gilad, Yoav
Pritchard, Jonathan K
author_facet Raj, Anil
Wang, Sidney H
Shim, Heejung
Harpak, Arbel
Li, Yang I
Engelmann, Brett
Stephens, Matthew
Gilad, Yoav
Pritchard, Jonathan K
author_sort Raj, Anil
collection PubMed
description Accurate annotation of protein coding regions is essential for understanding how genetic information is translated into function. We describe riboHMM, a new method that uses ribosome footprint data to accurately infer translated sequences. Applying riboHMM to human lymphoblastoid cell lines, we identified 7273 novel coding sequences, including 2442 translated upstream open reading frames. We observed an enrichment of footprints at inferred initiation sites after drug-induced arrest of translation initiation, validating many of the novel coding sequences. The novel proteins exhibit significant selective constraint in the inferred reading frames, suggesting that many are functional. Moreover, ~40% of bicistronic transcripts showed negative correlation in the translation levels of their two coding sequences, suggesting a potential regulatory role for these novel regions. Despite known limitations of mass spectrometry to detect protein expressed at low level, we estimated a 14% validation rate. Our work significantly expands the set of known coding regions in humans. DOI: http://dx.doi.org/10.7554/eLife.13328.001
format Online
Article
Text
id pubmed-4940163
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher eLife Sciences Publications, Ltd
record_format MEDLINE/PubMed
spelling pubmed-49401632016-07-13 Thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling Raj, Anil Wang, Sidney H Shim, Heejung Harpak, Arbel Li, Yang I Engelmann, Brett Stephens, Matthew Gilad, Yoav Pritchard, Jonathan K eLife Computational and Systems Biology Accurate annotation of protein coding regions is essential for understanding how genetic information is translated into function. We describe riboHMM, a new method that uses ribosome footprint data to accurately infer translated sequences. Applying riboHMM to human lymphoblastoid cell lines, we identified 7273 novel coding sequences, including 2442 translated upstream open reading frames. We observed an enrichment of footprints at inferred initiation sites after drug-induced arrest of translation initiation, validating many of the novel coding sequences. The novel proteins exhibit significant selective constraint in the inferred reading frames, suggesting that many are functional. Moreover, ~40% of bicistronic transcripts showed negative correlation in the translation levels of their two coding sequences, suggesting a potential regulatory role for these novel regions. Despite known limitations of mass spectrometry to detect protein expressed at low level, we estimated a 14% validation rate. Our work significantly expands the set of known coding regions in humans. DOI: http://dx.doi.org/10.7554/eLife.13328.001 eLife Sciences Publications, Ltd 2016-05-27 /pmc/articles/PMC4940163/ /pubmed/27232982 http://dx.doi.org/10.7554/eLife.13328 Text en © 2016, Raj et al http://creativecommons.org/licenses/by/4.0/ This article is distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use and redistribution provided that the original author and source are credited.
spellingShingle Computational and Systems Biology
Raj, Anil
Wang, Sidney H
Shim, Heejung
Harpak, Arbel
Li, Yang I
Engelmann, Brett
Stephens, Matthew
Gilad, Yoav
Pritchard, Jonathan K
Thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling
title Thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling
title_full Thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling
title_fullStr Thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling
title_full_unstemmed Thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling
title_short Thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling
title_sort thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling
topic Computational and Systems Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4940163/
https://www.ncbi.nlm.nih.gov/pubmed/27232982
http://dx.doi.org/10.7554/eLife.13328
work_keys_str_mv AT rajanil thousandsofnoveltranslatedopenreadingframesinhumansinferredbyribosomefootprintprofiling
AT wangsidneyh thousandsofnoveltranslatedopenreadingframesinhumansinferredbyribosomefootprintprofiling
AT shimheejung thousandsofnoveltranslatedopenreadingframesinhumansinferredbyribosomefootprintprofiling
AT harpakarbel thousandsofnoveltranslatedopenreadingframesinhumansinferredbyribosomefootprintprofiling
AT liyangi thousandsofnoveltranslatedopenreadingframesinhumansinferredbyribosomefootprintprofiling
AT engelmannbrett thousandsofnoveltranslatedopenreadingframesinhumansinferredbyribosomefootprintprofiling
AT stephensmatthew thousandsofnoveltranslatedopenreadingframesinhumansinferredbyribosomefootprintprofiling
AT giladyoav thousandsofnoveltranslatedopenreadingframesinhumansinferredbyribosomefootprintprofiling
AT pritchardjonathank thousandsofnoveltranslatedopenreadingframesinhumansinferredbyribosomefootprintprofiling