Cargando…
Analysis of nested alternate open reading frames and their encoded proteins
Transcriptional and post-transcriptional mechanisms diversify the proteome beyond gene number, while maintaining a sequence relationship between original and altered proteins. A new mechanism breaks this paradigm, generating novel proteins by translating alternative open reading frames (Alt-ORFs) wi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580016/ https://www.ncbi.nlm.nih.gov/pubmed/36267124 http://dx.doi.org/10.1093/nargab/lqac076 |
_version_ | 1784812301585481728 |
---|---|
author | Vasu, Kommireddy Khan, Debjit Ramachandiran, Iyappan Blankenberg, Daniel Fox, Paul L |
author_facet | Vasu, Kommireddy Khan, Debjit Ramachandiran, Iyappan Blankenberg, Daniel Fox, Paul L |
author_sort | Vasu, Kommireddy |
collection | PubMed |
description | Transcriptional and post-transcriptional mechanisms diversify the proteome beyond gene number, while maintaining a sequence relationship between original and altered proteins. A new mechanism breaks this paradigm, generating novel proteins by translating alternative open reading frames (Alt-ORFs) within canonical host mRNAs. Uniquely, ‘alt-proteins’ lack sequence homology with host ORF-derived proteins. We show global amino acid frequencies, and consequent biochemical characteristics of Alt-ORFs nested within host ORFs (nAlt-ORFs), are genetically-driven, and predicted by summation of frequencies of hundreds of encompassing host codon-pairs. Analysis of 101 human nAlt-ORFs of length ≥150 codons confirms the theoretical predictions, revealing an extraordinarily high median isoelectric point (pI) of 11.68, due to anomalous charged amino acid levels. Also, nAlt-ORF proteins exhibit a >2-fold preference for reading frame 2 versus 3, predicted mitochondrial and nuclear localization, and elevated codon adaptation index indicative of natural selection. Our results provide a theoretical and conceptual framework for exploration of these largely unannotated, but potentially significant, alternative ORFs and their encoded proteins. |
format | Online Article Text |
id | pubmed-9580016 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-95800162022-10-19 Analysis of nested alternate open reading frames and their encoded proteins Vasu, Kommireddy Khan, Debjit Ramachandiran, Iyappan Blankenberg, Daniel Fox, Paul L NAR Genom Bioinform Standard Article Transcriptional and post-transcriptional mechanisms diversify the proteome beyond gene number, while maintaining a sequence relationship between original and altered proteins. A new mechanism breaks this paradigm, generating novel proteins by translating alternative open reading frames (Alt-ORFs) within canonical host mRNAs. Uniquely, ‘alt-proteins’ lack sequence homology with host ORF-derived proteins. We show global amino acid frequencies, and consequent biochemical characteristics of Alt-ORFs nested within host ORFs (nAlt-ORFs), are genetically-driven, and predicted by summation of frequencies of hundreds of encompassing host codon-pairs. Analysis of 101 human nAlt-ORFs of length ≥150 codons confirms the theoretical predictions, revealing an extraordinarily high median isoelectric point (pI) of 11.68, due to anomalous charged amino acid levels. Also, nAlt-ORF proteins exhibit a >2-fold preference for reading frame 2 versus 3, predicted mitochondrial and nuclear localization, and elevated codon adaptation index indicative of natural selection. Our results provide a theoretical and conceptual framework for exploration of these largely unannotated, but potentially significant, alternative ORFs and their encoded proteins. Oxford University Press 2022-10-19 /pmc/articles/PMC9580016/ /pubmed/36267124 http://dx.doi.org/10.1093/nargab/lqac076 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Standard Article Vasu, Kommireddy Khan, Debjit Ramachandiran, Iyappan Blankenberg, Daniel Fox, Paul L Analysis of nested alternate open reading frames and their encoded proteins |
title | Analysis of nested alternate open reading frames and their encoded proteins |
title_full | Analysis of nested alternate open reading frames and their encoded proteins |
title_fullStr | Analysis of nested alternate open reading frames and their encoded proteins |
title_full_unstemmed | Analysis of nested alternate open reading frames and their encoded proteins |
title_short | Analysis of nested alternate open reading frames and their encoded proteins |
title_sort | analysis of nested alternate open reading frames and their encoded proteins |
topic | Standard Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580016/ https://www.ncbi.nlm.nih.gov/pubmed/36267124 http://dx.doi.org/10.1093/nargab/lqac076 |
work_keys_str_mv | AT vasukommireddy analysisofnestedalternateopenreadingframesandtheirencodedproteins AT khandebjit analysisofnestedalternateopenreadingframesandtheirencodedproteins AT ramachandiraniyappan analysisofnestedalternateopenreadingframesandtheirencodedproteins AT blankenbergdaniel analysisofnestedalternateopenreadingframesandtheirencodedproteins AT foxpaull analysisofnestedalternateopenreadingframesandtheirencodedproteins |