Cargando…

Analysis of nested alternate open reading frames and their encoded proteins

Transcriptional and post-transcriptional mechanisms diversify the proteome beyond gene number, while maintaining a sequence relationship between original and altered proteins. A new mechanism breaks this paradigm, generating novel proteins by translating alternative open reading frames (Alt-ORFs) wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Vasu, Kommireddy, Khan, Debjit, Ramachandiran, Iyappan, Blankenberg, Daniel, Fox, Paul L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580016/
https://www.ncbi.nlm.nih.gov/pubmed/36267124
http://dx.doi.org/10.1093/nargab/lqac076
_version_ 1784812301585481728
author Vasu, Kommireddy
Khan, Debjit
Ramachandiran, Iyappan
Blankenberg, Daniel
Fox, Paul L
author_facet Vasu, Kommireddy
Khan, Debjit
Ramachandiran, Iyappan
Blankenberg, Daniel
Fox, Paul L
author_sort Vasu, Kommireddy
collection PubMed
description Transcriptional and post-transcriptional mechanisms diversify the proteome beyond gene number, while maintaining a sequence relationship between original and altered proteins. A new mechanism breaks this paradigm, generating novel proteins by translating alternative open reading frames (Alt-ORFs) within canonical host mRNAs. Uniquely, ‘alt-proteins’ lack sequence homology with host ORF-derived proteins. We show global amino acid frequencies, and consequent biochemical characteristics of Alt-ORFs nested within host ORFs (nAlt-ORFs), are genetically-driven, and predicted by summation of frequencies of hundreds of encompassing host codon-pairs. Analysis of 101 human nAlt-ORFs of length ≥150 codons confirms the theoretical predictions, revealing an extraordinarily high median isoelectric point (pI) of 11.68, due to anomalous charged amino acid levels. Also, nAlt-ORF proteins exhibit a >2-fold preference for reading frame 2 versus 3, predicted mitochondrial and nuclear localization, and elevated codon adaptation index indicative of natural selection. Our results provide a theoretical and conceptual framework for exploration of these largely unannotated, but potentially significant, alternative ORFs and their encoded proteins.
format Online
Article
Text
id pubmed-9580016
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-95800162022-10-19 Analysis of nested alternate open reading frames and their encoded proteins Vasu, Kommireddy Khan, Debjit Ramachandiran, Iyappan Blankenberg, Daniel Fox, Paul L NAR Genom Bioinform Standard Article Transcriptional and post-transcriptional mechanisms diversify the proteome beyond gene number, while maintaining a sequence relationship between original and altered proteins. A new mechanism breaks this paradigm, generating novel proteins by translating alternative open reading frames (Alt-ORFs) within canonical host mRNAs. Uniquely, ‘alt-proteins’ lack sequence homology with host ORF-derived proteins. We show global amino acid frequencies, and consequent biochemical characteristics of Alt-ORFs nested within host ORFs (nAlt-ORFs), are genetically-driven, and predicted by summation of frequencies of hundreds of encompassing host codon-pairs. Analysis of 101 human nAlt-ORFs of length ≥150 codons confirms the theoretical predictions, revealing an extraordinarily high median isoelectric point (pI) of 11.68, due to anomalous charged amino acid levels. Also, nAlt-ORF proteins exhibit a >2-fold preference for reading frame 2 versus 3, predicted mitochondrial and nuclear localization, and elevated codon adaptation index indicative of natural selection. Our results provide a theoretical and conceptual framework for exploration of these largely unannotated, but potentially significant, alternative ORFs and their encoded proteins. Oxford University Press 2022-10-19 /pmc/articles/PMC9580016/ /pubmed/36267124 http://dx.doi.org/10.1093/nargab/lqac076 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Standard Article
Vasu, Kommireddy
Khan, Debjit
Ramachandiran, Iyappan
Blankenberg, Daniel
Fox, Paul L
Analysis of nested alternate open reading frames and their encoded proteins
title Analysis of nested alternate open reading frames and their encoded proteins
title_full Analysis of nested alternate open reading frames and their encoded proteins
title_fullStr Analysis of nested alternate open reading frames and their encoded proteins
title_full_unstemmed Analysis of nested alternate open reading frames and their encoded proteins
title_short Analysis of nested alternate open reading frames and their encoded proteins
title_sort analysis of nested alternate open reading frames and their encoded proteins
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580016/
https://www.ncbi.nlm.nih.gov/pubmed/36267124
http://dx.doi.org/10.1093/nargab/lqac076
work_keys_str_mv AT vasukommireddy analysisofnestedalternateopenreadingframesandtheirencodedproteins
AT khandebjit analysisofnestedalternateopenreadingframesandtheirencodedproteins
AT ramachandiraniyappan analysisofnestedalternateopenreadingframesandtheirencodedproteins
AT blankenbergdaniel analysisofnestedalternateopenreadingframesandtheirencodedproteins
AT foxpaull analysisofnestedalternateopenreadingframesandtheirencodedproteins