
Thousands of human non-AUG extended proteoforms lack evidence of evolutionary selection among mammals

The synthesis of most proteins begins at AUG codons, yet a small number of non-AUG initiated proteoforms are also known. Here we analyse a large number of publicly available Ribo-seq datasets to identify novel, previously uncharacterised non-AUG proteoforms using Trips-Viz implementation of a novel...


Bibliographic Details
Main Authors: Fedorova, Alla D., Kiniry, Stephen J., Andreev, Dmitry E., Mudge, Jonathan M., Baranov, Pavel V.
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9789052/
https://www.ncbi.nlm.nih.gov/pubmed/36564405
http://dx.doi.org/10.1038/s41467-022-35595-6
author Fedorova, Alla D.
Kiniry, Stephen J.
Andreev, Dmitry E.
Mudge, Jonathan M.
Baranov, Pavel V.
collection PubMed
description The synthesis of most proteins begins at AUG codons, yet a small number of non-AUG initiated proteoforms are also known. Here we analyse a large number of publicly available Ribo-seq datasets to identify novel, previously uncharacterised non-AUG proteoforms using Trips-Viz implementation of a novel algorithm for detecting translated ORFs. In parallel we analyse genomic alignment of 120 mammals to identify evidence of protein coding evolution in sequences encoding potential extensions. Unexpectedly we find that the number of non-AUG proteoforms identified with ribosome profiling data greatly exceeds those with strong phylogenetic support suggesting their recent evolution. Our study argues that the protein coding potential of human genome greatly exceeds that detectable through comparative genomics and exposes the existence of multiple proteins encoded by the same genomic loci.
format Online
Article
Text
id pubmed-9789052
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-9789052 2022-12-25 Thousands of human non-AUG extended proteoforms lack evidence of evolutionary selection among mammals Fedorova, Alla D. Kiniry, Stephen J. Andreev, Dmitry E. Mudge, Jonathan M. Baranov, Pavel V. Nat Commun Article The synthesis of most proteins begins at AUG codons, yet a small number of non-AUG initiated proteoforms are also known. Here we analyse a large number of publicly available Ribo-seq datasets to identify novel, previously uncharacterised non-AUG proteoforms using Trips-Viz implementation of a novel algorithm for detecting translated ORFs. In parallel we analyse genomic alignment of 120 mammals to identify evidence of protein coding evolution in sequences encoding potential extensions. Unexpectedly we find that the number of non-AUG proteoforms identified with ribosome profiling data greatly exceeds those with strong phylogenetic support suggesting their recent evolution. Our study argues that the protein coding potential of human genome greatly exceeds that detectable through comparative genomics and exposes the existence of multiple proteins encoded by the same genomic loci. Nature Publishing Group UK 2022-12-23 /pmc/articles/PMC9789052/ /pubmed/36564405 http://dx.doi.org/10.1038/s41467-022-35595-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material.
If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit https://creativecommons.org/licenses/by/4.0/.
title Thousands of human non-AUG extended proteoforms lack evidence of evolutionary selection among mammals
topic Article