Cargando…

Frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance

Frameshift mutations have been considered of significant importance for the molecular evolution of proteins and their coding genes, while frameshift protein sequences encoded in the alternative reading frames of coding genes have been considered to be meaningless. However, functional frameshifts hav...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xiaolong, Dong, Quanjiang, Chen, Gang, Zhang, Jianye, Liu, Yongqiang, Cai, Yujia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9164415/
https://www.ncbi.nlm.nih.gov/pubmed/35655139
http://dx.doi.org/10.1186/s12864-022-08435-6
_version_ 1784720130043805696
author Wang, Xiaolong
Dong, Quanjiang
Chen, Gang
Zhang, Jianye
Liu, Yongqiang
Cai, Yujia
author_facet Wang, Xiaolong
Dong, Quanjiang
Chen, Gang
Zhang, Jianye
Liu, Yongqiang
Cai, Yujia
author_sort Wang, Xiaolong
collection PubMed
description Frameshift mutations have been considered of significant importance for the molecular evolution of proteins and their coding genes, while frameshift protein sequences encoded in the alternative reading frames of coding genes have been considered to be meaningless. However, functional frameshifts have been found widely existing. It was puzzling how a frameshift protein kept its structure and functionality while substantial changes occurred in its primary amino-acid sequence. This study shows that the similarities among frameshifts and wild types are higher than random similarities and are determined at different levels. Frameshift substitutions are more conservative than random substitutions in the standard genetic code (SGC). The frameshift substitutions score of SGC ranks in the top 2.0-3.5% of alternative genetic codes, showing that SGC is nearly optimal for frameshift tolerance. In many genes and certain genomes, frameshift-resistant codons and codon pairs appear more frequently than expected, suggesting that frameshift tolerance is achieved through not only the optimality of the genetic code but, more importantly, the further optimization of a specific gene or genome through the usages of codons/codon pairs, which sheds light on the role of frameshift mutations in molecular and genomic evolution. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-08435-6.
format Online
Article
Text
id pubmed-9164415
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-91644152022-06-05 Frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance Wang, Xiaolong Dong, Quanjiang Chen, Gang Zhang, Jianye Liu, Yongqiang Cai, Yujia BMC Genomics Research Frameshift mutations have been considered of significant importance for the molecular evolution of proteins and their coding genes, while frameshift protein sequences encoded in the alternative reading frames of coding genes have been considered to be meaningless. However, functional frameshifts have been found widely existing. It was puzzling how a frameshift protein kept its structure and functionality while substantial changes occurred in its primary amino-acid sequence. This study shows that the similarities among frameshifts and wild types are higher than random similarities and are determined at different levels. Frameshift substitutions are more conservative than random substitutions in the standard genetic code (SGC). The frameshift substitutions score of SGC ranks in the top 2.0-3.5% of alternative genetic codes, showing that SGC is nearly optimal for frameshift tolerance. In many genes and certain genomes, frameshift-resistant codons and codon pairs appear more frequently than expected, suggesting that frameshift tolerance is achieved through not only the optimality of the genetic code but, more importantly, the further optimization of a specific gene or genome through the usages of codons/codon pairs, which sheds light on the role of frameshift mutations in molecular and genomic evolution. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-08435-6. BioMed Central 2022-06-02 /pmc/articles/PMC9164415/ /pubmed/35655139 http://dx.doi.org/10.1186/s12864-022-08435-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Wang, Xiaolong
Dong, Quanjiang
Chen, Gang
Zhang, Jianye
Liu, Yongqiang
Cai, Yujia
Frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance
title Frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance
title_full Frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance
title_fullStr Frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance
title_full_unstemmed Frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance
title_short Frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance
title_sort frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9164415/
https://www.ncbi.nlm.nih.gov/pubmed/35655139
http://dx.doi.org/10.1186/s12864-022-08435-6
work_keys_str_mv AT wangxiaolong frameshiftandwildtypeproteinsareoftenhighlysimilarbecausethegeneticcodeandgenomeswereoptimizedforframeshifttolerance
AT dongquanjiang frameshiftandwildtypeproteinsareoftenhighlysimilarbecausethegeneticcodeandgenomeswereoptimizedforframeshifttolerance
AT chengang frameshiftandwildtypeproteinsareoftenhighlysimilarbecausethegeneticcodeandgenomeswereoptimizedforframeshifttolerance
AT zhangjianye frameshiftandwildtypeproteinsareoftenhighlysimilarbecausethegeneticcodeandgenomeswereoptimizedforframeshifttolerance
AT liuyongqiang frameshiftandwildtypeproteinsareoftenhighlysimilarbecausethegeneticcodeandgenomeswereoptimizedforframeshifttolerance
AT caiyujia frameshiftandwildtypeproteinsareoftenhighlysimilarbecausethegeneticcodeandgenomeswereoptimizedforframeshifttolerance