Cargando…

Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life

Codon composition, GC content and local RNA secondary structures can have a profound effect on gene expression, and mutations affecting these parameters, even though they do not alter the protein sequence, are not neutral in terms of selection. Although evidence exists that, in some cases, selection...

Descripción completa

Detalles Bibliográficos
Autores principales: Gebert, Daniel, Jehn, Julia, Rosenkranz, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6544989/
https://www.ncbi.nlm.nih.gov/pubmed/31138098
http://dx.doi.org/10.1098/rsob.190020
_version_ 1783423328864698368
author Gebert, Daniel
Jehn, Julia
Rosenkranz, David
author_facet Gebert, Daniel
Jehn, Julia
Rosenkranz, David
author_sort Gebert, Daniel
collection PubMed
description Codon composition, GC content and local RNA secondary structures can have a profound effect on gene expression, and mutations affecting these parameters, even though they do not alter the protein sequence, are not neutral in terms of selection. Although evidence exists that, in some cases, selection favours more stable RNA secondary structures, we currently lack a concrete idea of how many genes are affected within a species, and whether this is a universal phenomenon in nature. We searched for signs of structural selection in a global manner, analysing a set of 1 million coding sequences from 73 species representing all domains of life, as well as viruses, by means of our newly developed software PACKEIS. We show that codon composition and amino acid identity are main determinants of RNA secondary structure. In addition, we show that the arrangement of synonymous codons within coding sequences is non-random, yielding extremely high, but also extremely low, RNA structuredness significantly more often than expected by chance. Taken together, we demonstrate that selection for high and low levels of secondary structure is a widespread phenomenon. Our results provide another line of evidence that synonymous mutations are less neutral than commonly thought, which is of importance for many evolutionary models.
format Online
Article
Text
id pubmed-6544989
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-65449892019-06-11 Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life Gebert, Daniel Jehn, Julia Rosenkranz, David Open Biol Research Codon composition, GC content and local RNA secondary structures can have a profound effect on gene expression, and mutations affecting these parameters, even though they do not alter the protein sequence, are not neutral in terms of selection. Although evidence exists that, in some cases, selection favours more stable RNA secondary structures, we currently lack a concrete idea of how many genes are affected within a species, and whether this is a universal phenomenon in nature. We searched for signs of structural selection in a global manner, analysing a set of 1 million coding sequences from 73 species representing all domains of life, as well as viruses, by means of our newly developed software PACKEIS. We show that codon composition and amino acid identity are main determinants of RNA secondary structure. In addition, we show that the arrangement of synonymous codons within coding sequences is non-random, yielding extremely high, but also extremely low, RNA structuredness significantly more often than expected by chance. Taken together, we demonstrate that selection for high and low levels of secondary structure is a widespread phenomenon. Our results provide another line of evidence that synonymous mutations are less neutral than commonly thought, which is of importance for many evolutionary models. The Royal Society 2019-05-29 /pmc/articles/PMC6544989/ /pubmed/31138098 http://dx.doi.org/10.1098/rsob.190020 Text en © 2019 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle Research
Gebert, Daniel
Jehn, Julia
Rosenkranz, David
Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life
title Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life
title_full Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life
title_fullStr Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life
title_full_unstemmed Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life
title_short Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life
title_sort widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6544989/
https://www.ncbi.nlm.nih.gov/pubmed/31138098
http://dx.doi.org/10.1098/rsob.190020
work_keys_str_mv AT gebertdaniel widespreadselectionforextremelyhighandlowlevelsofsecondarystructureincodingsequencesacrossalldomainsoflife
AT jehnjulia widespreadselectionforextremelyhighandlowlevelsofsecondarystructureincodingsequencesacrossalldomainsoflife
AT rosenkranzdavid widespreadselectionforextremelyhighandlowlevelsofsecondarystructureincodingsequencesacrossalldomainsoflife