Cargando…
Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life
Codon composition, GC content and local RNA secondary structures can have a profound effect on gene expression, and mutations affecting these parameters, even though they do not alter the protein sequence, are not neutral in terms of selection. Although evidence exists that, in some cases, selection...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6544989/ https://www.ncbi.nlm.nih.gov/pubmed/31138098 http://dx.doi.org/10.1098/rsob.190020 |
_version_ | 1783423328864698368 |
---|---|
author | Gebert, Daniel Jehn, Julia Rosenkranz, David |
author_facet | Gebert, Daniel Jehn, Julia Rosenkranz, David |
author_sort | Gebert, Daniel |
collection | PubMed |
description | Codon composition, GC content and local RNA secondary structures can have a profound effect on gene expression, and mutations affecting these parameters, even though they do not alter the protein sequence, are not neutral in terms of selection. Although evidence exists that, in some cases, selection favours more stable RNA secondary structures, we currently lack a concrete idea of how many genes are affected within a species, and whether this is a universal phenomenon in nature. We searched for signs of structural selection in a global manner, analysing a set of 1 million coding sequences from 73 species representing all domains of life, as well as viruses, by means of our newly developed software PACKEIS. We show that codon composition and amino acid identity are main determinants of RNA secondary structure. In addition, we show that the arrangement of synonymous codons within coding sequences is non-random, yielding extremely high, but also extremely low, RNA structuredness significantly more often than expected by chance. Taken together, we demonstrate that selection for high and low levels of secondary structure is a widespread phenomenon. Our results provide another line of evidence that synonymous mutations are less neutral than commonly thought, which is of importance for many evolutionary models. |
format | Online Article Text |
id | pubmed-6544989 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | The Royal Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-65449892019-06-11 Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life Gebert, Daniel Jehn, Julia Rosenkranz, David Open Biol Research Codon composition, GC content and local RNA secondary structures can have a profound effect on gene expression, and mutations affecting these parameters, even though they do not alter the protein sequence, are not neutral in terms of selection. Although evidence exists that, in some cases, selection favours more stable RNA secondary structures, we currently lack a concrete idea of how many genes are affected within a species, and whether this is a universal phenomenon in nature. We searched for signs of structural selection in a global manner, analysing a set of 1 million coding sequences from 73 species representing all domains of life, as well as viruses, by means of our newly developed software PACKEIS. We show that codon composition and amino acid identity are main determinants of RNA secondary structure. In addition, we show that the arrangement of synonymous codons within coding sequences is non-random, yielding extremely high, but also extremely low, RNA structuredness significantly more often than expected by chance. Taken together, we demonstrate that selection for high and low levels of secondary structure is a widespread phenomenon. Our results provide another line of evidence that synonymous mutations are less neutral than commonly thought, which is of importance for many evolutionary models. The Royal Society 2019-05-29 /pmc/articles/PMC6544989/ /pubmed/31138098 http://dx.doi.org/10.1098/rsob.190020 Text en © 2019 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited. |
spellingShingle | Research Gebert, Daniel Jehn, Julia Rosenkranz, David Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life |
title | Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life |
title_full | Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life |
title_fullStr | Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life |
title_full_unstemmed | Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life |
title_short | Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life |
title_sort | widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6544989/ https://www.ncbi.nlm.nih.gov/pubmed/31138098 http://dx.doi.org/10.1098/rsob.190020 |
work_keys_str_mv | AT gebertdaniel widespreadselectionforextremelyhighandlowlevelsofsecondarystructureincodingsequencesacrossalldomainsoflife AT jehnjulia widespreadselectionforextremelyhighandlowlevelsofsecondarystructureincodingsequencesacrossalldomainsoflife AT rosenkranzdavid widespreadselectionforextremelyhighandlowlevelsofsecondarystructureincodingsequencesacrossalldomainsoflife |