Cargando…

High GC content causes orphan proteins to be intrinsically disordered

De novo creation of protein coding genes involves the formation of short ORFs from noncoding regions; some of these ORFs might then become fixed in the population. These orphan proteins need to, at the bare minimum, not cause serious harm to the organism, meaning that they should for instance not ag...

Descripción completa

Detalles Bibliográficos
Autores principales: Basile, Walter, Sachenkova, Oxana, Light, Sara, Elofsson, Arne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5389847/
https://www.ncbi.nlm.nih.gov/pubmed/28355220
http://dx.doi.org/10.1371/journal.pcbi.1005375
_version_ 1782521349114167296
author Basile, Walter
Sachenkova, Oxana
Light, Sara
Elofsson, Arne
author_facet Basile, Walter
Sachenkova, Oxana
Light, Sara
Elofsson, Arne
author_sort Basile, Walter
collection PubMed
description De novo creation of protein coding genes involves the formation of short ORFs from noncoding regions; some of these ORFs might then become fixed in the population. These orphan proteins need to, at the bare minimum, not cause serious harm to the organism, meaning that they should for instance not aggregate. Therefore, although the creation of short ORFs could be truly random, the fixation should be subjected to some selective pressure. The selective forces acting on orphan proteins have been elusive, and contradictory results have been reported. In Drosophila young proteins are more disordered than ancient ones, while the opposite trend is present in yeast. To the best of our knowledge no valid explanation for this difference has been proposed. To solve this riddle we studied structural properties and age of proteins in 187 eukaryotic organisms. We find that, with the exception of length, there are only small differences in the properties between proteins of different ages. However, when we take the GC content into account we noted that it could explain the opposite trends observed for orphans in yeast (low GC) and Drosophila (high GC). GC content is correlated with codons coding for disorder promoting amino acids. This leads us to propose that intrinsic disorder is not a strong determining factor for fixation of orphan proteins. Instead these proteins largely resemble random proteins given a particular GC level. During evolution the properties of a protein change faster than the GC level causing the relationship between disorder and GC to gradually weaken.
format Online
Article
Text
id pubmed-5389847
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53898472017-05-03 High GC content causes orphan proteins to be intrinsically disordered Basile, Walter Sachenkova, Oxana Light, Sara Elofsson, Arne PLoS Comput Biol Research Article De novo creation of protein coding genes involves the formation of short ORFs from noncoding regions; some of these ORFs might then become fixed in the population. These orphan proteins need to, at the bare minimum, not cause serious harm to the organism, meaning that they should for instance not aggregate. Therefore, although the creation of short ORFs could be truly random, the fixation should be subjected to some selective pressure. The selective forces acting on orphan proteins have been elusive, and contradictory results have been reported. In Drosophila young proteins are more disordered than ancient ones, while the opposite trend is present in yeast. To the best of our knowledge no valid explanation for this difference has been proposed. To solve this riddle we studied structural properties and age of proteins in 187 eukaryotic organisms. We find that, with the exception of length, there are only small differences in the properties between proteins of different ages. However, when we take the GC content into account we noted that it could explain the opposite trends observed for orphans in yeast (low GC) and Drosophila (high GC). GC content is correlated with codons coding for disorder promoting amino acids. This leads us to propose that intrinsic disorder is not a strong determining factor for fixation of orphan proteins. Instead these proteins largely resemble random proteins given a particular GC level. During evolution the properties of a protein change faster than the GC level causing the relationship between disorder and GC to gradually weaken. Public Library of Science 2017-03-29 /pmc/articles/PMC5389847/ /pubmed/28355220 http://dx.doi.org/10.1371/journal.pcbi.1005375 Text en © 2017 Basile et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Basile, Walter
Sachenkova, Oxana
Light, Sara
Elofsson, Arne
High GC content causes orphan proteins to be intrinsically disordered
title High GC content causes orphan proteins to be intrinsically disordered
title_full High GC content causes orphan proteins to be intrinsically disordered
title_fullStr High GC content causes orphan proteins to be intrinsically disordered
title_full_unstemmed High GC content causes orphan proteins to be intrinsically disordered
title_short High GC content causes orphan proteins to be intrinsically disordered
title_sort high gc content causes orphan proteins to be intrinsically disordered
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5389847/
https://www.ncbi.nlm.nih.gov/pubmed/28355220
http://dx.doi.org/10.1371/journal.pcbi.1005375
work_keys_str_mv AT basilewalter highgccontentcausesorphanproteinstobeintrinsicallydisordered
AT sachenkovaoxana highgccontentcausesorphanproteinstobeintrinsicallydisordered
AT lightsara highgccontentcausesorphanproteinstobeintrinsicallydisordered
AT elofssonarne highgccontentcausesorphanproteinstobeintrinsicallydisordered