Cargando…
Population Diversity of ORFan Genes in Escherichia coli
The origin and evolution of “ORFans” (suspected genes without known relatives) remain unclear. Here, we take advantage of a unique opportunity to examine the population diversity of thousands of ORFans, based on a collection of 35 complete genomes of isolates of Escherichia coli and Shigella (which...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3514957/ https://www.ncbi.nlm.nih.gov/pubmed/23034216 http://dx.doi.org/10.1093/gbe/evs081 |
_version_ | 1782252102451462144 |
---|---|
author | Yu, Guoqin Stoltzfus, Arlin |
author_facet | Yu, Guoqin Stoltzfus, Arlin |
author_sort | Yu, Guoqin |
collection | PubMed |
description | The origin and evolution of “ORFans” (suspected genes without known relatives) remain unclear. Here, we take advantage of a unique opportunity to examine the population diversity of thousands of ORFans, based on a collection of 35 complete genomes of isolates of Escherichia coli and Shigella (which is included phylogenetically within E. coli). As expected from previous studies, ORFans are shorter and AT-richer in sequence than non-ORFans. We find that ORFans often are very narrowly distributed: the most common pattern is for an ORFan to be found in only one genome. We compared within-species population diversity of ORFan genes with those of two control groups of non-ORFan genes. Patterns of population variation suggest that most ORFans are not artifacts, but encode real genes whose protein-coding capacity is conserved, reflecting selection against nonsynonymous mutations. Nevertheless, nonsynonymous nucleotide diversity is higher than for non-ORFans, whereas synonymous diversity is roughly the same. In particular, there is a several-fold excess of ORFans in the highest decile of diversity relative to controls, which might be due to weaker purifying selection, positive selection, or a subclass of ORFans that are decaying. |
format | Online Article Text |
id | pubmed-3514957 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-35149572012-12-05 Population Diversity of ORFan Genes in Escherichia coli Yu, Guoqin Stoltzfus, Arlin Genome Biol Evol Research Article The origin and evolution of “ORFans” (suspected genes without known relatives) remain unclear. Here, we take advantage of a unique opportunity to examine the population diversity of thousands of ORFans, based on a collection of 35 complete genomes of isolates of Escherichia coli and Shigella (which is included phylogenetically within E. coli). As expected from previous studies, ORFans are shorter and AT-richer in sequence than non-ORFans. We find that ORFans often are very narrowly distributed: the most common pattern is for an ORFan to be found in only one genome. We compared within-species population diversity of ORFan genes with those of two control groups of non-ORFan genes. Patterns of population variation suggest that most ORFans are not artifacts, but encode real genes whose protein-coding capacity is conserved, reflecting selection against nonsynonymous mutations. Nevertheless, nonsynonymous nucleotide diversity is higher than for non-ORFans, whereas synonymous diversity is roughly the same. In particular, there is a several-fold excess of ORFans in the highest decile of diversity relative to controls, which might be due to weaker purifying selection, positive selection, or a subclass of ORFans that are decaying. Oxford University Press 2012 2012-10-03 /pmc/articles/PMC3514957/ /pubmed/23034216 http://dx.doi.org/10.1093/gbe/evs081 Text en Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2012. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Yu, Guoqin Stoltzfus, Arlin Population Diversity of ORFan Genes in Escherichia coli |
title | Population Diversity of ORFan Genes in Escherichia coli |
title_full | Population Diversity of ORFan Genes in Escherichia coli |
title_fullStr | Population Diversity of ORFan Genes in Escherichia coli |
title_full_unstemmed | Population Diversity of ORFan Genes in Escherichia coli |
title_short | Population Diversity of ORFan Genes in Escherichia coli |
title_sort | population diversity of orfan genes in escherichia coli |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3514957/ https://www.ncbi.nlm.nih.gov/pubmed/23034216 http://dx.doi.org/10.1093/gbe/evs081 |
work_keys_str_mv | AT yuguoqin populationdiversityoforfangenesinescherichiacoli AT stoltzfusarlin populationdiversityoforfangenesinescherichiacoli |