
Population Diversity of ORFan Genes in Escherichia coli

The origin and evolution of “ORFans” (suspected genes without known relatives) remain unclear. Here, we take advantage of a unique opportunity to examine the population diversity of thousands of ORFans, based on a collection of 35 complete genomes of isolates of Escherichia coli and Shigella (which...


Bibliographic Details
Main Authors: Yu, Guoqin; Stoltzfus, Arlin
Format: Online Article Text
Language: English
Published: Oxford University Press 2012
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3514957/
https://www.ncbi.nlm.nih.gov/pubmed/23034216
http://dx.doi.org/10.1093/gbe/evs081
author Yu, Guoqin
Stoltzfus, Arlin
collection PubMed
description The origin and evolution of “ORFans” (suspected genes without known relatives) remain unclear. Here, we take advantage of a unique opportunity to examine the population diversity of thousands of ORFans, based on a collection of 35 complete genomes of isolates of Escherichia coli and Shigella (which is included phylogenetically within E. coli). As expected from previous studies, ORFans are shorter and AT-richer in sequence than non-ORFans. We find that ORFans often are very narrowly distributed: the most common pattern is for an ORFan to be found in only one genome. We compared within-species population diversity of ORFan genes with those of two control groups of non-ORFan genes. Patterns of population variation suggest that most ORFans are not artifacts, but encode real genes whose protein-coding capacity is conserved, reflecting selection against nonsynonymous mutations. Nevertheless, nonsynonymous nucleotide diversity is higher than for non-ORFans, whereas synonymous diversity is roughly the same. In particular, there is a several-fold excess of ORFans in the highest decile of diversity relative to controls, which might be due to weaker purifying selection, positive selection, or a subclass of ORFans that are decaying.
format Online
Article
Text
id pubmed-3514957
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-3514957 2012-12-05 Population Diversity of ORFan Genes in Escherichia coli Yu, Guoqin Stoltzfus, Arlin Genome Biol Evol Research Article Oxford University Press 2012 2012-10-03 /pmc/articles/PMC3514957/ /pubmed/23034216 http://dx.doi.org/10.1093/gbe/evs081 Text en Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2012. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title Population Diversity of ORFan Genes in Escherichia coli
topic Research Article