Cargando…
Genomic variation in Salmonella enterica core genes for epidemiological typing
BACKGROUND: Technological advances in high throughput genome sequencing are making whole genome sequencing (WGS) available as a routine tool for bacterial typing. Standardized procedures for identification of relevant genes and of variation are needed to enable comparison between studies and over ti...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3359268/ https://www.ncbi.nlm.nih.gov/pubmed/22409488 http://dx.doi.org/10.1186/1471-2164-13-88 |
_version_ | 1782233849677217792 |
---|---|
author | Leekitcharoenphon, Pimlapas Lukjancenko, Oksana Friis, Carsten Aarestrup, Frank M Ussery, David W |
author_facet | Leekitcharoenphon, Pimlapas Lukjancenko, Oksana Friis, Carsten Aarestrup, Frank M Ussery, David W |
author_sort | Leekitcharoenphon, Pimlapas |
collection | PubMed |
description | BACKGROUND: Technological advances in high throughput genome sequencing are making whole genome sequencing (WGS) available as a routine tool for bacterial typing. Standardized procedures for identification of relevant genes and of variation are needed to enable comparison between studies and over time. The core genes--the genes that are conserved in all (or most) members of a genus or species--are potentially good candidates for investigating genomic variation in phylogeny and epidemiology. RESULTS: We identify a set of 2,882 core genes clusters based on 73 publicly available Salmonella enterica genomes and evaluate their value as typing targets, comparing whole genome typing and traditional methods such as 16S and MLST. A consensus tree based on variation of core genes gives much better resolution than 16S and MLST; the pan-genome family tree is similar to the consensus tree, but with higher confidence. The core genes can be divided into two categories: a few highly variable genes and a larger set of conserved core genes, with low variance. For the most variable core genes, the variance in amino acid sequences is higher than for the corresponding nucleotide sequences, suggesting that there is a positive selection towards mutations leading to amino acid changes. CONCLUSIONS: Genomic variation within the core genome is useful for investigating molecular evolution and providing candidate genes for bacterial genome typing. Identification of genes with different degrees of variation is important especially in trend analysis. |
format | Online Article Text |
id | pubmed-3359268 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-33592682012-05-24 Genomic variation in Salmonella enterica core genes for epidemiological typing Leekitcharoenphon, Pimlapas Lukjancenko, Oksana Friis, Carsten Aarestrup, Frank M Ussery, David W BMC Genomics Research Article BACKGROUND: Technological advances in high throughput genome sequencing are making whole genome sequencing (WGS) available as a routine tool for bacterial typing. Standardized procedures for identification of relevant genes and of variation are needed to enable comparison between studies and over time. The core genes--the genes that are conserved in all (or most) members of a genus or species--are potentially good candidates for investigating genomic variation in phylogeny and epidemiology. RESULTS: We identify a set of 2,882 core genes clusters based on 73 publicly available Salmonella enterica genomes and evaluate their value as typing targets, comparing whole genome typing and traditional methods such as 16S and MLST. A consensus tree based on variation of core genes gives much better resolution than 16S and MLST; the pan-genome family tree is similar to the consensus tree, but with higher confidence. The core genes can be divided into two categories: a few highly variable genes and a larger set of conserved core genes, with low variance. For the most variable core genes, the variance in amino acid sequences is higher than for the corresponding nucleotide sequences, suggesting that there is a positive selection towards mutations leading to amino acid changes. CONCLUSIONS: Genomic variation within the core genome is useful for investigating molecular evolution and providing candidate genes for bacterial genome typing. Identification of genes with different degrees of variation is important especially in trend analysis. BioMed Central 2012-03-12 /pmc/articles/PMC3359268/ /pubmed/22409488 http://dx.doi.org/10.1186/1471-2164-13-88 Text en Copyright ©2012 Leekitcharoenphon et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Leekitcharoenphon, Pimlapas Lukjancenko, Oksana Friis, Carsten Aarestrup, Frank M Ussery, David W Genomic variation in Salmonella enterica core genes for epidemiological typing |
title | Genomic variation in Salmonella enterica core genes for epidemiological typing |
title_full | Genomic variation in Salmonella enterica core genes for epidemiological typing |
title_fullStr | Genomic variation in Salmonella enterica core genes for epidemiological typing |
title_full_unstemmed | Genomic variation in Salmonella enterica core genes for epidemiological typing |
title_short | Genomic variation in Salmonella enterica core genes for epidemiological typing |
title_sort | genomic variation in salmonella enterica core genes for epidemiological typing |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3359268/ https://www.ncbi.nlm.nih.gov/pubmed/22409488 http://dx.doi.org/10.1186/1471-2164-13-88 |
work_keys_str_mv | AT leekitcharoenphonpimlapas genomicvariationinsalmonellaentericacoregenesforepidemiologicaltyping AT lukjancenkooksana genomicvariationinsalmonellaentericacoregenesforepidemiologicaltyping AT friiscarsten genomicvariationinsalmonellaentericacoregenesforepidemiologicaltyping AT aarestrupfrankm genomicvariationinsalmonellaentericacoregenesforepidemiologicaltyping AT usserydavidw genomicvariationinsalmonellaentericacoregenesforepidemiologicaltyping |