Cargando…

Genomic variation in Salmonella enterica core genes for epidemiological typing

BACKGROUND: Technological advances in high throughput genome sequencing are making whole genome sequencing (WGS) available as a routine tool for bacterial typing. Standardized procedures for identification of relevant genes and of variation are needed to enable comparison between studies and over ti...

Descripción completa

Detalles Bibliográficos
Autores principales: Leekitcharoenphon, Pimlapas, Lukjancenko, Oksana, Friis, Carsten, Aarestrup, Frank M, Ussery, David W
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3359268/
https://www.ncbi.nlm.nih.gov/pubmed/22409488
http://dx.doi.org/10.1186/1471-2164-13-88
_version_ 1782233849677217792
author Leekitcharoenphon, Pimlapas
Lukjancenko, Oksana
Friis, Carsten
Aarestrup, Frank M
Ussery, David W
author_facet Leekitcharoenphon, Pimlapas
Lukjancenko, Oksana
Friis, Carsten
Aarestrup, Frank M
Ussery, David W
author_sort Leekitcharoenphon, Pimlapas
collection PubMed
description BACKGROUND: Technological advances in high throughput genome sequencing are making whole genome sequencing (WGS) available as a routine tool for bacterial typing. Standardized procedures for identification of relevant genes and of variation are needed to enable comparison between studies and over time. The core genes--the genes that are conserved in all (or most) members of a genus or species--are potentially good candidates for investigating genomic variation in phylogeny and epidemiology. RESULTS: We identify a set of 2,882 core genes clusters based on 73 publicly available Salmonella enterica genomes and evaluate their value as typing targets, comparing whole genome typing and traditional methods such as 16S and MLST. A consensus tree based on variation of core genes gives much better resolution than 16S and MLST; the pan-genome family tree is similar to the consensus tree, but with higher confidence. The core genes can be divided into two categories: a few highly variable genes and a larger set of conserved core genes, with low variance. For the most variable core genes, the variance in amino acid sequences is higher than for the corresponding nucleotide sequences, suggesting that there is a positive selection towards mutations leading to amino acid changes. CONCLUSIONS: Genomic variation within the core genome is useful for investigating molecular evolution and providing candidate genes for bacterial genome typing. Identification of genes with different degrees of variation is important especially in trend analysis.
format Online
Article
Text
id pubmed-3359268
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33592682012-05-24 Genomic variation in Salmonella enterica core genes for epidemiological typing Leekitcharoenphon, Pimlapas Lukjancenko, Oksana Friis, Carsten Aarestrup, Frank M Ussery, David W BMC Genomics Research Article BACKGROUND: Technological advances in high throughput genome sequencing are making whole genome sequencing (WGS) available as a routine tool for bacterial typing. Standardized procedures for identification of relevant genes and of variation are needed to enable comparison between studies and over time. The core genes--the genes that are conserved in all (or most) members of a genus or species--are potentially good candidates for investigating genomic variation in phylogeny and epidemiology. RESULTS: We identify a set of 2,882 core genes clusters based on 73 publicly available Salmonella enterica genomes and evaluate their value as typing targets, comparing whole genome typing and traditional methods such as 16S and MLST. A consensus tree based on variation of core genes gives much better resolution than 16S and MLST; the pan-genome family tree is similar to the consensus tree, but with higher confidence. The core genes can be divided into two categories: a few highly variable genes and a larger set of conserved core genes, with low variance. For the most variable core genes, the variance in amino acid sequences is higher than for the corresponding nucleotide sequences, suggesting that there is a positive selection towards mutations leading to amino acid changes. CONCLUSIONS: Genomic variation within the core genome is useful for investigating molecular evolution and providing candidate genes for bacterial genome typing. Identification of genes with different degrees of variation is important especially in trend analysis. BioMed Central 2012-03-12 /pmc/articles/PMC3359268/ /pubmed/22409488 http://dx.doi.org/10.1186/1471-2164-13-88 Text en Copyright ©2012 Leekitcharoenphon et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Leekitcharoenphon, Pimlapas
Lukjancenko, Oksana
Friis, Carsten
Aarestrup, Frank M
Ussery, David W
Genomic variation in Salmonella enterica core genes for epidemiological typing
title Genomic variation in Salmonella enterica core genes for epidemiological typing
title_full Genomic variation in Salmonella enterica core genes for epidemiological typing
title_fullStr Genomic variation in Salmonella enterica core genes for epidemiological typing
title_full_unstemmed Genomic variation in Salmonella enterica core genes for epidemiological typing
title_short Genomic variation in Salmonella enterica core genes for epidemiological typing
title_sort genomic variation in salmonella enterica core genes for epidemiological typing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3359268/
https://www.ncbi.nlm.nih.gov/pubmed/22409488
http://dx.doi.org/10.1186/1471-2164-13-88
work_keys_str_mv AT leekitcharoenphonpimlapas genomicvariationinsalmonellaentericacoregenesforepidemiologicaltyping
AT lukjancenkooksana genomicvariationinsalmonellaentericacoregenesforepidemiologicaltyping
AT friiscarsten genomicvariationinsalmonellaentericacoregenesforepidemiologicaltyping
AT aarestrupfrankm genomicvariationinsalmonellaentericacoregenesforepidemiologicaltyping
AT usserydavidw genomicvariationinsalmonellaentericacoregenesforepidemiologicaltyping