Cargando…

High resolution clustering of Salmonella enterica serovar Montevideo strains using a next-generation sequencing approach

BACKGROUND: Next-Generation Sequencing (NGS) is increasingly being used as a molecular epidemiologic tool for discerning ancestry and traceback of the most complicated, difficult to resolve bacterial pathogens. Making a linkage between possible food sources and clinical isolates requires distinguish...

Descripción completa

Detalles Bibliográficos
Autores principales: Allard, Marc W, Luo, Yan, Strain, Errol, Li, Cong, Keys, Christine E, Son, Insook, Stones, Robert, Musser, Steven M, Brown, Eric W
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3368722/
https://www.ncbi.nlm.nih.gov/pubmed/22260654
http://dx.doi.org/10.1186/1471-2164-13-32
_version_ 1782234975887687680
author Allard, Marc W
Luo, Yan
Strain, Errol
Li, Cong
Keys, Christine E
Son, Insook
Stones, Robert
Musser, Steven M
Brown, Eric W
author_facet Allard, Marc W
Luo, Yan
Strain, Errol
Li, Cong
Keys, Christine E
Son, Insook
Stones, Robert
Musser, Steven M
Brown, Eric W
author_sort Allard, Marc W
collection PubMed
description BACKGROUND: Next-Generation Sequencing (NGS) is increasingly being used as a molecular epidemiologic tool for discerning ancestry and traceback of the most complicated, difficult to resolve bacterial pathogens. Making a linkage between possible food sources and clinical isolates requires distinguishing the suspected pathogen from an environmental background and placing the variation observed into the wider context of variation occurring within a serovar and among other closely related foodborne pathogens. Equally important is the need to validate these high resolution molecular tools for use in molecular epidemiologic traceback. Such efforts include the examination of strain cluster stability as well as the cumulative genetic effects of sub-culturing on these clusters. Numerous isolates of S. Montevideo were shot-gun sequenced including diverse lineage representatives as well as numerous replicate clones to determine how much variability is due to bias, sequencing error, and or the culturing of isolates. All new draft genomes were compared to 34 S. Montevideo isolates previously published during an NGS-based molecular epidemiological case study. RESULTS: Intraserovar lineages of S. Montevideo differ by thousands of SNPs, that are only slightly less than the number of SNPs observed between S. Montevideo and other distinct serovars. Much less variability was discovered within an individual S. Montevideo clade implicated in a recent foodborne outbreak as well as among individual NGS replicates. These findings were similar to previous reports documenting homopolymeric and deletion error rates with the Roche 454 GS Titanium technology. In no case, however, did variability associated with sequencing methods or sample preparations create inconsistencies with our current phylogenetic results or the subsequent molecular epidemiological evidence gleaned from these data. CONCLUSIONS: Implementation of a validated pipeline for NGS data acquisition and analysis provides highly reproducible results that are stable and predictable for molecular epidemiological applications. When draft genomes are collected at 15×-20× coverage and passed through a quality filter as part of a data analysis pipeline, including sub-passaged replicates defined by a few SNPs, they can be accurately placed in a phylogenetic context. This reproducibility applies to all levels within and between serovars of Salmonella suggesting that investigators using these methods can have confidence in their conclusions.
format Online
Article
Text
id pubmed-3368722
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33687222012-06-07 High resolution clustering of Salmonella enterica serovar Montevideo strains using a next-generation sequencing approach Allard, Marc W Luo, Yan Strain, Errol Li, Cong Keys, Christine E Son, Insook Stones, Robert Musser, Steven M Brown, Eric W BMC Genomics Research Article BACKGROUND: Next-Generation Sequencing (NGS) is increasingly being used as a molecular epidemiologic tool for discerning ancestry and traceback of the most complicated, difficult to resolve bacterial pathogens. Making a linkage between possible food sources and clinical isolates requires distinguishing the suspected pathogen from an environmental background and placing the variation observed into the wider context of variation occurring within a serovar and among other closely related foodborne pathogens. Equally important is the need to validate these high resolution molecular tools for use in molecular epidemiologic traceback. Such efforts include the examination of strain cluster stability as well as the cumulative genetic effects of sub-culturing on these clusters. Numerous isolates of S. Montevideo were shot-gun sequenced including diverse lineage representatives as well as numerous replicate clones to determine how much variability is due to bias, sequencing error, and or the culturing of isolates. All new draft genomes were compared to 34 S. Montevideo isolates previously published during an NGS-based molecular epidemiological case study. RESULTS: Intraserovar lineages of S. Montevideo differ by thousands of SNPs, that are only slightly less than the number of SNPs observed between S. Montevideo and other distinct serovars. Much less variability was discovered within an individual S. Montevideo clade implicated in a recent foodborne outbreak as well as among individual NGS replicates. These findings were similar to previous reports documenting homopolymeric and deletion error rates with the Roche 454 GS Titanium technology. In no case, however, did variability associated with sequencing methods or sample preparations create inconsistencies with our current phylogenetic results or the subsequent molecular epidemiological evidence gleaned from these data. CONCLUSIONS: Implementation of a validated pipeline for NGS data acquisition and analysis provides highly reproducible results that are stable and predictable for molecular epidemiological applications. When draft genomes are collected at 15×-20× coverage and passed through a quality filter as part of a data analysis pipeline, including sub-passaged replicates defined by a few SNPs, they can be accurately placed in a phylogenetic context. This reproducibility applies to all levels within and between serovars of Salmonella suggesting that investigators using these methods can have confidence in their conclusions. BioMed Central 2012-01-19 /pmc/articles/PMC3368722/ /pubmed/22260654 http://dx.doi.org/10.1186/1471-2164-13-32 Text en Copyright ©2012 Allard et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Allard, Marc W
Luo, Yan
Strain, Errol
Li, Cong
Keys, Christine E
Son, Insook
Stones, Robert
Musser, Steven M
Brown, Eric W
High resolution clustering of Salmonella enterica serovar Montevideo strains using a next-generation sequencing approach
title High resolution clustering of Salmonella enterica serovar Montevideo strains using a next-generation sequencing approach
title_full High resolution clustering of Salmonella enterica serovar Montevideo strains using a next-generation sequencing approach
title_fullStr High resolution clustering of Salmonella enterica serovar Montevideo strains using a next-generation sequencing approach
title_full_unstemmed High resolution clustering of Salmonella enterica serovar Montevideo strains using a next-generation sequencing approach
title_short High resolution clustering of Salmonella enterica serovar Montevideo strains using a next-generation sequencing approach
title_sort high resolution clustering of salmonella enterica serovar montevideo strains using a next-generation sequencing approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3368722/
https://www.ncbi.nlm.nih.gov/pubmed/22260654
http://dx.doi.org/10.1186/1471-2164-13-32
work_keys_str_mv AT allardmarcw highresolutionclusteringofsalmonellaentericaserovarmontevideostrainsusinganextgenerationsequencingapproach
AT luoyan highresolutionclusteringofsalmonellaentericaserovarmontevideostrainsusinganextgenerationsequencingapproach
AT strainerrol highresolutionclusteringofsalmonellaentericaserovarmontevideostrainsusinganextgenerationsequencingapproach
AT licong highresolutionclusteringofsalmonellaentericaserovarmontevideostrainsusinganextgenerationsequencingapproach
AT keyschristinee highresolutionclusteringofsalmonellaentericaserovarmontevideostrainsusinganextgenerationsequencingapproach
AT soninsook highresolutionclusteringofsalmonellaentericaserovarmontevideostrainsusinganextgenerationsequencingapproach
AT stonesrobert highresolutionclusteringofsalmonellaentericaserovarmontevideostrainsusinganextgenerationsequencingapproach
AT musserstevenm highresolutionclusteringofsalmonellaentericaserovarmontevideostrainsusinganextgenerationsequencingapproach
AT brownericw highresolutionclusteringofsalmonellaentericaserovarmontevideostrainsusinganextgenerationsequencingapproach