Concordance of SNP- and allele-based typing workflows in the context of a large-scale international Salmonella Enteritidis outbreak investigation
A large European multi-country Salmonella enterica serovar Enteritidis outbreak associated with Polish eggs was characterized by whole-genome sequencing (WGS)-based analysis, with various European institutes using different analysis workflows to identify isolates potentially related to the outbreak....
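Main authors: | Coipan, Claudia E.; Dallman, Timothy J.; Brown, Derek; Hartman, Hassan; van der Voort, Menno; van den Berg, Redmar R.; Palm, Daniel; Kotila, Saara; van Wijk, Tom; Franz, Eelco |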
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Microbiology Society 2020 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7200063/ https://www.ncbi.nlm.nih.gov/pubmed/32101514 http://dx.doi.org/10.1099/mgen.0.000318 |
_version_ | 1783529267692306432 |
---|---|
author | Coipan, Claudia E. Dallman, Timothy J. Brown, Derek Hartman, Hassan van der Voort, Menno van den Berg, Redmar R. Palm, Daniel Kotila, Saara van Wijk, Tom Franz, Eelco |
author_sort | Coipan, Claudia E. |
collection | PubMed |
description | A large European multi-country Salmonella enterica serovar Enteritidis outbreak associated with Polish eggs was characterized by whole-genome sequencing (WGS)-based analysis, with various European institutes using different analysis workflows to identify isolates potentially related to the outbreak. The objective of our study was to compare the output of six of these different typing workflows (distance matrices of either SNP-based or allele-based workflows) in terms of cluster detection and concordance. To this end, we analysed a set of 180 isolates coming from confirmed and probable outbreak cases, which were representative of the genetic variation within the outbreak, supplemented with 22 unrelated contemporaneous S. enterica serovar Enteritidis isolates. Since the definition of a cluster cut-off based on genetic distance requires prior knowledge on the evolutionary processes that govern the bacterial populations in question, we used a variety of hierarchical clustering methods (single, average and complete) and selected the optimal number of clusters based on the consensus of the silhouette, Dunn2, and McClain–Rao internal validation indices. External validation was done by calculating the concordance with the WGS-based case definition (SNP-address) for this outbreak using the Fowlkes–Mallows index. Our analysis indicates that with complete-linkage hierarchical clustering combined with the optimal number of clusters, as defined by three internal validity indices, the six different allele- and SNP-based typing workflows generate clusters with similar compositions. Furthermore, we show that even in the absence of coordinated typing procedures, but by using an unsupervised machine learning methodology for cluster delineation, the various workflows that are currently in use by six European public-health authorities can identify concordant clusters of genetically related S. enterica serovar Enteritidis isolates; thus, providing public-health researchers with comparable tools for detection of infectious-disease outbreaks. |
format | Online Article Text |
id | pubmed-7200063 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Microbiology Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-7200063 2020-05-06 Concordance of SNP- and allele-based typing workflows in the context of a large-scale international Salmonella Enteritidis outbreak investigation Coipan, Claudia E. Dallman, Timothy J. Brown, Derek Hartman, Hassan van der Voort, Menno van den Berg, Redmar R. Palm, Daniel Kotila, Saara van Wijk, Tom Franz, Eelco Microb Genom Research Article A large European multi-country Salmonella enterica serovar Enteritidis outbreak associated with Polish eggs was characterized by whole-genome sequencing (WGS)-based analysis, with various European institutes using different analysis workflows to identify isolates potentially related to the outbreak. The objective of our study was to compare the output of six of these different typing workflows (distance matrices of either SNP-based or allele-based workflows) in terms of cluster detection and concordance. To this end, we analysed a set of 180 isolates coming from confirmed and probable outbreak cases, which were representative of the genetic variation within the outbreak, supplemented with 22 unrelated contemporaneous S. enterica serovar Enteritidis isolates. Since the definition of a cluster cut-off based on genetic distance requires prior knowledge on the evolutionary processes that govern the bacterial populations in question, we used a variety of hierarchical clustering methods (single, average and complete) and selected the optimal number of clusters based on the consensus of the silhouette, Dunn2, and McClain–Rao internal validation indices. External validation was done by calculating the concordance with the WGS-based case definition (SNP-address) for this outbreak using the Fowlkes–Mallows index. Our analysis indicates that with complete-linkage hierarchical clustering combined with the optimal number of clusters, as defined by three internal validity indices, the six different allele- and SNP-based typing workflows generate clusters with similar compositions. Furthermore, we show that even in the absence of coordinated typing procedures, but by using an unsupervised machine learning methodology for cluster delineation, the various workflows that are currently in use by six European public-health authorities can identify concordant clusters of genetically related S. enterica serovar Enteritidis isolates; thus, providing public-health researchers with comparable tools for detection of infectious-disease outbreaks. Microbiology Society 2020-02-26 /pmc/articles/PMC7200063/ /pubmed/32101514 http://dx.doi.org/10.1099/mgen.0.000318 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution NonCommercial License. |
spellingShingle | Research Article Coipan, Claudia E. Dallman, Timothy J. Brown, Derek Hartman, Hassan van der Voort, Menno van den Berg, Redmar R. Palm, Daniel Kotila, Saara van Wijk, Tom Franz, Eelco Concordance of SNP- and allele-based typing workflows in the context of a large-scale international Salmonella Enteritidis outbreak investigation |
title | Concordance of SNP- and allele-based typing workflows in the context of a large-scale international Salmonella Enteritidis outbreak investigation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7200063/ https://www.ncbi.nlm.nih.gov/pubmed/32101514 http://dx.doi.org/10.1099/mgen.0.000318 |