Concordance of SNP- and allele-based typing workflows in the context of a large-scale international Salmonella Enteritidis outbreak investigation

A large European multi-country Salmonella enterica serovar Enteritidis outbreak associated with Polish eggs was characterized by whole-genome sequencing (WGS)-based analysis, with various European institutes using different analysis workflows to identify isolates potentially related to the outbreak....

Bibliographic Details
Main Authors: Coipan, Claudia E.; Dallman, Timothy J.; Brown, Derek; Hartman, Hassan; van der Voort, Menno; van den Berg, Redmar R.; Palm, Daniel; Kotila, Saara; van Wijk, Tom; Franz, Eelco
Format: Online Article Text
Language: English
Published: Microbiology Society 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7200063/
https://www.ncbi.nlm.nih.gov/pubmed/32101514
http://dx.doi.org/10.1099/mgen.0.000318
_version_ 1783529267692306432
author Coipan, Claudia E.
Dallman, Timothy J.
Brown, Derek
Hartman, Hassan
van der Voort, Menno
van den Berg, Redmar R.
Palm, Daniel
Kotila, Saara
van Wijk, Tom
Franz, Eelco
author_sort Coipan, Claudia E.
collection PubMed
description A large European multi-country Salmonella enterica serovar Enteritidis outbreak associated with Polish eggs was characterized by whole-genome sequencing (WGS)-based analysis, with various European institutes using different analysis workflows to identify isolates potentially related to the outbreak. The objective of our study was to compare the output of six of these different typing workflows (distance matrices of either SNP-based or allele-based workflows) in terms of cluster detection and concordance. To this end, we analysed a set of 180 isolates coming from confirmed and probable outbreak cases, which were representative of the genetic variation within the outbreak, supplemented with 22 unrelated contemporaneous S. enterica serovar Enteritidis isolates. Since the definition of a cluster cut-off based on genetic distance requires prior knowledge on the evolutionary processes that govern the bacterial populations in question, we used a variety of hierarchical clustering methods (single, average and complete) and selected the optimal number of clusters based on the consensus of the silhouette, Dunn2, and McClain–Rao internal validation indices. External validation was done by calculating the concordance with the WGS-based case definition (SNP-address) for this outbreak using the Fowlkes–Mallows index. Our analysis indicates that with complete-linkage hierarchical clustering combined with the optimal number of clusters, as defined by three internal validity indices, the six different allele- and SNP-based typing workflows generate clusters with similar compositions. Furthermore, we show that even in the absence of coordinated typing procedures, but by using an unsupervised machine learning methodology for cluster delineation, the various workflows that are currently in use by six European public-health authorities can identify concordant clusters of genetically related S. enterica serovar Enteritidis isolates; thus, providing public-health researchers with comparable tools for detection of infectious-disease outbreaks.
format Online
Article
Text
id pubmed-7200063
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-7200063 2020-05-06 Microb Genom Research Article Microbiology Society 2020-02-26 /pmc/articles/PMC7200063/ /pubmed/32101514 http://dx.doi.org/10.1099/mgen.0.000318 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution NonCommercial License.
title Concordance of SNP- and allele-based typing workflows in the context of a large-scale international Salmonella Enteritidis outbreak investigation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7200063/
https://www.ncbi.nlm.nih.gov/pubmed/32101514
http://dx.doi.org/10.1099/mgen.0.000318
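The comparison described in the abstract (complete-linkage hierarchical clustering of each workflow's distance matrix, followed by a Fowlkes–Mallows concordance check) can be illustrated with a toy sketch. This is not the authors' pipeline: the two 5-isolate distance matrices (`snp`, `allele`) are invented for illustration, the function names are hypothetical, and the number of clusters `k` is fixed by hand rather than chosen by the silhouette, Dunn2 and McClain–Rao indices used in the study.

```python
from itertools import combinations
from math import sqrt


def complete_linkage(dist, k):
    """Agglomerative clustering with complete linkage on a symmetric
    distance matrix (list of lists); merging stops at k clusters."""
    clusters = [[i] for i in range(len(dist))]
    while len(clusters) > k:
        best = None
        for a, b in combinations(range(len(clusters)), 2):
            # Complete linkage: cluster distance is the largest pairwise distance.
            d = max(dist[i][j] for i in clusters[a] for j in clusters[b])
            if best is None or d < best[0]:
                best = (d, a, b)
        _, a, b = best
        clusters[a] += clusters[b]  # a < b, so index a stays valid after del
        del clusters[b]
    labels = [0] * len(dist)
    for label, members in enumerate(clusters):
        for i in members:
            labels[i] = label
    return labels


def fowlkes_mallows(labels_a, labels_b):
    """FM = TP / sqrt((TP + FP) * (TP + FN)), counted over all isolate pairs."""
    tp = fp = fn = 0
    for i, j in combinations(range(len(labels_a)), 2):
        same_a = labels_a[i] == labels_a[j]
        same_b = labels_b[i] == labels_b[j]
        tp += same_a and same_b
        fp += same_a and not same_b
        fn += same_b and not same_a
    if tp == 0:
        return 0.0
    return tp / sqrt((tp + fp) * (tp + fn))


# Hypothetical pairwise distances for five isolates from two workflows.
snp = [
    [0, 1, 2, 30, 31],
    [1, 0, 1, 29, 30],
    [2, 1, 0, 31, 32],
    [30, 29, 31, 0, 3],
    [31, 30, 32, 3, 0],
]
allele = [
    [0, 0, 1, 12, 13],
    [0, 0, 1, 12, 12],
    [1, 1, 0, 13, 14],
    [12, 12, 13, 0, 2],
    [13, 12, 14, 2, 0],
]

snp_labels = complete_linkage(snp, k=2)
allele_labels = complete_linkage(allele, k=2)
print(snp_labels, allele_labels, fowlkes_mallows(snp_labels, allele_labels))
```

In this toy case both matrices separate isolates 0 to 2 from isolates 3 and 4, so the two partitions agree on every pair even though the absolute SNP and allele distances differ in scale. That is the kind of agreement the Fowlkes–Mallows index is meant to capture.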