Cargando…
Comparison of SNP-based subtyping workflows for bacterial isolates using WGS data, applied to Salmonella enterica serotype Typhimurium and serotype 1,4,[5],12:i:-
Whole genome sequencing represents a promising new technology for subtyping of bacterial pathogens. Besides the technological advances which have pushed the approach forward, the last years have been marked by considerable evolution of the whole genome sequencing data analysis methods. Prior to appl...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5800660/ https://www.ncbi.nlm.nih.gov/pubmed/29408896 http://dx.doi.org/10.1371/journal.pone.0192504 |
_version_ | 1783298242620948480 |
---|---|
author | Saltykova, Assia Wuyts, Véronique Mattheus, Wesley Bertrand, Sophie Roosens, Nancy H. C. Marchal, Kathleen De Keersmaecker, Sigrid C. J. |
author_facet | Saltykova, Assia Wuyts, Véronique Mattheus, Wesley Bertrand, Sophie Roosens, Nancy H. C. Marchal, Kathleen De Keersmaecker, Sigrid C. J. |
author_sort | Saltykova, Assia |
collection | PubMed |
description | Whole genome sequencing represents a promising new technology for subtyping of bacterial pathogens. Besides the technological advances which have pushed the approach forward, the last years have been marked by considerable evolution of the whole genome sequencing data analysis methods. Prior to application of the technology as a routine epidemiological typing tool, however, reliable and efficient data analysis strategies need to be identified among the wide variety of the emerged methodologies. In this work, we have compared three existing SNP-based subtyping workflows using a benchmark dataset of 32 Salmonella enterica subsp. enterica serovar Typhimurium and serovar 1,4,[5],12:i:- isolates including five isolates from a confirmed outbreak and three isolates obtained from the same patient at different time points. The analysis was carried out using the original (high-coverage) and a down-sampled (low-coverage) datasets and two different reference genomes. All three tested workflows, namely CSI Phylogeny-based workflow, CFSAN-based workflow and PHEnix-based workflow, were able to correctly group the confirmed outbreak isolates and isolates from the same patient with all combinations of reference genomes and datasets. However, the workflows differed strongly with respect to the SNP distances between isolates and sensitivity towards sequencing coverage, which could be linked to the specific data analysis strategies used therein. To demonstrate the effect of particular data analysis steps, several modifications of the existing workflows were also tested. This allowed us to propose data analysis schemes most suitable for routine SNP-based subtyping applied to S. Typhimurium and S. 1,4,[5],12:i:-. Results presented in this study illustrate the importance of using correct data analysis strategies and to define benchmark and fine-tune parameters applied within routine data analysis pipelines to obtain optimal results. |
format | Online Article Text |
id | pubmed-5800660 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-58006602018-02-23 Comparison of SNP-based subtyping workflows for bacterial isolates using WGS data, applied to Salmonella enterica serotype Typhimurium and serotype 1,4,[5],12:i:- Saltykova, Assia Wuyts, Véronique Mattheus, Wesley Bertrand, Sophie Roosens, Nancy H. C. Marchal, Kathleen De Keersmaecker, Sigrid C. J. PLoS One Research Article Whole genome sequencing represents a promising new technology for subtyping of bacterial pathogens. Besides the technological advances which have pushed the approach forward, the last years have been marked by considerable evolution of the whole genome sequencing data analysis methods. Prior to application of the technology as a routine epidemiological typing tool, however, reliable and efficient data analysis strategies need to be identified among the wide variety of the emerged methodologies. In this work, we have compared three existing SNP-based subtyping workflows using a benchmark dataset of 32 Salmonella enterica subsp. enterica serovar Typhimurium and serovar 1,4,[5],12:i:- isolates including five isolates from a confirmed outbreak and three isolates obtained from the same patient at different time points. The analysis was carried out using the original (high-coverage) and a down-sampled (low-coverage) datasets and two different reference genomes. All three tested workflows, namely CSI Phylogeny-based workflow, CFSAN-based workflow and PHEnix-based workflow, were able to correctly group the confirmed outbreak isolates and isolates from the same patient with all combinations of reference genomes and datasets. However, the workflows differed strongly with respect to the SNP distances between isolates and sensitivity towards sequencing coverage, which could be linked to the specific data analysis strategies used therein. To demonstrate the effect of particular data analysis steps, several modifications of the existing workflows were also tested. This allowed us to propose data analysis schemes most suitable for routine SNP-based subtyping applied to S. Typhimurium and S. 1,4,[5],12:i:-. Results presented in this study illustrate the importance of using correct data analysis strategies and to define benchmark and fine-tune parameters applied within routine data analysis pipelines to obtain optimal results. Public Library of Science 2018-02-06 /pmc/articles/PMC5800660/ /pubmed/29408896 http://dx.doi.org/10.1371/journal.pone.0192504 Text en © 2018 Saltykova et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Saltykova, Assia Wuyts, Véronique Mattheus, Wesley Bertrand, Sophie Roosens, Nancy H. C. Marchal, Kathleen De Keersmaecker, Sigrid C. J. Comparison of SNP-based subtyping workflows for bacterial isolates using WGS data, applied to Salmonella enterica serotype Typhimurium and serotype 1,4,[5],12:i:- |
title | Comparison of SNP-based subtyping workflows for bacterial isolates using WGS data, applied to Salmonella enterica serotype Typhimurium and serotype 1,4,[5],12:i:- |
title_full | Comparison of SNP-based subtyping workflows for bacterial isolates using WGS data, applied to Salmonella enterica serotype Typhimurium and serotype 1,4,[5],12:i:- |
title_fullStr | Comparison of SNP-based subtyping workflows for bacterial isolates using WGS data, applied to Salmonella enterica serotype Typhimurium and serotype 1,4,[5],12:i:- |
title_full_unstemmed | Comparison of SNP-based subtyping workflows for bacterial isolates using WGS data, applied to Salmonella enterica serotype Typhimurium and serotype 1,4,[5],12:i:- |
title_short | Comparison of SNP-based subtyping workflows for bacterial isolates using WGS data, applied to Salmonella enterica serotype Typhimurium and serotype 1,4,[5],12:i:- |
title_sort | comparison of snp-based subtyping workflows for bacterial isolates using wgs data, applied to salmonella enterica serotype typhimurium and serotype 1,4,[5],12:i:- |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5800660/ https://www.ncbi.nlm.nih.gov/pubmed/29408896 http://dx.doi.org/10.1371/journal.pone.0192504 |
work_keys_str_mv | AT saltykovaassia comparisonofsnpbasedsubtypingworkflowsforbacterialisolatesusingwgsdataappliedtosalmonellaentericaserotypetyphimuriumandserotype14512i AT wuytsveronique comparisonofsnpbasedsubtypingworkflowsforbacterialisolatesusingwgsdataappliedtosalmonellaentericaserotypetyphimuriumandserotype14512i AT mattheuswesley comparisonofsnpbasedsubtypingworkflowsforbacterialisolatesusingwgsdataappliedtosalmonellaentericaserotypetyphimuriumandserotype14512i AT bertrandsophie comparisonofsnpbasedsubtypingworkflowsforbacterialisolatesusingwgsdataappliedtosalmonellaentericaserotypetyphimuriumandserotype14512i AT roosensnancyhc comparisonofsnpbasedsubtypingworkflowsforbacterialisolatesusingwgsdataappliedtosalmonellaentericaserotypetyphimuriumandserotype14512i AT marchalkathleen comparisonofsnpbasedsubtypingworkflowsforbacterialisolatesusingwgsdataappliedtosalmonellaentericaserotypetyphimuriumandserotype14512i AT dekeersmaeckersigridcj comparisonofsnpbasedsubtypingworkflowsforbacterialisolatesusingwgsdataappliedtosalmonellaentericaserotypetyphimuriumandserotype14512i |