Cargando…

A Bioinformatics Whole-Genome Sequencing Workflow for Clinical Mycobacterium tuberculosis Complex Isolate Analysis, Validated Using a Reference Collection Extensively Characterized with Conventional Methods and In Silico Approaches

The use of whole-genome sequencing (WGS) for routine typing of bacterial isolates has increased substantially in recent years. For Mycobacterium tuberculosis (MTB), in particular, WGS has the benefit of drastically reducing the time required to generate results compared to most conventional phenotyp...

Descripción completa

Detalles Bibliográficos
Autores principales: Bogaerts, Bert, Delcourt, Thomas, Soetaert, Karine, Boarbi, Samira, Ceyssens, Pieter-Jan, Winand, Raf, Van Braekel, Julien, De Keersmaecker, Sigrid C. J., Roosens, Nancy H. C., Marchal, Kathleen, Mathys, Vanessa, Vanneste, Kevin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8316078/
https://www.ncbi.nlm.nih.gov/pubmed/33789960
http://dx.doi.org/10.1128/JCM.00202-21
_version_ 1783729810163367936
author Bogaerts, Bert
Delcourt, Thomas
Soetaert, Karine
Boarbi, Samira
Ceyssens, Pieter-Jan
Winand, Raf
Van Braekel, Julien
De Keersmaecker, Sigrid C. J.
Roosens, Nancy H. C.
Marchal, Kathleen
Mathys, Vanessa
Vanneste, Kevin
author_facet Bogaerts, Bert
Delcourt, Thomas
Soetaert, Karine
Boarbi, Samira
Ceyssens, Pieter-Jan
Winand, Raf
Van Braekel, Julien
De Keersmaecker, Sigrid C. J.
Roosens, Nancy H. C.
Marchal, Kathleen
Mathys, Vanessa
Vanneste, Kevin
author_sort Bogaerts, Bert
collection PubMed
description The use of whole-genome sequencing (WGS) for routine typing of bacterial isolates has increased substantially in recent years. For Mycobacterium tuberculosis (MTB), in particular, WGS has the benefit of drastically reducing the time required to generate results compared to most conventional phenotypic methods. Consequently, a multitude of solutions for analyzing WGS MTB data have been developed, but their successful integration in clinical and national reference laboratories is hindered by the requirement for their validation, for which a consensus framework is still largely absent. We developed a bioinformatics workflow for (Illumina) WGS-based routine typing of MTB complex (MTBC) member isolates allowing complete characterization, including (sub)species confirmation and identification (16S, csb/RD, hsp65), single nucleotide polymorphism (SNP)-based antimicrobial resistance (AMR) prediction, and pathogen typing (spoligotyping, SNP barcoding, and core genome multilocus sequence typing). Workflow performance was validated on a per-assay basis using a collection of 238 in-house-sequenced MTBC isolates, extensively characterized with conventional molecular biology-based approaches supplemented with public data. For SNP-based AMR prediction, results from molecular genotyping methods were supplemented with in silico modified data sets, allowing us to greatly increase the set of evaluated mutations. The workflow demonstrated very high performance with performance metrics of >99% for all assays, except for spoligotyping, where sensitivity dropped to ∼90%. The validation framework for our WGS-based bioinformatics workflow can aid in the standardization of bioinformatics tools by the MTB community and other SNP-based applications regardless of the targeted pathogen(s). The bioinformatics workflow is available for academic and nonprofit use through the Galaxy instance of our institute at https://galaxy.sciensano.be.
format Online
Article
Text
id pubmed-8316078
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-83160782021-11-19 A Bioinformatics Whole-Genome Sequencing Workflow for Clinical Mycobacterium tuberculosis Complex Isolate Analysis, Validated Using a Reference Collection Extensively Characterized with Conventional Methods and In Silico Approaches Bogaerts, Bert Delcourt, Thomas Soetaert, Karine Boarbi, Samira Ceyssens, Pieter-Jan Winand, Raf Van Braekel, Julien De Keersmaecker, Sigrid C. J. Roosens, Nancy H. C. Marchal, Kathleen Mathys, Vanessa Vanneste, Kevin J Clin Microbiol Mycobacteriology and Aerobic Actinomycetes The use of whole-genome sequencing (WGS) for routine typing of bacterial isolates has increased substantially in recent years. For Mycobacterium tuberculosis (MTB), in particular, WGS has the benefit of drastically reducing the time required to generate results compared to most conventional phenotypic methods. Consequently, a multitude of solutions for analyzing WGS MTB data have been developed, but their successful integration in clinical and national reference laboratories is hindered by the requirement for their validation, for which a consensus framework is still largely absent. We developed a bioinformatics workflow for (Illumina) WGS-based routine typing of MTB complex (MTBC) member isolates allowing complete characterization, including (sub)species confirmation and identification (16S, csb/RD, hsp65), single nucleotide polymorphism (SNP)-based antimicrobial resistance (AMR) prediction, and pathogen typing (spoligotyping, SNP barcoding, and core genome multilocus sequence typing). Workflow performance was validated on a per-assay basis using a collection of 238 in-house-sequenced MTBC isolates, extensively characterized with conventional molecular biology-based approaches supplemented with public data. For SNP-based AMR prediction, results from molecular genotyping methods were supplemented with in silico modified data sets, allowing us to greatly increase the set of evaluated mutations. The workflow demonstrated very high performance with performance metrics of >99% for all assays, except for spoligotyping, where sensitivity dropped to ∼90%. The validation framework for our WGS-based bioinformatics workflow can aid in the standardization of bioinformatics tools by the MTB community and other SNP-based applications regardless of the targeted pathogen(s). The bioinformatics workflow is available for academic and nonprofit use through the Galaxy instance of our institute at https://galaxy.sciensano.be. American Society for Microbiology 2021-05-19 /pmc/articles/PMC8316078/ /pubmed/33789960 http://dx.doi.org/10.1128/JCM.00202-21 Text en Copyright © 2021 Bogaerts et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Mycobacteriology and Aerobic Actinomycetes
Bogaerts, Bert
Delcourt, Thomas
Soetaert, Karine
Boarbi, Samira
Ceyssens, Pieter-Jan
Winand, Raf
Van Braekel, Julien
De Keersmaecker, Sigrid C. J.
Roosens, Nancy H. C.
Marchal, Kathleen
Mathys, Vanessa
Vanneste, Kevin
A Bioinformatics Whole-Genome Sequencing Workflow for Clinical Mycobacterium tuberculosis Complex Isolate Analysis, Validated Using a Reference Collection Extensively Characterized with Conventional Methods and In Silico Approaches
title A Bioinformatics Whole-Genome Sequencing Workflow for Clinical Mycobacterium tuberculosis Complex Isolate Analysis, Validated Using a Reference Collection Extensively Characterized with Conventional Methods and In Silico Approaches
title_full A Bioinformatics Whole-Genome Sequencing Workflow for Clinical Mycobacterium tuberculosis Complex Isolate Analysis, Validated Using a Reference Collection Extensively Characterized with Conventional Methods and In Silico Approaches
title_fullStr A Bioinformatics Whole-Genome Sequencing Workflow for Clinical Mycobacterium tuberculosis Complex Isolate Analysis, Validated Using a Reference Collection Extensively Characterized with Conventional Methods and In Silico Approaches
title_full_unstemmed A Bioinformatics Whole-Genome Sequencing Workflow for Clinical Mycobacterium tuberculosis Complex Isolate Analysis, Validated Using a Reference Collection Extensively Characterized with Conventional Methods and In Silico Approaches
title_short A Bioinformatics Whole-Genome Sequencing Workflow for Clinical Mycobacterium tuberculosis Complex Isolate Analysis, Validated Using a Reference Collection Extensively Characterized with Conventional Methods and In Silico Approaches
title_sort bioinformatics whole-genome sequencing workflow for clinical mycobacterium tuberculosis complex isolate analysis, validated using a reference collection extensively characterized with conventional methods and in silico approaches
topic Mycobacteriology and Aerobic Actinomycetes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8316078/
https://www.ncbi.nlm.nih.gov/pubmed/33789960
http://dx.doi.org/10.1128/JCM.00202-21
work_keys_str_mv AT bogaertsbert abioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT delcourtthomas abioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT soetaertkarine abioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT boarbisamira abioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT ceyssenspieterjan abioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT winandraf abioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT vanbraekeljulien abioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT dekeersmaeckersigridcj abioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT roosensnancyhc abioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT marchalkathleen abioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT mathysvanessa abioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT vannestekevin abioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT bogaertsbert bioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT delcourtthomas bioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT soetaertkarine bioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT boarbisamira bioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT ceyssenspieterjan bioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT winandraf bioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT vanbraekeljulien bioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT dekeersmaeckersigridcj bioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT roosensnancyhc bioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT marchalkathleen bioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT mathysvanessa bioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches
AT vannestekevin bioinformaticswholegenomesequencingworkflowforclinicalmycobacteriumtuberculosiscomplexisolateanalysisvalidatedusingareferencecollectionextensivelycharacterizedwithconventionalmethodsandinsilicoapproaches