
Evaluation of whole-genome sequence data analysis approaches for short- and long-read sequencing of Mycobacterium tuberculosis

Whole-genome sequencing (WGS) of Mycobacterium tuberculosis (MTB) isolates can be used to get an accurate diagnosis, to guide clinical decision making, to control tuberculosis (TB) and for outbreak investigations. We evaluated the performance of long-read (LR) and/or short-read (SR) sequencing for a...


Bibliographic Details
Main authors: Peker, Nilay, Schuele, Leonard, Kok, Nienke, Terrazos, Miguel, Neuenschwander, Stefan M., de Beer, Jessica, Akkerman, Onno, Peter, Silke, Ramette, Alban, Merker, Matthias, Niemann, Stefan, Couto, Natacha, Sinha, Bhanu, Rossen, John WA
Format: Online Article Text
Language: English
Published: Microbiology Society 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8743536/
https://www.ncbi.nlm.nih.gov/pubmed/34825880
http://dx.doi.org/10.1099/mgen.0.000695
_version_ 1784629923766337536
author Peker, Nilay
Schuele, Leonard
Kok, Nienke
Terrazos, Miguel
Neuenschwander, Stefan M.
de Beer, Jessica
Akkerman, Onno
Peter, Silke
Ramette, Alban
Merker, Matthias
Niemann, Stefan
Couto, Natacha
Sinha, Bhanu
Rossen, John WA
author_facet Peker, Nilay
Schuele, Leonard
Kok, Nienke
Terrazos, Miguel
Neuenschwander, Stefan M.
de Beer, Jessica
Akkerman, Onno
Peter, Silke
Ramette, Alban
Merker, Matthias
Niemann, Stefan
Couto, Natacha
Sinha, Bhanu
Rossen, John WA
author_sort Peker, Nilay
collection PubMed
description Whole-genome sequencing (WGS) of Mycobacterium tuberculosis (MTB) isolates can be used to get an accurate diagnosis, to guide clinical decision making, to control tuberculosis (TB) and for outbreak investigations. We evaluated the performance of long-read (LR) and/or short-read (SR) sequencing for anti-TB drug-resistance prediction using the TBProfiler and Mykrobe tools, the fraction of genome recovery, assembly accuracies and the robustness of two typing approaches based on core-genome SNP (cgSNP) typing and core-genome multi-locus sequence typing (cgMLST). Most of the discrepancies between phenotypic drug-susceptibility testing (DST) and drug-resistance prediction were observed for the first-line drugs rifampicin, isoniazid, pyrazinamide and ethambutol, mainly with LR sequence data. Resistance prediction to second-line drugs made by both TBProfiler and Mykrobe tools with SR- and LR-sequence data were in complete agreement with phenotypic DST except for one isolate. The SR assemblies were more accurate than the LR assemblies, having significantly (P<0.05) fewer indels and mismatches per 100 kbp. However, the hybrid and LR assemblies had slightly higher genome fractions. For LR assemblies, Canu followed by Racon, and Medaka polishing was the most accurate approach. The cgSNP approach, based on either reads or assemblies, was more robust than the cgMLST approach, especially for LR sequence data. In conclusion, anti-TB drug-resistance prediction, particularly with only LR sequence data, remains challenging, especially for first-line drugs. In addition, SR assemblies appear more accurate than LR ones, and reproducible phylogeny can be achieved using cgSNP approaches.
format Online
Article
Text
id pubmed-8743536
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-87435362022-01-10 Evaluation of whole-genome sequence data analysis approaches for short- and long-read sequencing of Mycobacterium tuberculosis Peker, Nilay Schuele, Leonard Kok, Nienke Terrazos, Miguel Neuenschwander, Stefan M. de Beer, Jessica Akkerman, Onno Peter, Silke Ramette, Alban Merker, Matthias Niemann, Stefan Couto, Natacha Sinha, Bhanu Rossen, John WA Microb Genom Research Articles Whole-genome sequencing (WGS) of Mycobacterium tuberculosis (MTB) isolates can be used to get an accurate diagnosis, to guide clinical decision making, to control tuberculosis (TB) and for outbreak investigations. We evaluated the performance of long-read (LR) and/or short-read (SR) sequencing for anti-TB drug-resistance prediction using the TBProfiler and Mykrobe tools, the fraction of genome recovery, assembly accuracies and the robustness of two typing approaches based on core-genome SNP (cgSNP) typing and core-genome multi-locus sequence typing (cgMLST). Most of the discrepancies between phenotypic drug-susceptibility testing (DST) and drug-resistance prediction were observed for the first-line drugs rifampicin, isoniazid, pyrazinamide and ethambutol, mainly with LR sequence data. Resistance prediction to second-line drugs made by both TBProfiler and Mykrobe tools with SR- and LR-sequence data were in complete agreement with phenotypic DST except for one isolate. The SR assemblies were more accurate than the LR assemblies, having significantly (P<0.05) fewer indels and mismatches per 100 kbp. However, the hybrid and LR assemblies had slightly higher genome fractions. For LR assemblies, Canu followed by Racon, and Medaka polishing was the most accurate approach. The cgSNP approach, based on either reads or assemblies, was more robust than the cgMLST approach, especially for LR sequence data. In conclusion, anti-TB drug-resistance prediction, particularly with only LR sequence data, remains challenging, especially for first-line drugs. 
In addition, SR assemblies appear more accurate than LR ones, and reproducible phylogeny can be achieved using cgSNP approaches. Microbiology Society 2021-11-26 /pmc/articles/PMC8743536/ /pubmed/34825880 http://dx.doi.org/10.1099/mgen.0.000695 Text en © 2021 The Authors https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License. This article was made open access via a Publish and Read agreement between the Microbiology Society and the corresponding author’s institution.
spellingShingle Research Articles
Peker, Nilay
Schuele, Leonard
Kok, Nienke
Terrazos, Miguel
Neuenschwander, Stefan M.
de Beer, Jessica
Akkerman, Onno
Peter, Silke
Ramette, Alban
Merker, Matthias
Niemann, Stefan
Couto, Natacha
Sinha, Bhanu
Rossen, John WA
Evaluation of whole-genome sequence data analysis approaches for short- and long-read sequencing of Mycobacterium tuberculosis
title Evaluation of whole-genome sequence data analysis approaches for short- and long-read sequencing of Mycobacterium tuberculosis
title_sort evaluation of whole-genome sequence data analysis approaches for short- and long-read sequencing of mycobacterium tuberculosis
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8743536/
https://www.ncbi.nlm.nih.gov/pubmed/34825880
http://dx.doi.org/10.1099/mgen.0.000695