Cargando…

Control of Artifactual Variation in Reported Intersample Relatedness during Clinical Use of a Mycobacterium tuberculosis Sequencing Pipeline

Contact tracing requires reliable identification of closely related bacterial isolates. When we noticed the reporting of artifactual variation between Mycobacterium tuberculosis isolates during routine next-generation sequencing of Mycobacterium spp., we investigated its basis in 2,018 consecutive M...

Descripción completa

Detalles Bibliográficos
Autores principales: Wyllie, David H., Sanderson, Nicholas, Myers, Richard, Peto, Tim, Robinson, Esther, Crook, Derrick W., Smith, E. Grace, Walker, A. Sarah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6062814/
https://www.ncbi.nlm.nih.gov/pubmed/29875188
http://dx.doi.org/10.1128/JCM.00104-18
_version_ 1783342437937184768
author Wyllie, David H.
Sanderson, Nicholas
Myers, Richard
Peto, Tim
Robinson, Esther
Crook, Derrick W.
Smith, E. Grace
Walker, A. Sarah
author_facet Wyllie, David H.
Sanderson, Nicholas
Myers, Richard
Peto, Tim
Robinson, Esther
Crook, Derrick W.
Smith, E. Grace
Walker, A. Sarah
author_sort Wyllie, David H.
collection PubMed
description Contact tracing requires reliable identification of closely related bacterial isolates. When we noticed the reporting of artifactual variation between Mycobacterium tuberculosis isolates during routine next-generation sequencing of Mycobacterium spp., we investigated its basis in 2,018 consecutive M. tuberculosis isolates. In the routine process used, clinical samples were decontaminated and inoculated into broth cultures; from positive broth cultures DNA was extracted and sequenced, reads were mapped, and consensus sequences were determined. We investigated the process of consensus sequence determination, which selects the most common nucleotide at each position. Having determined the high-quality read depth and depth of minor variants across 8,006 M. tuberculosis genomic regions, we quantified the relationship between the minor variant depth and the amount of nonmycobacterial bacterial DNA, which originates from commensal microbes killed during sample decontamination. In the presence of nonmycobacterial bacterial DNA, we found significant increases in minor variant frequencies, of more than 1.5-fold, in 242 regions covering 5.1% of the M. tuberculosis genome. Included within these were four high-variation regions strongly influenced by the amount of nonmycobacterial bacterial DNA. Excluding these four regions from pairwise distance comparisons reduced biologically implausible variation from 5.2% to 0% in an independent validation set derived from 226 individuals. Thus, we demonstrated an approach identifying critical genomic regions contributing to clinically relevant artifactual variation in bacterial similarity searches. The approach described monitors the outputs of the complex multistep laboratory and bioinformatics process, allows periodic process adjustments, and will have application to quality control of routine bacterial genomics.
format Online
Article
Text
id pubmed-6062814
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-60628142018-08-08 Control of Artifactual Variation in Reported Intersample Relatedness during Clinical Use of a Mycobacterium tuberculosis Sequencing Pipeline Wyllie, David H. Sanderson, Nicholas Myers, Richard Peto, Tim Robinson, Esther Crook, Derrick W. Smith, E. Grace Walker, A. Sarah J Clin Microbiol Mycobacteriology and Aerobic Actinomycetes Contact tracing requires reliable identification of closely related bacterial isolates. When we noticed the reporting of artifactual variation between Mycobacterium tuberculosis isolates during routine next-generation sequencing of Mycobacterium spp., we investigated its basis in 2,018 consecutive M. tuberculosis isolates. In the routine process used, clinical samples were decontaminated and inoculated into broth cultures; from positive broth cultures DNA was extracted and sequenced, reads were mapped, and consensus sequences were determined. We investigated the process of consensus sequence determination, which selects the most common nucleotide at each position. Having determined the high-quality read depth and depth of minor variants across 8,006 M. tuberculosis genomic regions, we quantified the relationship between the minor variant depth and the amount of nonmycobacterial bacterial DNA, which originates from commensal microbes killed during sample decontamination. In the presence of nonmycobacterial bacterial DNA, we found significant increases in minor variant frequencies, of more than 1.5-fold, in 242 regions covering 5.1% of the M. tuberculosis genome. Included within these were four high-variation regions strongly influenced by the amount of nonmycobacterial bacterial DNA. Excluding these four regions from pairwise distance comparisons reduced biologically implausible variation from 5.2% to 0% in an independent validation set derived from 226 individuals. Thus, we demonstrated an approach identifying critical genomic regions contributing to clinically relevant artifactual variation in bacterial similarity searches. The approach described monitors the outputs of the complex multistep laboratory and bioinformatics process, allows periodic process adjustments, and will have application to quality control of routine bacterial genomics. American Society for Microbiology 2018-07-26 /pmc/articles/PMC6062814/ /pubmed/29875188 http://dx.doi.org/10.1128/JCM.00104-18 Text en Copyright © 2018 Wyllie et al. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Mycobacteriology and Aerobic Actinomycetes
Wyllie, David H.
Sanderson, Nicholas
Myers, Richard
Peto, Tim
Robinson, Esther
Crook, Derrick W.
Smith, E. Grace
Walker, A. Sarah
Control of Artifactual Variation in Reported Intersample Relatedness during Clinical Use of a Mycobacterium tuberculosis Sequencing Pipeline
title Control of Artifactual Variation in Reported Intersample Relatedness during Clinical Use of a Mycobacterium tuberculosis Sequencing Pipeline
title_full Control of Artifactual Variation in Reported Intersample Relatedness during Clinical Use of a Mycobacterium tuberculosis Sequencing Pipeline
title_fullStr Control of Artifactual Variation in Reported Intersample Relatedness during Clinical Use of a Mycobacterium tuberculosis Sequencing Pipeline
title_full_unstemmed Control of Artifactual Variation in Reported Intersample Relatedness during Clinical Use of a Mycobacterium tuberculosis Sequencing Pipeline
title_short Control of Artifactual Variation in Reported Intersample Relatedness during Clinical Use of a Mycobacterium tuberculosis Sequencing Pipeline
title_sort control of artifactual variation in reported intersample relatedness during clinical use of a mycobacterium tuberculosis sequencing pipeline
topic Mycobacteriology and Aerobic Actinomycetes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6062814/
https://www.ncbi.nlm.nih.gov/pubmed/29875188
http://dx.doi.org/10.1128/JCM.00104-18
work_keys_str_mv AT wylliedavidh controlofartifactualvariationinreportedintersamplerelatednessduringclinicaluseofamycobacteriumtuberculosissequencingpipeline
AT sandersonnicholas controlofartifactualvariationinreportedintersamplerelatednessduringclinicaluseofamycobacteriumtuberculosissequencingpipeline
AT myersrichard controlofartifactualvariationinreportedintersamplerelatednessduringclinicaluseofamycobacteriumtuberculosissequencingpipeline
AT petotim controlofartifactualvariationinreportedintersamplerelatednessduringclinicaluseofamycobacteriumtuberculosissequencingpipeline
AT robinsonesther controlofartifactualvariationinreportedintersamplerelatednessduringclinicaluseofamycobacteriumtuberculosissequencingpipeline
AT crookderrickw controlofartifactualvariationinreportedintersamplerelatednessduringclinicaluseofamycobacteriumtuberculosissequencingpipeline
AT smithegrace controlofartifactualvariationinreportedintersamplerelatednessduringclinicaluseofamycobacteriumtuberculosissequencingpipeline
AT walkerasarah controlofartifactualvariationinreportedintersamplerelatednessduringclinicaluseofamycobacteriumtuberculosissequencingpipeline