Cargando…

Challenging a bioinformatic tool’s ability to detect microbial contaminants using in silico whole genome sequencing data

High sensitivity methods such as next generation sequencing and polymerase chain reaction (PCR) are adversely impacted by organismal and DNA contaminants. Current methods for detecting contaminants in microbial materials (genomic DNA and cultures) are not sensitive enough and require either a known...

Descripción completa

Detalles Bibliográficos
Autores principales: Olson, Nathan D., Zook, Justin M., Morrow, Jayne B., Lin, Nancy J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5600177/
https://www.ncbi.nlm.nih.gov/pubmed/28924496
http://dx.doi.org/10.7717/peerj.3729
_version_ 1783264199696187392
author Olson, Nathan D.
Zook, Justin M.
Morrow, Jayne B.
Lin, Nancy J.
author_facet Olson, Nathan D.
Zook, Justin M.
Morrow, Jayne B.
Lin, Nancy J.
author_sort Olson, Nathan D.
collection PubMed
description High sensitivity methods such as next generation sequencing and polymerase chain reaction (PCR) are adversely impacted by organismal and DNA contaminants. Current methods for detecting contaminants in microbial materials (genomic DNA and cultures) are not sensitive enough and require either a known or culturable contaminant. Whole genome sequencing (WGS) is a promising approach for detecting contaminants due to its sensitivity and lack of need for a priori assumptions about the contaminant. Prior to applying WGS, we must first understand its limitations for detecting contaminants and potential for false positives. Herein we demonstrate and characterize a WGS-based approach to detect organismal contaminants using an existing metagenomic taxonomic classification algorithm. Simulated WGS datasets from ten genera as individuals and binary mixtures of eight organisms at varying ratios were analyzed to evaluate the role of contaminant concentration and taxonomy on detection. For the individual genomes the false positive contaminants reported depended on the genus, with Staphylococcus, Escherichia, and Shigella having the highest proportion of false positives. For nearly all binary mixtures the contaminant was detected in the in-silico datasets at the equivalent of 1 in 1,000 cells, though F. tularensis was not detected in any of the simulated contaminant mixtures and Y. pestis was only detected at the equivalent of one in 10 cells. Once a WGS method for detecting contaminants is characterized, it can be applied to evaluate microbial material purity, in efforts to ensure that contaminants are characterized in microbial materials used to validate pathogen detection assays, generate genome assemblies for database submission, and benchmark sequencing methods.
format Online
Article
Text
id pubmed-5600177
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-56001772017-09-18 Challenging a bioinformatic tool’s ability to detect microbial contaminants using in silico whole genome sequencing data Olson, Nathan D. Zook, Justin M. Morrow, Jayne B. Lin, Nancy J. PeerJ Bioinformatics High sensitivity methods such as next generation sequencing and polymerase chain reaction (PCR) are adversely impacted by organismal and DNA contaminants. Current methods for detecting contaminants in microbial materials (genomic DNA and cultures) are not sensitive enough and require either a known or culturable contaminant. Whole genome sequencing (WGS) is a promising approach for detecting contaminants due to its sensitivity and lack of need for a priori assumptions about the contaminant. Prior to applying WGS, we must first understand its limitations for detecting contaminants and potential for false positives. Herein we demonstrate and characterize a WGS-based approach to detect organismal contaminants using an existing metagenomic taxonomic classification algorithm. Simulated WGS datasets from ten genera as individuals and binary mixtures of eight organisms at varying ratios were analyzed to evaluate the role of contaminant concentration and taxonomy on detection. For the individual genomes the false positive contaminants reported depended on the genus, with Staphylococcus, Escherichia, and Shigella having the highest proportion of false positives. For nearly all binary mixtures the contaminant was detected in the in-silico datasets at the equivalent of 1 in 1,000 cells, though F. tularensis was not detected in any of the simulated contaminant mixtures and Y. pestis was only detected at the equivalent of one in 10 cells. Once a WGS method for detecting contaminants is characterized, it can be applied to evaluate microbial material purity, in efforts to ensure that contaminants are characterized in microbial materials used to validate pathogen detection assays, generate genome assemblies for database submission, and benchmark sequencing methods. PeerJ Inc. 2017-09-12 /pmc/articles/PMC5600177/ /pubmed/28924496 http://dx.doi.org/10.7717/peerj.3729 Text en http://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, made available under the Creative Commons Public Domain Dedication (http://creativecommons.org/publicdomain/zero/1.0/) . This work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Bioinformatics
Olson, Nathan D.
Zook, Justin M.
Morrow, Jayne B.
Lin, Nancy J.
Challenging a bioinformatic tool’s ability to detect microbial contaminants using in silico whole genome sequencing data
title Challenging a bioinformatic tool’s ability to detect microbial contaminants using in silico whole genome sequencing data
title_full Challenging a bioinformatic tool’s ability to detect microbial contaminants using in silico whole genome sequencing data
title_fullStr Challenging a bioinformatic tool’s ability to detect microbial contaminants using in silico whole genome sequencing data
title_full_unstemmed Challenging a bioinformatic tool’s ability to detect microbial contaminants using in silico whole genome sequencing data
title_short Challenging a bioinformatic tool’s ability to detect microbial contaminants using in silico whole genome sequencing data
title_sort challenging a bioinformatic tool’s ability to detect microbial contaminants using in silico whole genome sequencing data
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5600177/
https://www.ncbi.nlm.nih.gov/pubmed/28924496
http://dx.doi.org/10.7717/peerj.3729
work_keys_str_mv AT olsonnathand challengingabioinformatictoolsabilitytodetectmicrobialcontaminantsusinginsilicowholegenomesequencingdata
AT zookjustinm challengingabioinformatictoolsabilitytodetectmicrobialcontaminantsusinginsilicowholegenomesequencingdata
AT morrowjayneb challengingabioinformatictoolsabilitytodetectmicrobialcontaminantsusinginsilicowholegenomesequencingdata
AT linnancyj challengingabioinformatictoolsabilitytodetectmicrobialcontaminantsusinginsilicowholegenomesequencingdata