IDseq—An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring
BACKGROUND: Metagenomic next-generation sequencing (mNGS) has enabled the rapid, unbiased detection and identification of microbes without pathogen-specific reagents, culturing, or a priori knowledge of the microbial landscape. mNGS data analysis requires a series of computationally intensive proces...
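Main authors: | Kalantar, Katrina L; Carvalho, Tiago; de Bourcy, Charles F A; Dimitrov, Boris; Dingle, Greg; Egger, Rebecca; Han, Julie; Holmes, Olivia B; Juan, Yun-Fang; King, Ryan; Kislyuk, Andrey; Lin, Michael F; Mariano, Maria; Morse, Todd; Reynoso, Lucia V; Cruz, David Rissato; Sheu, Jonathan; Tang, Jennifer; Wang, James; Zhang, Mark A; Zhong, Emily; Ahyong, Vida; Lay, Sreyngim; Chea, Sophana; Bohl, Jennifer A; Manning, Jessica E; Tato, Cristina M; DeRisi, Joseph L |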
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press, 2020 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7566497/ https://www.ncbi.nlm.nih.gov/pubmed/33057676 http://dx.doi.org/10.1093/gigascience/giaa111 |
_version_ | 1783596144542089216 |
---|---|
author | Kalantar, Katrina L Carvalho, Tiago de Bourcy, Charles F A Dimitrov, Boris Dingle, Greg Egger, Rebecca Han, Julie Holmes, Olivia B Juan, Yun-Fang King, Ryan Kislyuk, Andrey Lin, Michael F Mariano, Maria Morse, Todd Reynoso, Lucia V Cruz, David Rissato Sheu, Jonathan Tang, Jennifer Wang, James Zhang, Mark A Zhong, Emily Ahyong, Vida Lay, Sreyngim Chea, Sophana Bohl, Jennifer A Manning, Jessica E Tato, Cristina M DeRisi, Joseph L |
author_facet | Kalantar, Katrina L Carvalho, Tiago de Bourcy, Charles F A Dimitrov, Boris Dingle, Greg Egger, Rebecca Han, Julie Holmes, Olivia B Juan, Yun-Fang King, Ryan Kislyuk, Andrey Lin, Michael F Mariano, Maria Morse, Todd Reynoso, Lucia V Cruz, David Rissato Sheu, Jonathan Tang, Jennifer Wang, James Zhang, Mark A Zhong, Emily Ahyong, Vida Lay, Sreyngim Chea, Sophana Bohl, Jennifer A Manning, Jessica E Tato, Cristina M DeRisi, Joseph L |
author_sort | Kalantar, Katrina L |
collection | PubMed |
description | BACKGROUND: Metagenomic next-generation sequencing (mNGS) has enabled the rapid, unbiased detection and identification of microbes without pathogen-specific reagents, culturing, or a priori knowledge of the microbial landscape. mNGS data analysis requires a series of computationally intensive processing steps to accurately determine the microbial composition of a sample. Existing mNGS data analysis tools typically require bioinformatics expertise and access to local server-class hardware resources. For many research laboratories, this presents an obstacle, especially in resource-limited environments. FINDINGS: We present IDseq, an open source cloud-based metagenomics pipeline and service for global pathogen detection and monitoring (https://idseq.net). The IDseq Portal accepts raw mNGS data, performs host and quality filtration steps, then executes an assembly-based alignment pipeline, which results in the assignment of reads and contigs to taxonomic categories. The taxonomic relative abundances are reported and visualized in an easy-to-use web application to facilitate data interpretation and hypothesis generation. Furthermore, IDseq supports environmental background model generation and automatic internal spike-in control recognition, providing statistics that are critical for data interpretation. IDseq was designed with the specific intent of detecting novel pathogens. Here, we benchmark novel virus detection capability using both synthetically evolved viral sequences and real-world samples, including IDseq analysis of a nasopharyngeal swab sample acquired and processed locally in Cambodia from a tourist from Wuhan, China, infected with the recently emergent SARS-CoV-2. CONCLUSION: The IDseq Portal reduces the barrier to entry for mNGS data analysis and enables bench scientists, clinicians, and bioinformaticians to gain insight from mNGS datasets for both known and novel pathogens. |
format | Online Article Text |
id | pubmed-7566497 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-75664972020-10-21 IDseq—An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring Kalantar, Katrina L Carvalho, Tiago de Bourcy, Charles F A Dimitrov, Boris Dingle, Greg Egger, Rebecca Han, Julie Holmes, Olivia B Juan, Yun-Fang King, Ryan Kislyuk, Andrey Lin, Michael F Mariano, Maria Morse, Todd Reynoso, Lucia V Cruz, David Rissato Sheu, Jonathan Tang, Jennifer Wang, James Zhang, Mark A Zhong, Emily Ahyong, Vida Lay, Sreyngim Chea, Sophana Bohl, Jennifer A Manning, Jessica E Tato, Cristina M DeRisi, Joseph L Gigascience Technical Note BACKGROUND: Metagenomic next-generation sequencing (mNGS) has enabled the rapid, unbiased detection and identification of microbes without pathogen-specific reagents, culturing, or a priori knowledge of the microbial landscape. mNGS data analysis requires a series of computationally intensive processing steps to accurately determine the microbial composition of a sample. Existing mNGS data analysis tools typically require bioinformatics expertise and access to local server-class hardware resources. For many research laboratories, this presents an obstacle, especially in resource-limited environments. FINDINGS: We present IDseq, an open source cloud-based metagenomics pipeline and service for global pathogen detection and monitoring (https://idseq.net). The IDseq Portal accepts raw mNGS data, performs host and quality filtration steps, then executes an assembly-based alignment pipeline, which results in the assignment of reads and contigs to taxonomic categories. The taxonomic relative abundances are reported and visualized in an easy-to-use web application to facilitate data interpretation and hypothesis generation. Furthermore, IDseq supports environmental background model generation and automatic internal spike-in control recognition, providing statistics that are critical for data interpretation. 
IDseq was designed with the specific intent of detecting novel pathogens. Here, we benchmark novel virus detection capability using both synthetically evolved viral sequences and real-world samples, including IDseq analysis of a nasopharyngeal swab sample acquired and processed locally in Cambodia from a tourist from Wuhan, China, infected with the recently emergent SARS-CoV-2. CONCLUSION: The IDseq Portal reduces the barrier to entry for mNGS data analysis and enables bench scientists, clinicians, and bioinformaticians to gain insight from mNGS datasets for both known and novel pathogens. Oxford University Press 2020-10-15 /pmc/articles/PMC7566497/ /pubmed/33057676 http://dx.doi.org/10.1093/gigascience/giaa111 Text en © The Author(s) 2020. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Note Kalantar, Katrina L Carvalho, Tiago de Bourcy, Charles F A Dimitrov, Boris Dingle, Greg Egger, Rebecca Han, Julie Holmes, Olivia B Juan, Yun-Fang King, Ryan Kislyuk, Andrey Lin, Michael F Mariano, Maria Morse, Todd Reynoso, Lucia V Cruz, David Rissato Sheu, Jonathan Tang, Jennifer Wang, James Zhang, Mark A Zhong, Emily Ahyong, Vida Lay, Sreyngim Chea, Sophana Bohl, Jennifer A Manning, Jessica E Tato, Cristina M DeRisi, Joseph L IDseq—An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring |
title | IDseq—An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring |
title_full | IDseq—An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring |
title_fullStr | IDseq—An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring |
title_full_unstemmed | IDseq—An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring |
title_short | IDseq—An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring |
title_sort | idseq—an open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7566497/ https://www.ncbi.nlm.nih.gov/pubmed/33057676 http://dx.doi.org/10.1093/gigascience/giaa111 |
work_keys_str_mv | AT kalantarkatrinal idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT carvalhotiago idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT debourcycharlesfa idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT dimitrovboris idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT dinglegreg idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT eggerrebecca idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT hanjulie idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT holmesoliviab idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT juanyunfang idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT kingryan idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT kislyukandrey idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT linmichaelf idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT marianomaria idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT morsetodd idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT reynosoluciav idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT cruzdavidrissato idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT sheujonathan idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT tangjennifer 
idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT wangjames idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT zhangmarka idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT zhongemily idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT ahyongvida idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT laysreyngim idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT cheasophana idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT bohljennifera idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT manningjessicae idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT tatocristinam idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring AT derisijosephl idseqanopensourcecloudbasedpipelineandanalysisserviceformetagenomicpathogendetectionandmonitoring |