
IDseq—An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring


Bibliographic Details
Main authors: Kalantar, Katrina L, Carvalho, Tiago, de Bourcy, Charles F A, Dimitrov, Boris, Dingle, Greg, Egger, Rebecca, Han, Julie, Holmes, Olivia B, Juan, Yun-Fang, King, Ryan, Kislyuk, Andrey, Lin, Michael F, Mariano, Maria, Morse, Todd, Reynoso, Lucia V, Cruz, David Rissato, Sheu, Jonathan, Tang, Jennifer, Wang, James, Zhang, Mark A, Zhong, Emily, Ahyong, Vida, Lay, Sreyngim, Chea, Sophana, Bohl, Jennifer A, Manning, Jessica E, Tato, Cristina M, DeRisi, Joseph L
Format: Online Article Text
Language: English
Published: Oxford University Press 2020
Subjects: Technical Note
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7566497/
https://www.ncbi.nlm.nih.gov/pubmed/33057676
http://dx.doi.org/10.1093/gigascience/giaa111
collection PubMed
description BACKGROUND: Metagenomic next-generation sequencing (mNGS) has enabled the rapid, unbiased detection and identification of microbes without pathogen-specific reagents, culturing, or a priori knowledge of the microbial landscape. mNGS data analysis requires a series of computationally intensive processing steps to accurately determine the microbial composition of a sample. Existing mNGS data analysis tools typically require bioinformatics expertise and access to local server-class hardware resources. For many research laboratories, this presents an obstacle, especially in resource-limited environments.
FINDINGS: We present IDseq, an open source cloud-based metagenomics pipeline and service for global pathogen detection and monitoring (https://idseq.net). The IDseq Portal accepts raw mNGS data, performs host and quality filtration steps, then executes an assembly-based alignment pipeline, which results in the assignment of reads and contigs to taxonomic categories. The taxonomic relative abundances are reported and visualized in an easy-to-use web application to facilitate data interpretation and hypothesis generation. Furthermore, IDseq supports environmental background model generation and automatic internal spike-in control recognition, providing statistics that are critical for data interpretation. IDseq was designed with the specific intent of detecting novel pathogens. Here, we benchmark novel virus detection capability using both synthetically evolved viral sequences and real-world samples, including IDseq analysis of a nasopharyngeal swab sample acquired and processed locally in Cambodia from a tourist from Wuhan, China, infected with the recently emergent SARS-CoV-2.
CONCLUSION: The IDseq Portal reduces the barrier to entry for mNGS data analysis and enables bench scientists, clinicians, and bioinformaticians to gain insight from mNGS datasets for both known and novel pathogens.
id pubmed-7566497
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-7566497 2020-10-21 Gigascience Technical Note Oxford University Press 2020-10-15 Text en © The Author(s) 2020. Published by Oxford University Press GigaScience. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
topic Technical Note