Cargando…
Exodus: sequencing-based pipeline for quantification of pooled variants
SUMMARY: Next-Generation Sequencing is widely used as a tool for identifying and quantifying microorganisms pooled together in either natural or designed samples. However, a prominent obstacle is achieving correct quantification when the pooled microbes are genetically related. In such cases, the ou...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9191209/ https://www.ncbi.nlm.nih.gov/pubmed/35551337 http://dx.doi.org/10.1093/bioinformatics/btac319 |
_version_ | 1784725964091031552 |
---|---|
author | Vainberg-Slutskin, Ilya Kowalsman, Noga Silberberg, Yael Cohen, Tal Gold, Jenia Kario, Edith Weiner, Iddo Gahali-Sass, Inbar Kredo-Russo, Sharon Zak, Naomi B Bassan, Merav |
author_facet | Vainberg-Slutskin, Ilya Kowalsman, Noga Silberberg, Yael Cohen, Tal Gold, Jenia Kario, Edith Weiner, Iddo Gahali-Sass, Inbar Kredo-Russo, Sharon Zak, Naomi B Bassan, Merav |
author_sort | Vainberg-Slutskin, Ilya |
collection | PubMed |
description | SUMMARY: Next-Generation Sequencing is widely used as a tool for identifying and quantifying microorganisms pooled together in either natural or designed samples. However, a prominent obstacle is achieving correct quantification when the pooled microbes are genetically related. In such cases, the outcome mostly depends on the method used for assigning reads to the individual targets. To address this challenge, we have developed Exodus—a reference-based Python algorithm for quantification of genomes, including those that are highly similar, when they are sequenced together in a single mix. To test Exodus’ performance, we generated both empirical and in silico next-generation sequencing data of mixed genomes. When applying Exodus to these data, we observed median error rates varying between 0% and 0.21% as a function of the complexity of the mix. Importantly, no false negatives were recorded, demonstrating that Exodus’ likelihood of missing an existing genome is very low, even if the genome’s relative abundance is low and similar genomes are present in the same mix. Taken together, these data position Exodus as a reliable tool for identifying and quantifying genomes in mixed samples. Exodus is open source and free to use at: https://github.com/ilyavs/exodus. AVAILABILITY AND IMPLEMENTATION: Exodus is implemented in Python within a Snakemake framework. It is available on GitHub alongside a docker containing the required dependencies: https://github.com/ilyavs/exodus. The data underlying this article will be shared on reasonable request to the corresponding author. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9191209 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-91912092022-06-14 Exodus: sequencing-based pipeline for quantification of pooled variants Vainberg-Slutskin, Ilya Kowalsman, Noga Silberberg, Yael Cohen, Tal Gold, Jenia Kario, Edith Weiner, Iddo Gahali-Sass, Inbar Kredo-Russo, Sharon Zak, Naomi B Bassan, Merav Bioinformatics Applications Notes SUMMARY: Next-Generation Sequencing is widely used as a tool for identifying and quantifying microorganisms pooled together in either natural or designed samples. However, a prominent obstacle is achieving correct quantification when the pooled microbes are genetically related. In such cases, the outcome mostly depends on the method used for assigning reads to the individual targets. To address this challenge, we have developed Exodus—a reference-based Python algorithm for quantification of genomes, including those that are highly similar, when they are sequenced together in a single mix. To test Exodus’ performance, we generated both empirical and in silico next-generation sequencing data of mixed genomes. When applying Exodus to these data, we observed median error rates varying between 0% and 0.21% as a function of the complexity of the mix. Importantly, no false negatives were recorded, demonstrating that Exodus’ likelihood of missing an existing genome is very low, even if the genome’s relative abundance is low and similar genomes are present in the same mix. Taken together, these data position Exodus as a reliable tool for identifying and quantifying genomes in mixed samples. Exodus is open source and free to use at: https://github.com/ilyavs/exodus. AVAILABILITY AND IMPLEMENTATION: Exodus is implemented in Python within a Snakemake framework. It is available on GitHub alongside a docker containing the required dependencies: https://github.com/ilyavs/exodus. The data underlying this article will be shared on reasonable request to the corresponding author. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-05-12 /pmc/articles/PMC9191209/ /pubmed/35551337 http://dx.doi.org/10.1093/bioinformatics/btac319 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Notes Vainberg-Slutskin, Ilya Kowalsman, Noga Silberberg, Yael Cohen, Tal Gold, Jenia Kario, Edith Weiner, Iddo Gahali-Sass, Inbar Kredo-Russo, Sharon Zak, Naomi B Bassan, Merav Exodus: sequencing-based pipeline for quantification of pooled variants |
title | Exodus: sequencing-based pipeline for quantification of pooled variants |
title_full | Exodus: sequencing-based pipeline for quantification of pooled variants |
title_fullStr | Exodus: sequencing-based pipeline for quantification of pooled variants |
title_full_unstemmed | Exodus: sequencing-based pipeline for quantification of pooled variants |
title_short | Exodus: sequencing-based pipeline for quantification of pooled variants |
title_sort | exodus: sequencing-based pipeline for quantification of pooled variants |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9191209/ https://www.ncbi.nlm.nih.gov/pubmed/35551337 http://dx.doi.org/10.1093/bioinformatics/btac319 |
work_keys_str_mv | AT vainbergslutskinilya exodussequencingbasedpipelineforquantificationofpooledvariants AT kowalsmannoga exodussequencingbasedpipelineforquantificationofpooledvariants AT silberbergyael exodussequencingbasedpipelineforquantificationofpooledvariants AT cohental exodussequencingbasedpipelineforquantificationofpooledvariants AT goldjenia exodussequencingbasedpipelineforquantificationofpooledvariants AT karioedith exodussequencingbasedpipelineforquantificationofpooledvariants AT weineriddo exodussequencingbasedpipelineforquantificationofpooledvariants AT gahalisassinbar exodussequencingbasedpipelineforquantificationofpooledvariants AT kredorussosharon exodussequencingbasedpipelineforquantificationofpooledvariants AT zaknaomib exodussequencingbasedpipelineforquantificationofpooledvariants AT bassanmerav exodussequencingbasedpipelineforquantificationofpooledvariants |