Cargando…
Capturing variation in metagenomic assembly graphs with MetaCortex
MOTIVATION: The assembly of contiguous sequence from metagenomic samples presents a particular challenge, due to the presence of multiple species, often closely related, at varying levels of abundance. Capturing diversity within species, for example, viral haplotypes, or bacterial strain-level diver...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889960/ https://www.ncbi.nlm.nih.gov/pubmed/36722204 http://dx.doi.org/10.1093/bioinformatics/btad020 |
_version_ | 1784880848031449088 |
---|---|
author | Martin, Samuel Ayling, Martin Patrono, Livia Caccamo, Mario Murcia, Pablo Leggett, Richard M |
author_facet | Martin, Samuel Ayling, Martin Patrono, Livia Caccamo, Mario Murcia, Pablo Leggett, Richard M |
author_sort | Martin, Samuel |
collection | PubMed |
description | MOTIVATION: The assembly of contiguous sequence from metagenomic samples presents a particular challenge, due to the presence of multiple species, often closely related, at varying levels of abundance. Capturing diversity within species, for example, viral haplotypes, or bacterial strain-level diversity, is even more challenging. RESULTS: We present MetaCortex, a metagenome assembler that captures intra-species diversity by searching for signatures of local variation along assembled sequences in the underlying assembly graph and outputting these sequences in sequence graph format. We show that MetaCortex produces accurate assemblies with higher genome coverage and contiguity than other popular metagenomic assemblers on mock viral communities with high levels of strain-level diversity and on simulated communities containing simulated strains. AVAILABILITY AND IMPLEMENTATION: Source code is freely available to download from https://github.com/SR-Martin/metacortex, is implemented in C and supported on MacOS and Linux. The version used for the results presented in this article is available at doi.org/10.5281/zenodo.7273627. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9889960 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98899602023-02-02 Capturing variation in metagenomic assembly graphs with MetaCortex Martin, Samuel Ayling, Martin Patrono, Livia Caccamo, Mario Murcia, Pablo Leggett, Richard M Bioinformatics Original Paper MOTIVATION: The assembly of contiguous sequence from metagenomic samples presents a particular challenge, due to the presence of multiple species, often closely related, at varying levels of abundance. Capturing diversity within species, for example, viral haplotypes, or bacterial strain-level diversity, is even more challenging. RESULTS: We present MetaCortex, a metagenome assembler that captures intra-species diversity by searching for signatures of local variation along assembled sequences in the underlying assembly graph and outputting these sequences in sequence graph format. We show that MetaCortex produces accurate assemblies with higher genome coverage and contiguity than other popular metagenomic assemblers on mock viral communities with high levels of strain-level diversity and on simulated communities containing simulated strains. AVAILABILITY AND IMPLEMENTATION: Source code is freely available to download from https://github.com/SR-Martin/metacortex, is implemented in C and supported on MacOS and Linux. The version used for the results presented in this article is available at doi.org/10.5281/zenodo.7273627. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2023-01-12 /pmc/articles/PMC9889960/ /pubmed/36722204 http://dx.doi.org/10.1093/bioinformatics/btad020 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Martin, Samuel Ayling, Martin Patrono, Livia Caccamo, Mario Murcia, Pablo Leggett, Richard M Capturing variation in metagenomic assembly graphs with MetaCortex |
title | Capturing variation in metagenomic assembly graphs with MetaCortex |
title_full | Capturing variation in metagenomic assembly graphs with MetaCortex |
title_fullStr | Capturing variation in metagenomic assembly graphs with MetaCortex |
title_full_unstemmed | Capturing variation in metagenomic assembly graphs with MetaCortex |
title_short | Capturing variation in metagenomic assembly graphs with MetaCortex |
title_sort | capturing variation in metagenomic assembly graphs with metacortex |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889960/ https://www.ncbi.nlm.nih.gov/pubmed/36722204 http://dx.doi.org/10.1093/bioinformatics/btad020 |
work_keys_str_mv | AT martinsamuel capturingvariationinmetagenomicassemblygraphswithmetacortex AT aylingmartin capturingvariationinmetagenomicassemblygraphswithmetacortex AT patronolivia capturingvariationinmetagenomicassemblygraphswithmetacortex AT caccamomario capturingvariationinmetagenomicassemblygraphswithmetacortex AT murciapablo capturingvariationinmetagenomicassemblygraphswithmetacortex AT leggettrichardm capturingvariationinmetagenomicassemblygraphswithmetacortex |