Cargando…

Capturing variation in metagenomic assembly graphs with MetaCortex

MOTIVATION: The assembly of contiguous sequence from metagenomic samples presents a particular challenge, due to the presence of multiple species, often closely related, at varying levels of abundance. Capturing diversity within species, for example, viral haplotypes, or bacterial strain-level diver...

Descripción completa

Detalles Bibliográficos
Autores principales: Martin, Samuel, Ayling, Martin, Patrono, Livia, Caccamo, Mario, Murcia, Pablo, Leggett, Richard M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889960/
https://www.ncbi.nlm.nih.gov/pubmed/36722204
http://dx.doi.org/10.1093/bioinformatics/btad020
_version_ 1784880848031449088
author Martin, Samuel
Ayling, Martin
Patrono, Livia
Caccamo, Mario
Murcia, Pablo
Leggett, Richard M
author_facet Martin, Samuel
Ayling, Martin
Patrono, Livia
Caccamo, Mario
Murcia, Pablo
Leggett, Richard M
author_sort Martin, Samuel
collection PubMed
description MOTIVATION: The assembly of contiguous sequence from metagenomic samples presents a particular challenge, due to the presence of multiple species, often closely related, at varying levels of abundance. Capturing diversity within species, for example, viral haplotypes, or bacterial strain-level diversity, is even more challenging. RESULTS: We present MetaCortex, a metagenome assembler that captures intra-species diversity by searching for signatures of local variation along assembled sequences in the underlying assembly graph and outputting these sequences in sequence graph format. We show that MetaCortex produces accurate assemblies with higher genome coverage and contiguity than other popular metagenomic assemblers on mock viral communities with high levels of strain-level diversity and on simulated communities containing simulated strains. AVAILABILITY AND IMPLEMENTATION: Source code is freely available to download from https://github.com/SR-Martin/metacortex, is implemented in C and supported on MacOS and Linux. The version used for the results presented in this article is available at doi.org/10.5281/zenodo.7273627. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9889960
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98899602023-02-02 Capturing variation in metagenomic assembly graphs with MetaCortex Martin, Samuel Ayling, Martin Patrono, Livia Caccamo, Mario Murcia, Pablo Leggett, Richard M Bioinformatics Original Paper MOTIVATION: The assembly of contiguous sequence from metagenomic samples presents a particular challenge, due to the presence of multiple species, often closely related, at varying levels of abundance. Capturing diversity within species, for example, viral haplotypes, or bacterial strain-level diversity, is even more challenging. RESULTS: We present MetaCortex, a metagenome assembler that captures intra-species diversity by searching for signatures of local variation along assembled sequences in the underlying assembly graph and outputting these sequences in sequence graph format. We show that MetaCortex produces accurate assemblies with higher genome coverage and contiguity than other popular metagenomic assemblers on mock viral communities with high levels of strain-level diversity and on simulated communities containing simulated strains. AVAILABILITY AND IMPLEMENTATION: Source code is freely available to download from https://github.com/SR-Martin/metacortex, is implemented in C and supported on MacOS and Linux. The version used for the results presented in this article is available at doi.org/10.5281/zenodo.7273627. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2023-01-12 /pmc/articles/PMC9889960/ /pubmed/36722204 http://dx.doi.org/10.1093/bioinformatics/btad020 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Martin, Samuel
Ayling, Martin
Patrono, Livia
Caccamo, Mario
Murcia, Pablo
Leggett, Richard M
Capturing variation in metagenomic assembly graphs with MetaCortex
title Capturing variation in metagenomic assembly graphs with MetaCortex
title_full Capturing variation in metagenomic assembly graphs with MetaCortex
title_fullStr Capturing variation in metagenomic assembly graphs with MetaCortex
title_full_unstemmed Capturing variation in metagenomic assembly graphs with MetaCortex
title_short Capturing variation in metagenomic assembly graphs with MetaCortex
title_sort capturing variation in metagenomic assembly graphs with metacortex
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889960/
https://www.ncbi.nlm.nih.gov/pubmed/36722204
http://dx.doi.org/10.1093/bioinformatics/btad020
work_keys_str_mv AT martinsamuel capturingvariationinmetagenomicassemblygraphswithmetacortex
AT aylingmartin capturingvariationinmetagenomicassemblygraphswithmetacortex
AT patronolivia capturingvariationinmetagenomicassemblygraphswithmetacortex
AT caccamomario capturingvariationinmetagenomicassemblygraphswithmetacortex
AT murciapablo capturingvariationinmetagenomicassemblygraphswithmetacortex
AT leggettrichardm capturingvariationinmetagenomicassemblygraphswithmetacortex