Cargando…

MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads

An important step in ‘metagenomics’ analysis is the assembly of multiple genomes from mixed sequence reads of multiple species in a microbial community. Most conventional pipelines use a single-genome assembler with carefully optimized parameters. A limitation of a single-genome assembler for de nov...

Descripción completa

Detalles Bibliográficos
Autores principales: Namiki, Toshiaki, Hachiya, Tsuyoshi, Tanaka, Hideaki, Sakakibara, Yasubumi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3488206/
https://www.ncbi.nlm.nih.gov/pubmed/22821567
http://dx.doi.org/10.1093/nar/gks678
_version_ 1782248581647826944
author Namiki, Toshiaki
Hachiya, Tsuyoshi
Tanaka, Hideaki
Sakakibara, Yasubumi
author_facet Namiki, Toshiaki
Hachiya, Tsuyoshi
Tanaka, Hideaki
Sakakibara, Yasubumi
author_sort Namiki, Toshiaki
collection PubMed
description An important step in ‘metagenomics’ analysis is the assembly of multiple genomes from mixed sequence reads of multiple species in a microbial community. Most conventional pipelines use a single-genome assembler with carefully optimized parameters. A limitation of a single-genome assembler for de novo metagenome assembly is that sequences of highly abundant species are likely misidentified as repeats in a single genome, resulting in a number of small fragmented scaffolds. We extended a single-genome assembler for short reads, known as ‘Velvet’, to metagenome assembly, which we called ‘MetaVelvet’, for mixed short reads of multiple species. Our fundamental concept was to first decompose a de Bruijn graph constructed from mixed short reads into individual sub-graphs, and second, to build scaffolds based on each decomposed de Bruijn sub-graph as an isolate species genome. We made use of two features, the coverage (abundance) difference and graph connectivity, for the decomposition of the de Bruijn graph. For simulated datasets, MetaVelvet succeeded in generating significantly higher N50 scores than any single-genome assemblers. MetaVelvet also reconstructed relatively low-coverage genome sequences as scaffolds. On real datasets of human gut microbial read data, MetaVelvet produced longer scaffolds and increased the number of predicted genes.
format Online
Article
Text
id pubmed-3488206
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-34882062012-11-06 MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads Namiki, Toshiaki Hachiya, Tsuyoshi Tanaka, Hideaki Sakakibara, Yasubumi Nucleic Acids Res Methods Online An important step in ‘metagenomics’ analysis is the assembly of multiple genomes from mixed sequence reads of multiple species in a microbial community. Most conventional pipelines use a single-genome assembler with carefully optimized parameters. A limitation of a single-genome assembler for de novo metagenome assembly is that sequences of highly abundant species are likely misidentified as repeats in a single genome, resulting in a number of small fragmented scaffolds. We extended a single-genome assembler for short reads, known as ‘Velvet’, to metagenome assembly, which we called ‘MetaVelvet’, for mixed short reads of multiple species. Our fundamental concept was to first decompose a de Bruijn graph constructed from mixed short reads into individual sub-graphs, and second, to build scaffolds based on each decomposed de Bruijn sub-graph as an isolate species genome. We made use of two features, the coverage (abundance) difference and graph connectivity, for the decomposition of the de Bruijn graph. For simulated datasets, MetaVelvet succeeded in generating significantly higher N50 scores than any single-genome assemblers. MetaVelvet also reconstructed relatively low-coverage genome sequences as scaffolds. On real datasets of human gut microbial read data, MetaVelvet produced longer scaffolds and increased the number of predicted genes. Oxford University Press 2012-11 2012-07-19 /pmc/articles/PMC3488206/ /pubmed/22821567 http://dx.doi.org/10.1093/nar/gks678 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Namiki, Toshiaki
Hachiya, Tsuyoshi
Tanaka, Hideaki
Sakakibara, Yasubumi
MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads
title MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads
title_full MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads
title_fullStr MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads
title_full_unstemmed MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads
title_short MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads
title_sort metavelvet: an extension of velvet assembler to de novo metagenome assembly from short sequence reads
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3488206/
https://www.ncbi.nlm.nih.gov/pubmed/22821567
http://dx.doi.org/10.1093/nar/gks678
work_keys_str_mv AT namikitoshiaki metavelvetanextensionofvelvetassemblertodenovometagenomeassemblyfromshortsequencereads
AT hachiyatsuyoshi metavelvetanextensionofvelvetassemblertodenovometagenomeassemblyfromshortsequencereads
AT tanakahideaki metavelvetanextensionofvelvetassemblertodenovometagenomeassemblyfromshortsequencereads
AT sakakibarayasubumi metavelvetanextensionofvelvetassemblertodenovometagenomeassemblyfromshortsequencereads