Cargando…

Improved Assemblies Using a Source-Agnostic Pipeline for MetaGenomic Assembly by Merging (MeGAMerge) of Contigs

Assembly of metagenomic samples is a very complex process, with algorithms designed to address sequencing platform-specific issues, (read length, data volume, and/or community complexity), while also faced with genomes that differ greatly in nucleotide compositional biases and in abundance. To addre...

Descripción completa

Detalles Bibliográficos
Autores principales: Scholz, Matthew, Lo, Chien-Chi, Chain, Patrick S. G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4180827/
https://www.ncbi.nlm.nih.gov/pubmed/25270300
http://dx.doi.org/10.1038/srep06480
_version_ 1782337266757140480
author Scholz, Matthew
Lo, Chien-Chi
Chain, Patrick S. G.
author_facet Scholz, Matthew
Lo, Chien-Chi
Chain, Patrick S. G.
author_sort Scholz, Matthew
collection PubMed
description Assembly of metagenomic samples is a very complex process, with algorithms designed to address sequencing platform-specific issues, (read length, data volume, and/or community complexity), while also faced with genomes that differ greatly in nucleotide compositional biases and in abundance. To address these issues, we have developed a post-assembly process: MetaGenomic Assembly by Merging (MeGAMerge). We compare this process to the performance of several assemblers, using both real, and in-silico generated samples of different community composition and complexity. MeGAMerge consistently outperforms individual assembly methods, producing larger contigs with an increased number of predicted genes, without replication of data. MeGAMerge contigs are supported by read mapping and contig alignment data, when using synthetically-derived and real metagenomic data, as well as by gene prediction analyses and similarity searches. MeGAMerge is a flexible method that generates improved metagenome assemblies, with the ability to accommodate upcoming sequencing platforms, as well as present and future assembly algorithms.
format Online
Article
Text
id pubmed-4180827
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-41808272014-10-02 Improved Assemblies Using a Source-Agnostic Pipeline for MetaGenomic Assembly by Merging (MeGAMerge) of Contigs Scholz, Matthew Lo, Chien-Chi Chain, Patrick S. G. Sci Rep Article Assembly of metagenomic samples is a very complex process, with algorithms designed to address sequencing platform-specific issues, (read length, data volume, and/or community complexity), while also faced with genomes that differ greatly in nucleotide compositional biases and in abundance. To address these issues, we have developed a post-assembly process: MetaGenomic Assembly by Merging (MeGAMerge). We compare this process to the performance of several assemblers, using both real, and in-silico generated samples of different community composition and complexity. MeGAMerge consistently outperforms individual assembly methods, producing larger contigs with an increased number of predicted genes, without replication of data. MeGAMerge contigs are supported by read mapping and contig alignment data, when using synthetically-derived and real metagenomic data, as well as by gene prediction analyses and similarity searches. MeGAMerge is a flexible method that generates improved metagenome assemblies, with the ability to accommodate upcoming sequencing platforms, as well as present and future assembly algorithms. Nature Publishing Group 2014-10-01 /pmc/articles/PMC4180827/ /pubmed/25270300 http://dx.doi.org/10.1038/srep06480 Text en Copyright © 2014, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by-nc-sa/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/
spellingShingle Article
Scholz, Matthew
Lo, Chien-Chi
Chain, Patrick S. G.
Improved Assemblies Using a Source-Agnostic Pipeline for MetaGenomic Assembly by Merging (MeGAMerge) of Contigs
title Improved Assemblies Using a Source-Agnostic Pipeline for MetaGenomic Assembly by Merging (MeGAMerge) of Contigs
title_full Improved Assemblies Using a Source-Agnostic Pipeline for MetaGenomic Assembly by Merging (MeGAMerge) of Contigs
title_fullStr Improved Assemblies Using a Source-Agnostic Pipeline for MetaGenomic Assembly by Merging (MeGAMerge) of Contigs
title_full_unstemmed Improved Assemblies Using a Source-Agnostic Pipeline for MetaGenomic Assembly by Merging (MeGAMerge) of Contigs
title_short Improved Assemblies Using a Source-Agnostic Pipeline for MetaGenomic Assembly by Merging (MeGAMerge) of Contigs
title_sort improved assemblies using a source-agnostic pipeline for metagenomic assembly by merging (megamerge) of contigs
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4180827/
https://www.ncbi.nlm.nih.gov/pubmed/25270300
http://dx.doi.org/10.1038/srep06480
work_keys_str_mv AT scholzmatthew improvedassembliesusingasourceagnosticpipelineformetagenomicassemblybymergingmegamergeofcontigs
AT lochienchi improvedassembliesusingasourceagnosticpipelineformetagenomicassemblybymergingmegamergeofcontigs
AT chainpatricksg improvedassembliesusingasourceagnosticpipelineformetagenomicassemblybymergingmegamergeofcontigs