Cargando…

SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline

The improvement of sequencing technologies has facilitated generalization of metagenomic sequencing, which has become a standard procedure for analyzing the structure and functionality of microbiomes. Bioinformatic analysis of sequencing results poses a challenge because it involves many different c...

Descripción completa

Detalles Bibliográficos
Autores principales: Tamames, Javier, Puente-Sánchez, Fernando
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6353838/
https://www.ncbi.nlm.nih.gov/pubmed/30733714
http://dx.doi.org/10.3389/fmicb.2018.03349
_version_ 1783391040074416128
author Tamames, Javier
Puente-Sánchez, Fernando
author_facet Tamames, Javier
Puente-Sánchez, Fernando
author_sort Tamames, Javier
collection PubMed
description The improvement of sequencing technologies has facilitated generalization of metagenomic sequencing, which has become a standard procedure for analyzing the structure and functionality of microbiomes. Bioinformatic analysis of sequencing results poses a challenge because it involves many different complex steps. SqueezeMeta is a fully automatic pipeline for metagenomics/metatranscriptomics, covering all steps of the analysis. SqueezeMeta includes multi-metagenome support that enables co-assembly of related metagenomes and retrieval of individual genomes via binning procedures. SqueezeMeta features several unique characteristics: co-assembly procedure or co-assembly of unlimited number of metagenomes via merging of individual assembled metagenomes, both with read mapping for estimation of the abundances of genes in each metagenome. It also includes binning and bin checking for retrieving individual genomes. Internal checks for the assembly and binning steps provide information about the consistency of contigs and bins. Moreover, results are stored in a MySQL database, where they can be easily exported and shared, and can be inspected anywhere using a flexible web interface that allows simple creation of complex queries. We illustrate the potential of SqueezeMeta by analyzing 32 gut metagenomes in a fully automatic way, enabling retrieval of several million genes and several hundreds of genomic bins. One of the motivations in the development of SqueezeMeta was producing a software capable of running in small desktop computers and thus amenable to all users and settings. We were also able to co-assemble two of these metagenomes and complete the full analysis in less than one day using a simple laptop computer. This reveals the capacity of SqueezeMeta to run without high-performance computing infrastructure and in absence of any network connectivity. It is therefore adequate for in situ, real time analysis of metagenomes produced by nanopore sequencing. SqueezeMeta can be downloaded from https://github.com/jtamames/SqueezeMeta.
format Online
Article
Text
id pubmed-6353838
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-63538382019-02-07 SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline Tamames, Javier Puente-Sánchez, Fernando Front Microbiol Microbiology The improvement of sequencing technologies has facilitated generalization of metagenomic sequencing, which has become a standard procedure for analyzing the structure and functionality of microbiomes. Bioinformatic analysis of sequencing results poses a challenge because it involves many different complex steps. SqueezeMeta is a fully automatic pipeline for metagenomics/metatranscriptomics, covering all steps of the analysis. SqueezeMeta includes multi-metagenome support that enables co-assembly of related metagenomes and retrieval of individual genomes via binning procedures. SqueezeMeta features several unique characteristics: co-assembly procedure or co-assembly of unlimited number of metagenomes via merging of individual assembled metagenomes, both with read mapping for estimation of the abundances of genes in each metagenome. It also includes binning and bin checking for retrieving individual genomes. Internal checks for the assembly and binning steps provide information about the consistency of contigs and bins. Moreover, results are stored in a MySQL database, where they can be easily exported and shared, and can be inspected anywhere using a flexible web interface that allows simple creation of complex queries. We illustrate the potential of SqueezeMeta by analyzing 32 gut metagenomes in a fully automatic way, enabling retrieval of several million genes and several hundreds of genomic bins. One of the motivations in the development of SqueezeMeta was producing a software capable of running in small desktop computers and thus amenable to all users and settings. We were also able to co-assemble two of these metagenomes and complete the full analysis in less than one day using a simple laptop computer. This reveals the capacity of SqueezeMeta to run without high-performance computing infrastructure and in absence of any network connectivity. It is therefore adequate for in situ, real time analysis of metagenomes produced by nanopore sequencing. SqueezeMeta can be downloaded from https://github.com/jtamames/SqueezeMeta. Frontiers Media S.A. 2019-01-24 /pmc/articles/PMC6353838/ /pubmed/30733714 http://dx.doi.org/10.3389/fmicb.2018.03349 Text en Copyright © 2019 Tamames and Puente-Sánchez. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Tamames, Javier
Puente-Sánchez, Fernando
SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline
title SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline
title_full SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline
title_fullStr SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline
title_full_unstemmed SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline
title_short SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline
title_sort squeezemeta, a highly portable, fully automatic metagenomic analysis pipeline
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6353838/
https://www.ncbi.nlm.nih.gov/pubmed/30733714
http://dx.doi.org/10.3389/fmicb.2018.03349
work_keys_str_mv AT tamamesjavier squeezemetaahighlyportablefullyautomaticmetagenomicanalysispipeline
AT puentesanchezfernando squeezemetaahighlyportablefullyautomaticmetagenomicanalysispipeline