Cargando…
SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline
The improvement of sequencing technologies has facilitated generalization of metagenomic sequencing, which has become a standard procedure for analyzing the structure and functionality of microbiomes. Bioinformatic analysis of sequencing results poses a challenge because it involves many different c...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6353838/ https://www.ncbi.nlm.nih.gov/pubmed/30733714 http://dx.doi.org/10.3389/fmicb.2018.03349 |
_version_ | 1783391040074416128 |
---|---|
author | Tamames, Javier Puente-Sánchez, Fernando |
author_facet | Tamames, Javier Puente-Sánchez, Fernando |
author_sort | Tamames, Javier |
collection | PubMed |
description | The improvement of sequencing technologies has facilitated generalization of metagenomic sequencing, which has become a standard procedure for analyzing the structure and functionality of microbiomes. Bioinformatic analysis of sequencing results poses a challenge because it involves many different complex steps. SqueezeMeta is a fully automatic pipeline for metagenomics/metatranscriptomics, covering all steps of the analysis. SqueezeMeta includes multi-metagenome support that enables co-assembly of related metagenomes and retrieval of individual genomes via binning procedures. SqueezeMeta features several unique characteristics: co-assembly procedure or co-assembly of unlimited number of metagenomes via merging of individual assembled metagenomes, both with read mapping for estimation of the abundances of genes in each metagenome. It also includes binning and bin checking for retrieving individual genomes. Internal checks for the assembly and binning steps provide information about the consistency of contigs and bins. Moreover, results are stored in a MySQL database, where they can be easily exported and shared, and can be inspected anywhere using a flexible web interface that allows simple creation of complex queries. We illustrate the potential of SqueezeMeta by analyzing 32 gut metagenomes in a fully automatic way, enabling retrieval of several million genes and several hundreds of genomic bins. One of the motivations in the development of SqueezeMeta was producing a software capable of running in small desktop computers and thus amenable to all users and settings. We were also able to co-assemble two of these metagenomes and complete the full analysis in less than one day using a simple laptop computer. This reveals the capacity of SqueezeMeta to run without high-performance computing infrastructure and in absence of any network connectivity. It is therefore adequate for in situ, real time analysis of metagenomes produced by nanopore sequencing. SqueezeMeta can be downloaded from https://github.com/jtamames/SqueezeMeta. |
format | Online Article Text |
id | pubmed-6353838 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-63538382019-02-07 SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline Tamames, Javier Puente-Sánchez, Fernando Front Microbiol Microbiology The improvement of sequencing technologies has facilitated generalization of metagenomic sequencing, which has become a standard procedure for analyzing the structure and functionality of microbiomes. Bioinformatic analysis of sequencing results poses a challenge because it involves many different complex steps. SqueezeMeta is a fully automatic pipeline for metagenomics/metatranscriptomics, covering all steps of the analysis. SqueezeMeta includes multi-metagenome support that enables co-assembly of related metagenomes and retrieval of individual genomes via binning procedures. SqueezeMeta features several unique characteristics: co-assembly procedure or co-assembly of unlimited number of metagenomes via merging of individual assembled metagenomes, both with read mapping for estimation of the abundances of genes in each metagenome. It also includes binning and bin checking for retrieving individual genomes. Internal checks for the assembly and binning steps provide information about the consistency of contigs and bins. Moreover, results are stored in a MySQL database, where they can be easily exported and shared, and can be inspected anywhere using a flexible web interface that allows simple creation of complex queries. We illustrate the potential of SqueezeMeta by analyzing 32 gut metagenomes in a fully automatic way, enabling retrieval of several million genes and several hundreds of genomic bins. One of the motivations in the development of SqueezeMeta was producing a software capable of running in small desktop computers and thus amenable to all users and settings. We were also able to co-assemble two of these metagenomes and complete the full analysis in less than one day using a simple laptop computer. This reveals the capacity of SqueezeMeta to run without high-performance computing infrastructure and in absence of any network connectivity. It is therefore adequate for in situ, real time analysis of metagenomes produced by nanopore sequencing. SqueezeMeta can be downloaded from https://github.com/jtamames/SqueezeMeta. Frontiers Media S.A. 2019-01-24 /pmc/articles/PMC6353838/ /pubmed/30733714 http://dx.doi.org/10.3389/fmicb.2018.03349 Text en Copyright © 2019 Tamames and Puente-Sánchez. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Microbiology Tamames, Javier Puente-Sánchez, Fernando SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline |
title | SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline |
title_full | SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline |
title_fullStr | SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline |
title_full_unstemmed | SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline |
title_short | SqueezeMeta, A Highly Portable, Fully Automatic Metagenomic Analysis Pipeline |
title_sort | squeezemeta, a highly portable, fully automatic metagenomic analysis pipeline |
topic | Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6353838/ https://www.ncbi.nlm.nih.gov/pubmed/30733714 http://dx.doi.org/10.3389/fmicb.2018.03349 |
work_keys_str_mv | AT tamamesjavier squeezemetaahighlyportablefullyautomaticmetagenomicanalysispipeline AT puentesanchezfernando squeezemetaahighlyportablefullyautomaticmetagenomicanalysispipeline |