Cargando…

BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS

BACKGROUND: Substantial advances in microbiology, molecular evolution and biodiversity have been carried out in recent years thanks to Metagenomics, which allows to unveil the composition and functions of mixed microbial communities in any environmental niche. If the investigation is aimed only at t...

Descripción completa

Detalles Bibliográficos
Autores principales: Fosso, Bruno, Santamaria, Monica, Marzano, Marinella, Alonso-Alemany, Daniel, Valiente, Gabriel, Donvito, Giacinto, Monaco, Alfonso, Notarangelo, Pasquale, Pesole, Graziano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4486701/
https://www.ncbi.nlm.nih.gov/pubmed/26130132
http://dx.doi.org/10.1186/s12859-015-0595-z
_version_ 1782378914699542528
author Fosso, Bruno
Santamaria, Monica
Marzano, Marinella
Alonso-Alemany, Daniel
Valiente, Gabriel
Donvito, Giacinto
Monaco, Alfonso
Notarangelo, Pasquale
Pesole, Graziano
author_facet Fosso, Bruno
Santamaria, Monica
Marzano, Marinella
Alonso-Alemany, Daniel
Valiente, Gabriel
Donvito, Giacinto
Monaco, Alfonso
Notarangelo, Pasquale
Pesole, Graziano
author_sort Fosso, Bruno
collection PubMed
description BACKGROUND: Substantial advances in microbiology, molecular evolution and biodiversity have been carried out in recent years thanks to Metagenomics, which allows to unveil the composition and functions of mixed microbial communities in any environmental niche. If the investigation is aimed only at the microbiome taxonomic structure, a target-based metagenomic approach, here also referred as Meta-barcoding, is generally applied. This approach commonly involves the selective amplification of a species-specific genetic marker (DNA meta-barcode) in the whole taxonomic range of interest and the exploration of its taxon-related variants through High-Throughput Sequencing (HTS) technologies. The accessibility to proper computational systems for the large-scale bioinformatic analysis of HTS data represents, currently, one of the major challenges in advanced Meta-barcoding projects. RESULTS: BioMaS (Bioinformatic analysis of Metagenomic AmpliconS) is a new bioinformatic pipeline designed to support biomolecular researchers involved in taxonomic studies of environmental microbial communities by a completely automated workflow, comprehensive of all the fundamental steps, from raw sequence data upload and cleaning to final taxonomic identification, that are absolutely required in an appropriately designed Meta-barcoding HTS-based experiment. In its current version, BioMaS allows the analysis of both bacterial and fungal environments starting directly from the raw sequencing data from either Roche 454 or Illumina HTS platforms, following two alternative paths, respectively. BioMaS is implemented into a public web service available at https://recasgateway.ba.infn.it/ and is also available in Galaxy at http://galaxy.cloud.ba.infn.it:8080 (only for Illumina data). CONCLUSION: BioMaS is a friendly pipeline for Meta-barcoding HTS data analysis specifically designed for users without particular computing skills. A comparative benchmark, carried out by using a simulated dataset suitably designed to broadly represent the currently known bacterial and fungal world, showed that BioMaS outperforms QIIME and MOTHUR in terms of extent and accuracy of deep taxonomic sequence assignments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0595-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4486701
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44867012015-07-02 BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS Fosso, Bruno Santamaria, Monica Marzano, Marinella Alonso-Alemany, Daniel Valiente, Gabriel Donvito, Giacinto Monaco, Alfonso Notarangelo, Pasquale Pesole, Graziano BMC Bioinformatics Methodology BACKGROUND: Substantial advances in microbiology, molecular evolution and biodiversity have been carried out in recent years thanks to Metagenomics, which allows to unveil the composition and functions of mixed microbial communities in any environmental niche. If the investigation is aimed only at the microbiome taxonomic structure, a target-based metagenomic approach, here also referred as Meta-barcoding, is generally applied. This approach commonly involves the selective amplification of a species-specific genetic marker (DNA meta-barcode) in the whole taxonomic range of interest and the exploration of its taxon-related variants through High-Throughput Sequencing (HTS) technologies. The accessibility to proper computational systems for the large-scale bioinformatic analysis of HTS data represents, currently, one of the major challenges in advanced Meta-barcoding projects. RESULTS: BioMaS (Bioinformatic analysis of Metagenomic AmpliconS) is a new bioinformatic pipeline designed to support biomolecular researchers involved in taxonomic studies of environmental microbial communities by a completely automated workflow, comprehensive of all the fundamental steps, from raw sequence data upload and cleaning to final taxonomic identification, that are absolutely required in an appropriately designed Meta-barcoding HTS-based experiment. In its current version, BioMaS allows the analysis of both bacterial and fungal environments starting directly from the raw sequencing data from either Roche 454 or Illumina HTS platforms, following two alternative paths, respectively. BioMaS is implemented into a public web service available at https://recasgateway.ba.infn.it/ and is also available in Galaxy at http://galaxy.cloud.ba.infn.it:8080 (only for Illumina data). CONCLUSION: BioMaS is a friendly pipeline for Meta-barcoding HTS data analysis specifically designed for users without particular computing skills. A comparative benchmark, carried out by using a simulated dataset suitably designed to broadly represent the currently known bacterial and fungal world, showed that BioMaS outperforms QIIME and MOTHUR in terms of extent and accuracy of deep taxonomic sequence assignments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0595-z) contains supplementary material, which is available to authorized users. BioMed Central 2015-07-01 /pmc/articles/PMC4486701/ /pubmed/26130132 http://dx.doi.org/10.1186/s12859-015-0595-z Text en © Fosso et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Fosso, Bruno
Santamaria, Monica
Marzano, Marinella
Alonso-Alemany, Daniel
Valiente, Gabriel
Donvito, Giacinto
Monaco, Alfonso
Notarangelo, Pasquale
Pesole, Graziano
BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS
title BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS
title_full BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS
title_fullStr BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS
title_full_unstemmed BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS
title_short BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS
title_sort biomas: a modular pipeline for bioinformatic analysis of metagenomic amplicons
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4486701/
https://www.ncbi.nlm.nih.gov/pubmed/26130132
http://dx.doi.org/10.1186/s12859-015-0595-z
work_keys_str_mv AT fossobruno biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons
AT santamariamonica biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons
AT marzanomarinella biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons
AT alonsoalemanydaniel biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons
AT valientegabriel biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons
AT donvitogiacinto biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons
AT monacoalfonso biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons
AT notarangelopasquale biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons
AT pesolegraziano biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons