Cargando…
BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS
BACKGROUND: Substantial advances in microbiology, molecular evolution and biodiversity have been carried out in recent years thanks to Metagenomics, which allows to unveil the composition and functions of mixed microbial communities in any environmental niche. If the investigation is aimed only at t...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4486701/ https://www.ncbi.nlm.nih.gov/pubmed/26130132 http://dx.doi.org/10.1186/s12859-015-0595-z |
_version_ | 1782378914699542528 |
---|---|
author | Fosso, Bruno Santamaria, Monica Marzano, Marinella Alonso-Alemany, Daniel Valiente, Gabriel Donvito, Giacinto Monaco, Alfonso Notarangelo, Pasquale Pesole, Graziano |
author_facet | Fosso, Bruno Santamaria, Monica Marzano, Marinella Alonso-Alemany, Daniel Valiente, Gabriel Donvito, Giacinto Monaco, Alfonso Notarangelo, Pasquale Pesole, Graziano |
author_sort | Fosso, Bruno |
collection | PubMed |
description | BACKGROUND: Substantial advances in microbiology, molecular evolution and biodiversity have been carried out in recent years thanks to Metagenomics, which allows to unveil the composition and functions of mixed microbial communities in any environmental niche. If the investigation is aimed only at the microbiome taxonomic structure, a target-based metagenomic approach, here also referred as Meta-barcoding, is generally applied. This approach commonly involves the selective amplification of a species-specific genetic marker (DNA meta-barcode) in the whole taxonomic range of interest and the exploration of its taxon-related variants through High-Throughput Sequencing (HTS) technologies. The accessibility to proper computational systems for the large-scale bioinformatic analysis of HTS data represents, currently, one of the major challenges in advanced Meta-barcoding projects. RESULTS: BioMaS (Bioinformatic analysis of Metagenomic AmpliconS) is a new bioinformatic pipeline designed to support biomolecular researchers involved in taxonomic studies of environmental microbial communities by a completely automated workflow, comprehensive of all the fundamental steps, from raw sequence data upload and cleaning to final taxonomic identification, that are absolutely required in an appropriately designed Meta-barcoding HTS-based experiment. In its current version, BioMaS allows the analysis of both bacterial and fungal environments starting directly from the raw sequencing data from either Roche 454 or Illumina HTS platforms, following two alternative paths, respectively. BioMaS is implemented into a public web service available at https://recasgateway.ba.infn.it/ and is also available in Galaxy at http://galaxy.cloud.ba.infn.it:8080 (only for Illumina data). CONCLUSION: BioMaS is a friendly pipeline for Meta-barcoding HTS data analysis specifically designed for users without particular computing skills. A comparative benchmark, carried out by using a simulated dataset suitably designed to broadly represent the currently known bacterial and fungal world, showed that BioMaS outperforms QIIME and MOTHUR in terms of extent and accuracy of deep taxonomic sequence assignments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0595-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4486701 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44867012015-07-02 BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS Fosso, Bruno Santamaria, Monica Marzano, Marinella Alonso-Alemany, Daniel Valiente, Gabriel Donvito, Giacinto Monaco, Alfonso Notarangelo, Pasquale Pesole, Graziano BMC Bioinformatics Methodology BACKGROUND: Substantial advances in microbiology, molecular evolution and biodiversity have been carried out in recent years thanks to Metagenomics, which allows to unveil the composition and functions of mixed microbial communities in any environmental niche. If the investigation is aimed only at the microbiome taxonomic structure, a target-based metagenomic approach, here also referred as Meta-barcoding, is generally applied. This approach commonly involves the selective amplification of a species-specific genetic marker (DNA meta-barcode) in the whole taxonomic range of interest and the exploration of its taxon-related variants through High-Throughput Sequencing (HTS) technologies. The accessibility to proper computational systems for the large-scale bioinformatic analysis of HTS data represents, currently, one of the major challenges in advanced Meta-barcoding projects. RESULTS: BioMaS (Bioinformatic analysis of Metagenomic AmpliconS) is a new bioinformatic pipeline designed to support biomolecular researchers involved in taxonomic studies of environmental microbial communities by a completely automated workflow, comprehensive of all the fundamental steps, from raw sequence data upload and cleaning to final taxonomic identification, that are absolutely required in an appropriately designed Meta-barcoding HTS-based experiment. In its current version, BioMaS allows the analysis of both bacterial and fungal environments starting directly from the raw sequencing data from either Roche 454 or Illumina HTS platforms, following two alternative paths, respectively. BioMaS is implemented into a public web service available at https://recasgateway.ba.infn.it/ and is also available in Galaxy at http://galaxy.cloud.ba.infn.it:8080 (only for Illumina data). CONCLUSION: BioMaS is a friendly pipeline for Meta-barcoding HTS data analysis specifically designed for users without particular computing skills. A comparative benchmark, carried out by using a simulated dataset suitably designed to broadly represent the currently known bacterial and fungal world, showed that BioMaS outperforms QIIME and MOTHUR in terms of extent and accuracy of deep taxonomic sequence assignments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0595-z) contains supplementary material, which is available to authorized users. BioMed Central 2015-07-01 /pmc/articles/PMC4486701/ /pubmed/26130132 http://dx.doi.org/10.1186/s12859-015-0595-z Text en © Fosso et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Fosso, Bruno Santamaria, Monica Marzano, Marinella Alonso-Alemany, Daniel Valiente, Gabriel Donvito, Giacinto Monaco, Alfonso Notarangelo, Pasquale Pesole, Graziano BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS |
title | BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS |
title_full | BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS |
title_fullStr | BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS |
title_full_unstemmed | BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS |
title_short | BioMaS: a modular pipeline for Bioinformatic analysis of Metagenomic AmpliconS |
title_sort | biomas: a modular pipeline for bioinformatic analysis of metagenomic amplicons |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4486701/ https://www.ncbi.nlm.nih.gov/pubmed/26130132 http://dx.doi.org/10.1186/s12859-015-0595-z |
work_keys_str_mv | AT fossobruno biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons AT santamariamonica biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons AT marzanomarinella biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons AT alonsoalemanydaniel biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons AT valientegabriel biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons AT donvitogiacinto biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons AT monacoalfonso biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons AT notarangelopasquale biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons AT pesolegraziano biomasamodularpipelineforbioinformaticanalysisofmetagenomicamplicons |