Cargando…

MICCA: a complete and accurate software for taxonomic profiling of metagenomic data

The introduction of high throughput sequencing technologies has triggered an increase of the number of studies in which the microbiota of environmental and human samples is characterized through the sequencing of selected marker genes. While experimental protocols have undergone a process of standar...

Descripción completa

Detalles Bibliográficos
Autores principales: Albanese, Davide, Fontana, Paolo, De Filippo, Carlotta, Cavalieri, Duccio, Donati, Claudio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4649890/
https://www.ncbi.nlm.nih.gov/pubmed/25988396
http://dx.doi.org/10.1038/srep09743
_version_ 1782401439365070848
author Albanese, Davide
Fontana, Paolo
De Filippo, Carlotta
Cavalieri, Duccio
Donati, Claudio
author_facet Albanese, Davide
Fontana, Paolo
De Filippo, Carlotta
Cavalieri, Duccio
Donati, Claudio
author_sort Albanese, Davide
collection PubMed
description The introduction of high throughput sequencing technologies has triggered an increase of the number of studies in which the microbiota of environmental and human samples is characterized through the sequencing of selected marker genes. While experimental protocols have undergone a process of standardization that makes them accessible to a large community of scientist, standard and robust data analysis pipelines are still lacking. Here we introduce MICCA, a software pipeline for the processing of amplicon metagenomic datasets that efficiently combines quality filtering, clustering of Operational Taxonomic Units (OTUs), taxonomy assignment and phylogenetic tree inference. MICCA provides accurate results reaching a good compromise among modularity and usability. Moreover, we introduce a de-novo clustering algorithm specifically designed for the inference of Operational Taxonomic Units (OTUs). Tests on real and synthetic datasets shows that thanks to the optimized reads filtering process and to the new clustering algorithm, MICCA provides estimates of the number of OTUs and of other common ecological indices that are more accurate and robust than currently available pipelines. Analysis of public metagenomic datasets shows that the higher consistency of results improves our understanding of the structure of environmental and human associated microbial communities. MICCA is an open source project.
format Online
Article
Text
id pubmed-4649890
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-46498902015-11-23 MICCA: a complete and accurate software for taxonomic profiling of metagenomic data Albanese, Davide Fontana, Paolo De Filippo, Carlotta Cavalieri, Duccio Donati, Claudio Sci Rep Article The introduction of high throughput sequencing technologies has triggered an increase of the number of studies in which the microbiota of environmental and human samples is characterized through the sequencing of selected marker genes. While experimental protocols have undergone a process of standardization that makes them accessible to a large community of scientist, standard and robust data analysis pipelines are still lacking. Here we introduce MICCA, a software pipeline for the processing of amplicon metagenomic datasets that efficiently combines quality filtering, clustering of Operational Taxonomic Units (OTUs), taxonomy assignment and phylogenetic tree inference. MICCA provides accurate results reaching a good compromise among modularity and usability. Moreover, we introduce a de-novo clustering algorithm specifically designed for the inference of Operational Taxonomic Units (OTUs). Tests on real and synthetic datasets shows that thanks to the optimized reads filtering process and to the new clustering algorithm, MICCA provides estimates of the number of OTUs and of other common ecological indices that are more accurate and robust than currently available pipelines. Analysis of public metagenomic datasets shows that the higher consistency of results improves our understanding of the structure of environmental and human associated microbial communities. MICCA is an open source project. Nature Publishing Group 2015-05-19 /pmc/articles/PMC4649890/ /pubmed/25988396 http://dx.doi.org/10.1038/srep09743 Text en Copyright © 2015, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Albanese, Davide
Fontana, Paolo
De Filippo, Carlotta
Cavalieri, Duccio
Donati, Claudio
MICCA: a complete and accurate software for taxonomic profiling of metagenomic data
title MICCA: a complete and accurate software for taxonomic profiling of metagenomic data
title_full MICCA: a complete and accurate software for taxonomic profiling of metagenomic data
title_fullStr MICCA: a complete and accurate software for taxonomic profiling of metagenomic data
title_full_unstemmed MICCA: a complete and accurate software for taxonomic profiling of metagenomic data
title_short MICCA: a complete and accurate software for taxonomic profiling of metagenomic data
title_sort micca: a complete and accurate software for taxonomic profiling of metagenomic data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4649890/
https://www.ncbi.nlm.nih.gov/pubmed/25988396
http://dx.doi.org/10.1038/srep09743
work_keys_str_mv AT albanesedavide miccaacompleteandaccuratesoftwarefortaxonomicprofilingofmetagenomicdata
AT fontanapaolo miccaacompleteandaccuratesoftwarefortaxonomicprofilingofmetagenomicdata
AT defilippocarlotta miccaacompleteandaccuratesoftwarefortaxonomicprofilingofmetagenomicdata
AT cavalieriduccio miccaacompleteandaccuratesoftwarefortaxonomicprofilingofmetagenomicdata
AT donaticlaudio miccaacompleteandaccuratesoftwarefortaxonomicprofilingofmetagenomicdata