Cargando…
MICCA: a complete and accurate software for taxonomic profiling of metagenomic data
The introduction of high throughput sequencing technologies has triggered an increase of the number of studies in which the microbiota of environmental and human samples is characterized through the sequencing of selected marker genes. While experimental protocols have undergone a process of standar...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4649890/ https://www.ncbi.nlm.nih.gov/pubmed/25988396 http://dx.doi.org/10.1038/srep09743 |
_version_ | 1782401439365070848 |
---|---|
author | Albanese, Davide Fontana, Paolo De Filippo, Carlotta Cavalieri, Duccio Donati, Claudio |
author_facet | Albanese, Davide Fontana, Paolo De Filippo, Carlotta Cavalieri, Duccio Donati, Claudio |
author_sort | Albanese, Davide |
collection | PubMed |
description | The introduction of high throughput sequencing technologies has triggered an increase of the number of studies in which the microbiota of environmental and human samples is characterized through the sequencing of selected marker genes. While experimental protocols have undergone a process of standardization that makes them accessible to a large community of scientist, standard and robust data analysis pipelines are still lacking. Here we introduce MICCA, a software pipeline for the processing of amplicon metagenomic datasets that efficiently combines quality filtering, clustering of Operational Taxonomic Units (OTUs), taxonomy assignment and phylogenetic tree inference. MICCA provides accurate results reaching a good compromise among modularity and usability. Moreover, we introduce a de-novo clustering algorithm specifically designed for the inference of Operational Taxonomic Units (OTUs). Tests on real and synthetic datasets shows that thanks to the optimized reads filtering process and to the new clustering algorithm, MICCA provides estimates of the number of OTUs and of other common ecological indices that are more accurate and robust than currently available pipelines. Analysis of public metagenomic datasets shows that the higher consistency of results improves our understanding of the structure of environmental and human associated microbial communities. MICCA is an open source project. |
format | Online Article Text |
id | pubmed-4649890 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-46498902015-11-23 MICCA: a complete and accurate software for taxonomic profiling of metagenomic data Albanese, Davide Fontana, Paolo De Filippo, Carlotta Cavalieri, Duccio Donati, Claudio Sci Rep Article The introduction of high throughput sequencing technologies has triggered an increase of the number of studies in which the microbiota of environmental and human samples is characterized through the sequencing of selected marker genes. While experimental protocols have undergone a process of standardization that makes them accessible to a large community of scientist, standard and robust data analysis pipelines are still lacking. Here we introduce MICCA, a software pipeline for the processing of amplicon metagenomic datasets that efficiently combines quality filtering, clustering of Operational Taxonomic Units (OTUs), taxonomy assignment and phylogenetic tree inference. MICCA provides accurate results reaching a good compromise among modularity and usability. Moreover, we introduce a de-novo clustering algorithm specifically designed for the inference of Operational Taxonomic Units (OTUs). Tests on real and synthetic datasets shows that thanks to the optimized reads filtering process and to the new clustering algorithm, MICCA provides estimates of the number of OTUs and of other common ecological indices that are more accurate and robust than currently available pipelines. Analysis of public metagenomic datasets shows that the higher consistency of results improves our understanding of the structure of environmental and human associated microbial communities. MICCA is an open source project. Nature Publishing Group 2015-05-19 /pmc/articles/PMC4649890/ /pubmed/25988396 http://dx.doi.org/10.1038/srep09743 Text en Copyright © 2015, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Albanese, Davide Fontana, Paolo De Filippo, Carlotta Cavalieri, Duccio Donati, Claudio MICCA: a complete and accurate software for taxonomic profiling of metagenomic data |
title | MICCA: a complete and accurate software for taxonomic profiling of metagenomic data |
title_full | MICCA: a complete and accurate software for taxonomic profiling of metagenomic data |
title_fullStr | MICCA: a complete and accurate software for taxonomic profiling of metagenomic data |
title_full_unstemmed | MICCA: a complete and accurate software for taxonomic profiling of metagenomic data |
title_short | MICCA: a complete and accurate software for taxonomic profiling of metagenomic data |
title_sort | micca: a complete and accurate software for taxonomic profiling of metagenomic data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4649890/ https://www.ncbi.nlm.nih.gov/pubmed/25988396 http://dx.doi.org/10.1038/srep09743 |
work_keys_str_mv | AT albanesedavide miccaacompleteandaccuratesoftwarefortaxonomicprofilingofmetagenomicdata AT fontanapaolo miccaacompleteandaccuratesoftwarefortaxonomicprofilingofmetagenomicdata AT defilippocarlotta miccaacompleteandaccuratesoftwarefortaxonomicprofilingofmetagenomicdata AT cavalieriduccio miccaacompleteandaccuratesoftwarefortaxonomicprofilingofmetagenomicdata AT donaticlaudio miccaacompleteandaccuratesoftwarefortaxonomicprofilingofmetagenomicdata |