Cargando…
CoMA – an intuitive and user-friendly pipeline for amplicon-sequencing data analysis
In recent years, there has been a veritable boost in next-generation sequencing (NGS) of gene amplicons in biological and medical studies. Huge amounts of data are produced and need to be analyzed adequately. Various online and offline analysis tools are available; however, most of them require exte...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7710066/ https://www.ncbi.nlm.nih.gov/pubmed/33264369 http://dx.doi.org/10.1371/journal.pone.0243241 |
_version_ | 1783617871484551168 |
---|---|
author | Hupfauf, Sebastian Etemadi, Mohammad Fernández-Delgado Juárez, Marina Gómez-Brandón, María Insam, Heribert Podmirseg, Sabine Marie |
author_facet | Hupfauf, Sebastian Etemadi, Mohammad Fernández-Delgado Juárez, Marina Gómez-Brandón, María Insam, Heribert Podmirseg, Sabine Marie |
author_sort | Hupfauf, Sebastian |
collection | PubMed |
description | In recent years, there has been a veritable boost in next-generation sequencing (NGS) of gene amplicons in biological and medical studies. Huge amounts of data are produced and need to be analyzed adequately. Various online and offline analysis tools are available; however, most of them require extensive expertise in computer science or bioinformatics, and often a Linux-based operating system. Here, we introduce “CoMA–Comparative Microbiome Analysis” as a free and intuitive analysis pipeline for amplicon-sequencing data, compatible with any common operating system. Moreover, the tool offers various useful services including data pre-processing, quality checking, clustering to operational taxonomic units (OTUs), taxonomic assignment, data post-processing, data visualization, and statistical appraisal. The workflow results in highly esthetic and publication-ready graphics, as well as output files in standardized formats (e.g. tab-delimited OTU-table, BIOM, NEWICK tree) that can be used for more sophisticated analyses. The CoMA output was validated by a benchmark test, using three mock communities with different sample characteristics (primer set, amplicon length, diversity). The performance was compared with that of Mothur, QIIME and QIIME2-DADA2, popular packages for NGS data analysis. Furthermore, the functionality of CoMA is demonstrated on a practical example, investigating microbial communities from three different soils (grassland, forest, swamp). All tools performed well in the benchmark test and were able to reveal the majority of all genera in the mock communities. Also for the soil samples, the results of CoMA were congruent to those of the other pipelines, in particular when looking at the key microbial players. |
format | Online Article Text |
id | pubmed-7710066 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-77100662020-12-03 CoMA – an intuitive and user-friendly pipeline for amplicon-sequencing data analysis Hupfauf, Sebastian Etemadi, Mohammad Fernández-Delgado Juárez, Marina Gómez-Brandón, María Insam, Heribert Podmirseg, Sabine Marie PLoS One Research Article In recent years, there has been a veritable boost in next-generation sequencing (NGS) of gene amplicons in biological and medical studies. Huge amounts of data are produced and need to be analyzed adequately. Various online and offline analysis tools are available; however, most of them require extensive expertise in computer science or bioinformatics, and often a Linux-based operating system. Here, we introduce “CoMA–Comparative Microbiome Analysis” as a free and intuitive analysis pipeline for amplicon-sequencing data, compatible with any common operating system. Moreover, the tool offers various useful services including data pre-processing, quality checking, clustering to operational taxonomic units (OTUs), taxonomic assignment, data post-processing, data visualization, and statistical appraisal. The workflow results in highly esthetic and publication-ready graphics, as well as output files in standardized formats (e.g. tab-delimited OTU-table, BIOM, NEWICK tree) that can be used for more sophisticated analyses. The CoMA output was validated by a benchmark test, using three mock communities with different sample characteristics (primer set, amplicon length, diversity). The performance was compared with that of Mothur, QIIME and QIIME2-DADA2, popular packages for NGS data analysis. Furthermore, the functionality of CoMA is demonstrated on a practical example, investigating microbial communities from three different soils (grassland, forest, swamp). All tools performed well in the benchmark test and were able to reveal the majority of all genera in the mock communities. Also for the soil samples, the results of CoMA were congruent to those of the other pipelines, in particular when looking at the key microbial players. Public Library of Science 2020-12-02 /pmc/articles/PMC7710066/ /pubmed/33264369 http://dx.doi.org/10.1371/journal.pone.0243241 Text en © 2020 Hupfauf et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Hupfauf, Sebastian Etemadi, Mohammad Fernández-Delgado Juárez, Marina Gómez-Brandón, María Insam, Heribert Podmirseg, Sabine Marie CoMA – an intuitive and user-friendly pipeline for amplicon-sequencing data analysis |
title | CoMA – an intuitive and user-friendly pipeline for amplicon-sequencing data analysis |
title_full | CoMA – an intuitive and user-friendly pipeline for amplicon-sequencing data analysis |
title_fullStr | CoMA – an intuitive and user-friendly pipeline for amplicon-sequencing data analysis |
title_full_unstemmed | CoMA – an intuitive and user-friendly pipeline for amplicon-sequencing data analysis |
title_short | CoMA – an intuitive and user-friendly pipeline for amplicon-sequencing data analysis |
title_sort | coma – an intuitive and user-friendly pipeline for amplicon-sequencing data analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7710066/ https://www.ncbi.nlm.nih.gov/pubmed/33264369 http://dx.doi.org/10.1371/journal.pone.0243241 |
work_keys_str_mv | AT hupfaufsebastian comaanintuitiveanduserfriendlypipelineforampliconsequencingdataanalysis AT etemadimohammad comaanintuitiveanduserfriendlypipelineforampliconsequencingdataanalysis AT fernandezdelgadojuarezmarina comaanintuitiveanduserfriendlypipelineforampliconsequencingdataanalysis AT gomezbrandonmaria comaanintuitiveanduserfriendlypipelineforampliconsequencingdataanalysis AT insamheribert comaanintuitiveanduserfriendlypipelineforampliconsequencingdataanalysis AT podmirsegsabinemarie comaanintuitiveanduserfriendlypipelineforampliconsequencingdataanalysis |