Cargando…

MetaDEGalaxy: Galaxy workflow for differential abundance analysis of 16s metagenomic data

Metagenomic sequencing is an increasingly common tool in environmental and biomedical sciences.  While software for detailing the composition of microbial communities using 16S rRNA marker genes is relatively mature, increasingly researchers are interested in identifying changes exhibited within mic...

Descripción completa

Detalles Bibliográficos
Autores principales: Thang, Mike W.C., Chua, Xin-Yi, Price, Gareth, Gorse, Dominique, Field, Matt A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6807170/
https://www.ncbi.nlm.nih.gov/pubmed/31737256
http://dx.doi.org/10.12688/f1000research.18866.2
_version_ 1783461690912800768
author Thang, Mike W.C.
Chua, Xin-Yi
Price, Gareth
Gorse, Dominique
Field, Matt A.
author_facet Thang, Mike W.C.
Chua, Xin-Yi
Price, Gareth
Gorse, Dominique
Field, Matt A.
author_sort Thang, Mike W.C.
collection PubMed
description Metagenomic sequencing is an increasingly common tool in environmental and biomedical sciences.  While software for detailing the composition of microbial communities using 16S rRNA marker genes is relatively mature, increasingly researchers are interested in identifying changes exhibited within microbial communities under differing environmental conditions. In order to gain maximum value from metagenomic sequence data we must improve the existing analysis environment by providing accessible and scalable computational workflows able to generate reproducible results. Here we describe a complete end-to-end open-source metagenomics workflow running within Galaxy for 16S differential abundance analysis. The workflow accepts 454 or Illumina sequence data (either overlapping or non-overlapping paired end reads) and outputs lists of the operational taxonomic unit (OTUs) exhibiting the greatest change under differing conditions. A range of analysis steps and graphing options are available giving users a high-level of control over their data and analyses. Additionally, users are able to input complex sample-specific metadata information which can be incorporated into differential analysis and used for grouping / colouring within graphs.  Detailed tutorials containing sample data and existing workflows are available for three different input types: overlapping and non-overlapping read pairs as well as for pre-generated Biological Observation Matrix (BIOM) files. Using the Galaxy platform we developed MetaDEGalaxy, a complete metagenomics differential abundance analysis workflow. MetaDEGalaxy is designed for bench scientists working with 16S data who are interested in comparative metagenomics.  MetaDEGalaxy builds on momentum within the wider Galaxy metagenomics community with the hope that more tools will be added as existing methods mature.
format Online
Article
Text
id pubmed-6807170
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-68071702019-11-15 MetaDEGalaxy: Galaxy workflow for differential abundance analysis of 16s metagenomic data Thang, Mike W.C. Chua, Xin-Yi Price, Gareth Gorse, Dominique Field, Matt A. F1000Res Software Tool Article Metagenomic sequencing is an increasingly common tool in environmental and biomedical sciences.  While software for detailing the composition of microbial communities using 16S rRNA marker genes is relatively mature, increasingly researchers are interested in identifying changes exhibited within microbial communities under differing environmental conditions. In order to gain maximum value from metagenomic sequence data we must improve the existing analysis environment by providing accessible and scalable computational workflows able to generate reproducible results. Here we describe a complete end-to-end open-source metagenomics workflow running within Galaxy for 16S differential abundance analysis. The workflow accepts 454 or Illumina sequence data (either overlapping or non-overlapping paired end reads) and outputs lists of the operational taxonomic unit (OTUs) exhibiting the greatest change under differing conditions. A range of analysis steps and graphing options are available giving users a high-level of control over their data and analyses. Additionally, users are able to input complex sample-specific metadata information which can be incorporated into differential analysis and used for grouping / colouring within graphs.  Detailed tutorials containing sample data and existing workflows are available for three different input types: overlapping and non-overlapping read pairs as well as for pre-generated Biological Observation Matrix (BIOM) files. Using the Galaxy platform we developed MetaDEGalaxy, a complete metagenomics differential abundance analysis workflow. MetaDEGalaxy is designed for bench scientists working with 16S data who are interested in comparative metagenomics.  MetaDEGalaxy builds on momentum within the wider Galaxy metagenomics community with the hope that more tools will be added as existing methods mature. F1000 Research Limited 2019-10-18 /pmc/articles/PMC6807170/ /pubmed/31737256 http://dx.doi.org/10.12688/f1000research.18866.2 Text en Copyright: © 2019 Thang MWC et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software Tool Article
Thang, Mike W.C.
Chua, Xin-Yi
Price, Gareth
Gorse, Dominique
Field, Matt A.
MetaDEGalaxy: Galaxy workflow for differential abundance analysis of 16s metagenomic data
title MetaDEGalaxy: Galaxy workflow for differential abundance analysis of 16s metagenomic data
title_full MetaDEGalaxy: Galaxy workflow for differential abundance analysis of 16s metagenomic data
title_fullStr MetaDEGalaxy: Galaxy workflow for differential abundance analysis of 16s metagenomic data
title_full_unstemmed MetaDEGalaxy: Galaxy workflow for differential abundance analysis of 16s metagenomic data
title_short MetaDEGalaxy: Galaxy workflow for differential abundance analysis of 16s metagenomic data
title_sort metadegalaxy: galaxy workflow for differential abundance analysis of 16s metagenomic data
topic Software Tool Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6807170/
https://www.ncbi.nlm.nih.gov/pubmed/31737256
http://dx.doi.org/10.12688/f1000research.18866.2
work_keys_str_mv AT thangmikewc metadegalaxygalaxyworkflowfordifferentialabundanceanalysisof16smetagenomicdata
AT chuaxinyi metadegalaxygalaxyworkflowfordifferentialabundanceanalysisof16smetagenomicdata
AT pricegareth metadegalaxygalaxyworkflowfordifferentialabundanceanalysisof16smetagenomicdata
AT gorsedominique metadegalaxygalaxyworkflowfordifferentialabundanceanalysisof16smetagenomicdata
AT fieldmatta metadegalaxygalaxyworkflowfordifferentialabundanceanalysisof16smetagenomicdata