Cargando…
A Modular Metagenomics Pipeline Allowing for the Inclusion of Prior Knowledge Using the Example of Anaerobic Digestion
Metagenomics analysis revealing the composition and functional repertoire of complex microbial communities typically relies on large amounts of sequence data. Numerous analysis strategies and computational tools are available for their analysis. Fully integrated automated analysis pipelines such as...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7284732/ https://www.ncbi.nlm.nih.gov/pubmed/32380653 http://dx.doi.org/10.3390/microorganisms8050669 |
_version_ | 1783544536448892928 |
---|---|
author | Becker, Daniela Popp, Denny Harms, Hauke Centler, Florian |
author_facet | Becker, Daniela Popp, Denny Harms, Hauke Centler, Florian |
author_sort | Becker, Daniela |
collection | PubMed |
description | Metagenomics analysis revealing the composition and functional repertoire of complex microbial communities typically relies on large amounts of sequence data. Numerous analysis strategies and computational tools are available for their analysis. Fully integrated automated analysis pipelines such as MG-RAST or MEGAN6 are user-friendly but not designed for integrating specific knowledge on the biological system under study. In order to facilitate the consideration of such knowledge, we introduce a modular, adaptable analysis pipeline combining existing tools. We applied the novel pipeline to simulated mock data sets focusing on anaerobic digestion microbiomes and compare results to those obtained with established automated analysis pipelines. We find that the analysis strategy and choice of tools and parameters have a strong effect on the inferred taxonomic community composition, but not on the inferred functional profile. By including prior knowledge, computational costs can be decreased while improving result accuracy. While automated off-the-shelf analysis pipelines are easy to apply and require no knowledge on the microbial system under study, custom-made pipelines require more preparation time and bioinformatics expertise. This extra effort is minimized by our modular, flexible, custom-made pipeline, which can be adapted to different scenarios and can take available knowledge on the microbial system under study into account. |
format | Online Article Text |
id | pubmed-7284732 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-72847322020-06-15 A Modular Metagenomics Pipeline Allowing for the Inclusion of Prior Knowledge Using the Example of Anaerobic Digestion Becker, Daniela Popp, Denny Harms, Hauke Centler, Florian Microorganisms Article Metagenomics analysis revealing the composition and functional repertoire of complex microbial communities typically relies on large amounts of sequence data. Numerous analysis strategies and computational tools are available for their analysis. Fully integrated automated analysis pipelines such as MG-RAST or MEGAN6 are user-friendly but not designed for integrating specific knowledge on the biological system under study. In order to facilitate the consideration of such knowledge, we introduce a modular, adaptable analysis pipeline combining existing tools. We applied the novel pipeline to simulated mock data sets focusing on anaerobic digestion microbiomes and compare results to those obtained with established automated analysis pipelines. We find that the analysis strategy and choice of tools and parameters have a strong effect on the inferred taxonomic community composition, but not on the inferred functional profile. By including prior knowledge, computational costs can be decreased while improving result accuracy. While automated off-the-shelf analysis pipelines are easy to apply and require no knowledge on the microbial system under study, custom-made pipelines require more preparation time and bioinformatics expertise. This extra effort is minimized by our modular, flexible, custom-made pipeline, which can be adapted to different scenarios and can take available knowledge on the microbial system under study into account. MDPI 2020-05-05 /pmc/articles/PMC7284732/ /pubmed/32380653 http://dx.doi.org/10.3390/microorganisms8050669 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Becker, Daniela Popp, Denny Harms, Hauke Centler, Florian A Modular Metagenomics Pipeline Allowing for the Inclusion of Prior Knowledge Using the Example of Anaerobic Digestion |
title | A Modular Metagenomics Pipeline Allowing for the Inclusion of Prior Knowledge Using the Example of Anaerobic Digestion |
title_full | A Modular Metagenomics Pipeline Allowing for the Inclusion of Prior Knowledge Using the Example of Anaerobic Digestion |
title_fullStr | A Modular Metagenomics Pipeline Allowing for the Inclusion of Prior Knowledge Using the Example of Anaerobic Digestion |
title_full_unstemmed | A Modular Metagenomics Pipeline Allowing for the Inclusion of Prior Knowledge Using the Example of Anaerobic Digestion |
title_short | A Modular Metagenomics Pipeline Allowing for the Inclusion of Prior Knowledge Using the Example of Anaerobic Digestion |
title_sort | modular metagenomics pipeline allowing for the inclusion of prior knowledge using the example of anaerobic digestion |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7284732/ https://www.ncbi.nlm.nih.gov/pubmed/32380653 http://dx.doi.org/10.3390/microorganisms8050669 |
work_keys_str_mv | AT beckerdaniela amodularmetagenomicspipelineallowingfortheinclusionofpriorknowledgeusingtheexampleofanaerobicdigestion AT poppdenny amodularmetagenomicspipelineallowingfortheinclusionofpriorknowledgeusingtheexampleofanaerobicdigestion AT harmshauke amodularmetagenomicspipelineallowingfortheinclusionofpriorknowledgeusingtheexampleofanaerobicdigestion AT centlerflorian amodularmetagenomicspipelineallowingfortheinclusionofpriorknowledgeusingtheexampleofanaerobicdigestion AT beckerdaniela modularmetagenomicspipelineallowingfortheinclusionofpriorknowledgeusingtheexampleofanaerobicdigestion AT poppdenny modularmetagenomicspipelineallowingfortheinclusionofpriorknowledgeusingtheexampleofanaerobicdigestion AT harmshauke modularmetagenomicspipelineallowingfortheinclusionofpriorknowledgeusingtheexampleofanaerobicdigestion AT centlerflorian modularmetagenomicspipelineallowingfortheinclusionofpriorknowledgeusingtheexampleofanaerobicdigestion |