Cargando…

Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice

Reproducibility is a major issue in microbiome studies, which is partly caused by missing consensus about data analysis strategies. The complex nature of microbiome data, which are high-dimensional, zero-inflated, and compositional, makes them challenging to analyze, as they often violate assumption...

Descripción completa

Detalles Bibliográficos
Autores principales: Kleine Bardenhorst, Sven, Berger, Tom, Klawonn, Frank, Vital, Marius, Karch, André, Rübsamen, Nicole
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573962/
https://www.ncbi.nlm.nih.gov/pubmed/33622856
http://dx.doi.org/10.1128/mSystems.01154-20
_version_ 1784595524406476800
author Kleine Bardenhorst, Sven
Berger, Tom
Klawonn, Frank
Vital, Marius
Karch, André
Rübsamen, Nicole
author_facet Kleine Bardenhorst, Sven
Berger, Tom
Klawonn, Frank
Vital, Marius
Karch, André
Rübsamen, Nicole
author_sort Kleine Bardenhorst, Sven
collection PubMed
description Reproducibility is a major issue in microbiome studies, which is partly caused by missing consensus about data analysis strategies. The complex nature of microbiome data, which are high-dimensional, zero-inflated, and compositional, makes them challenging to analyze, as they often violate assumptions of classic statistical methods. With advances in human microbiome research, research questions and study designs increase in complexity so that more sophisticated data analysis concepts are applied. To improve current practice of the analysis of microbiome studies, it is important to understand what kind of research questions are asked and which tools are used to answer these questions. We conducted a systematic literature review considering all publications focusing on the analysis of human microbiome data from June 2018 to June 2019. Of 1,444 studies screened, 419 fulfilled the inclusion criteria. Information about research questions, study designs, and analysis strategies were extracted. The results confirmed the expected shift to more advanced research questions, as one-third of the studies analyzed clustered data. Although heterogeneity in the methods used was found at any stage of the analysis process, it was largest for differential abundance testing. Especially if the underlying data structure was clustered, we identified a lack of use of methods that appropriately addressed the underlying data structure while taking into account additional dependencies in the data. Our results confirm considerable heterogeneity in analysis strategies among microbiome studies; increasingly complex research questions require better guidance for analysis strategies. IMPORTANCE The human microbiome has emerged as an important factor in the development of health and disease. Growing interest in this topic has led to an increasing number of studies investigating the human microbiome using high-throughput sequencing methods. However, the development of suitable analytical methods for analyzing microbiome data has not kept pace with the rapid progression in the field. It is crucial to understand current practice to identify the scope for development. Our results highlight the need for an extensive evaluation of the strengths and shortcomings of existing methods in order to guide the choice of proper analysis strategies. We have identified where new methods could be designed to address more advanced research questions while taking into account the complex structure of the data.
format Online
Article
Text
id pubmed-8573962
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-85739622021-11-08 Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice Kleine Bardenhorst, Sven Berger, Tom Klawonn, Frank Vital, Marius Karch, André Rübsamen, Nicole mSystems Research Article Reproducibility is a major issue in microbiome studies, which is partly caused by missing consensus about data analysis strategies. The complex nature of microbiome data, which are high-dimensional, zero-inflated, and compositional, makes them challenging to analyze, as they often violate assumptions of classic statistical methods. With advances in human microbiome research, research questions and study designs increase in complexity so that more sophisticated data analysis concepts are applied. To improve current practice of the analysis of microbiome studies, it is important to understand what kind of research questions are asked and which tools are used to answer these questions. We conducted a systematic literature review considering all publications focusing on the analysis of human microbiome data from June 2018 to June 2019. Of 1,444 studies screened, 419 fulfilled the inclusion criteria. Information about research questions, study designs, and analysis strategies were extracted. The results confirmed the expected shift to more advanced research questions, as one-third of the studies analyzed clustered data. Although heterogeneity in the methods used was found at any stage of the analysis process, it was largest for differential abundance testing. Especially if the underlying data structure was clustered, we identified a lack of use of methods that appropriately addressed the underlying data structure while taking into account additional dependencies in the data. Our results confirm considerable heterogeneity in analysis strategies among microbiome studies; increasingly complex research questions require better guidance for analysis strategies. IMPORTANCE The human microbiome has emerged as an important factor in the development of health and disease. Growing interest in this topic has led to an increasing number of studies investigating the human microbiome using high-throughput sequencing methods. However, the development of suitable analytical methods for analyzing microbiome data has not kept pace with the rapid progression in the field. It is crucial to understand current practice to identify the scope for development. Our results highlight the need for an extensive evaluation of the strengths and shortcomings of existing methods in order to guide the choice of proper analysis strategies. We have identified where new methods could be designed to address more advanced research questions while taking into account the complex structure of the data. American Society for Microbiology 2021-02-23 /pmc/articles/PMC8573962/ /pubmed/33622856 http://dx.doi.org/10.1128/mSystems.01154-20 Text en Copyright © 2021 Kleine Bardenhorst et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Kleine Bardenhorst, Sven
Berger, Tom
Klawonn, Frank
Vital, Marius
Karch, André
Rübsamen, Nicole
Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice
title Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice
title_full Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice
title_fullStr Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice
title_full_unstemmed Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice
title_short Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice
title_sort data analysis strategies for microbiome studies in human populations—a systematic review of current practice
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573962/
https://www.ncbi.nlm.nih.gov/pubmed/33622856
http://dx.doi.org/10.1128/mSystems.01154-20
work_keys_str_mv AT kleinebardenhorstsven dataanalysisstrategiesformicrobiomestudiesinhumanpopulationsasystematicreviewofcurrentpractice
AT bergertom dataanalysisstrategiesformicrobiomestudiesinhumanpopulationsasystematicreviewofcurrentpractice
AT klawonnfrank dataanalysisstrategiesformicrobiomestudiesinhumanpopulationsasystematicreviewofcurrentpractice
AT vitalmarius dataanalysisstrategiesformicrobiomestudiesinhumanpopulationsasystematicreviewofcurrentpractice
AT karchandre dataanalysisstrategiesformicrobiomestudiesinhumanpopulationsasystematicreviewofcurrentpractice
AT rubsamennicole dataanalysisstrategiesformicrobiomestudiesinhumanpopulationsasystematicreviewofcurrentpractice