Cargando…
Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice
Reproducibility is a major issue in microbiome studies, which is partly caused by missing consensus about data analysis strategies. The complex nature of microbiome data, which are high-dimensional, zero-inflated, and compositional, makes them challenging to analyze, as they often violate assumption...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society for Microbiology
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573962/ https://www.ncbi.nlm.nih.gov/pubmed/33622856 http://dx.doi.org/10.1128/mSystems.01154-20 |
_version_ | 1784595524406476800 |
---|---|
author | Kleine Bardenhorst, Sven Berger, Tom Klawonn, Frank Vital, Marius Karch, André Rübsamen, Nicole |
author_facet | Kleine Bardenhorst, Sven Berger, Tom Klawonn, Frank Vital, Marius Karch, André Rübsamen, Nicole |
author_sort | Kleine Bardenhorst, Sven |
collection | PubMed |
description | Reproducibility is a major issue in microbiome studies, which is partly caused by missing consensus about data analysis strategies. The complex nature of microbiome data, which are high-dimensional, zero-inflated, and compositional, makes them challenging to analyze, as they often violate assumptions of classic statistical methods. With advances in human microbiome research, research questions and study designs increase in complexity so that more sophisticated data analysis concepts are applied. To improve current practice of the analysis of microbiome studies, it is important to understand what kind of research questions are asked and which tools are used to answer these questions. We conducted a systematic literature review considering all publications focusing on the analysis of human microbiome data from June 2018 to June 2019. Of 1,444 studies screened, 419 fulfilled the inclusion criteria. Information about research questions, study designs, and analysis strategies were extracted. The results confirmed the expected shift to more advanced research questions, as one-third of the studies analyzed clustered data. Although heterogeneity in the methods used was found at any stage of the analysis process, it was largest for differential abundance testing. Especially if the underlying data structure was clustered, we identified a lack of use of methods that appropriately addressed the underlying data structure while taking into account additional dependencies in the data. Our results confirm considerable heterogeneity in analysis strategies among microbiome studies; increasingly complex research questions require better guidance for analysis strategies. IMPORTANCE The human microbiome has emerged as an important factor in the development of health and disease. Growing interest in this topic has led to an increasing number of studies investigating the human microbiome using high-throughput sequencing methods. However, the development of suitable analytical methods for analyzing microbiome data has not kept pace with the rapid progression in the field. It is crucial to understand current practice to identify the scope for development. Our results highlight the need for an extensive evaluation of the strengths and shortcomings of existing methods in order to guide the choice of proper analysis strategies. We have identified where new methods could be designed to address more advanced research questions while taking into account the complex structure of the data. |
format | Online Article Text |
id | pubmed-8573962 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | American Society for Microbiology |
record_format | MEDLINE/PubMed |
spelling | pubmed-85739622021-11-08 Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice Kleine Bardenhorst, Sven Berger, Tom Klawonn, Frank Vital, Marius Karch, André Rübsamen, Nicole mSystems Research Article Reproducibility is a major issue in microbiome studies, which is partly caused by missing consensus about data analysis strategies. The complex nature of microbiome data, which are high-dimensional, zero-inflated, and compositional, makes them challenging to analyze, as they often violate assumptions of classic statistical methods. With advances in human microbiome research, research questions and study designs increase in complexity so that more sophisticated data analysis concepts are applied. To improve current practice of the analysis of microbiome studies, it is important to understand what kind of research questions are asked and which tools are used to answer these questions. We conducted a systematic literature review considering all publications focusing on the analysis of human microbiome data from June 2018 to June 2019. Of 1,444 studies screened, 419 fulfilled the inclusion criteria. Information about research questions, study designs, and analysis strategies were extracted. The results confirmed the expected shift to more advanced research questions, as one-third of the studies analyzed clustered data. Although heterogeneity in the methods used was found at any stage of the analysis process, it was largest for differential abundance testing. Especially if the underlying data structure was clustered, we identified a lack of use of methods that appropriately addressed the underlying data structure while taking into account additional dependencies in the data. Our results confirm considerable heterogeneity in analysis strategies among microbiome studies; increasingly complex research questions require better guidance for analysis strategies. IMPORTANCE The human microbiome has emerged as an important factor in the development of health and disease. Growing interest in this topic has led to an increasing number of studies investigating the human microbiome using high-throughput sequencing methods. However, the development of suitable analytical methods for analyzing microbiome data has not kept pace with the rapid progression in the field. It is crucial to understand current practice to identify the scope for development. Our results highlight the need for an extensive evaluation of the strengths and shortcomings of existing methods in order to guide the choice of proper analysis strategies. We have identified where new methods could be designed to address more advanced research questions while taking into account the complex structure of the data. American Society for Microbiology 2021-02-23 /pmc/articles/PMC8573962/ /pubmed/33622856 http://dx.doi.org/10.1128/mSystems.01154-20 Text en Copyright © 2021 Kleine Bardenhorst et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Research Article Kleine Bardenhorst, Sven Berger, Tom Klawonn, Frank Vital, Marius Karch, André Rübsamen, Nicole Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice |
title | Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice |
title_full | Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice |
title_fullStr | Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice |
title_full_unstemmed | Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice |
title_short | Data Analysis Strategies for Microbiome Studies in Human Populations—a Systematic Review of Current Practice |
title_sort | data analysis strategies for microbiome studies in human populations—a systematic review of current practice |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573962/ https://www.ncbi.nlm.nih.gov/pubmed/33622856 http://dx.doi.org/10.1128/mSystems.01154-20 |
work_keys_str_mv | AT kleinebardenhorstsven dataanalysisstrategiesformicrobiomestudiesinhumanpopulationsasystematicreviewofcurrentpractice AT bergertom dataanalysisstrategiesformicrobiomestudiesinhumanpopulationsasystematicreviewofcurrentpractice AT klawonnfrank dataanalysisstrategiesformicrobiomestudiesinhumanpopulationsasystematicreviewofcurrentpractice AT vitalmarius dataanalysisstrategiesformicrobiomestudiesinhumanpopulationsasystematicreviewofcurrentpractice AT karchandre dataanalysisstrategiesformicrobiomestudiesinhumanpopulationsasystematicreviewofcurrentpractice AT rubsamennicole dataanalysisstrategiesformicrobiomestudiesinhumanpopulationsasystematicreviewofcurrentpractice |