Cargando…
Hill-based dissimilarity indices and null models for analysis of microbial community assembly
BACKGROUND: High-throughput amplicon sequencing of marker genes, such as the 16S rRNA gene in Bacteria and Archaea, provides a wealth of information about the composition of microbial communities. To quantify differences between samples and draw conclusions about factors affecting community assembly...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7488682/ https://www.ncbi.nlm.nih.gov/pubmed/32917275 http://dx.doi.org/10.1186/s40168-020-00909-7 |
_version_ | 1783581743906816000 |
---|---|
author | Modin, Oskar Liébana, Raquel Saheb-Alam, Soroush Wilén, Britt-Marie Suarez, Carolina Hermansson, Malte Persson, Frank |
author_facet | Modin, Oskar Liébana, Raquel Saheb-Alam, Soroush Wilén, Britt-Marie Suarez, Carolina Hermansson, Malte Persson, Frank |
author_sort | Modin, Oskar |
collection | PubMed |
description | BACKGROUND: High-throughput amplicon sequencing of marker genes, such as the 16S rRNA gene in Bacteria and Archaea, provides a wealth of information about the composition of microbial communities. To quantify differences between samples and draw conclusions about factors affecting community assembly, dissimilarity indices are typically used. However, results are subject to several biases, and data interpretation can be challenging. The Jaccard and Bray-Curtis indices, which are often used to quantify taxonomic dissimilarity, are not necessarily the most logical choices. Instead, we argue that Hill-based indices, which make it possible to systematically investigate the impact of relative abundance on dissimilarity, should be used for robust analysis of data. In combination with a null model, mechanisms of microbial community assembly can be analyzed. Here, we also introduce a new software, qdiv, which enables rapid calculations of Hill-based dissimilarity indices in combination with null models. RESULTS: Using amplicon sequencing data from two experimental systems, aerobic granular sludge (AGS) reactors and microbial fuel cells (MFC), we show that the choice of dissimilarity index can have considerable impact on results and conclusions. High dissimilarity between replicates because of random sampling effects make incidence-based indices less suited for identifying differences between groups of samples. Determining a consensus table based on count tables generated with different bioinformatic pipelines reduced the number of low-abundant, potentially spurious amplicon sequence variants (ASVs) in the data sets, which led to lower dissimilarity between replicates. Analysis with a combination of Hill-based indices and a null model allowed us to show that different ecological mechanisms acted on different fractions of the microbial communities in the experimental systems. CONCLUSIONS: Hill-based indices provide a rational framework for analysis of dissimilarity between microbial community samples. In combination with a null model, the effects of deterministic and stochastic community assembly factors on taxa of different relative abundances can be systematically investigated. Calculations of Hill-based dissimilarity indices in combination with a null model can be done in qdiv, which is freely available as a Python package (https://github.com/omvatten/qdiv). In qdiv, a consensus table can also be determined from several count tables generated with different bioinformatic pipelines. |
format | Online Article Text |
id | pubmed-7488682 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-74886822020-09-16 Hill-based dissimilarity indices and null models for analysis of microbial community assembly Modin, Oskar Liébana, Raquel Saheb-Alam, Soroush Wilén, Britt-Marie Suarez, Carolina Hermansson, Malte Persson, Frank Microbiome Methodology BACKGROUND: High-throughput amplicon sequencing of marker genes, such as the 16S rRNA gene in Bacteria and Archaea, provides a wealth of information about the composition of microbial communities. To quantify differences between samples and draw conclusions about factors affecting community assembly, dissimilarity indices are typically used. However, results are subject to several biases, and data interpretation can be challenging. The Jaccard and Bray-Curtis indices, which are often used to quantify taxonomic dissimilarity, are not necessarily the most logical choices. Instead, we argue that Hill-based indices, which make it possible to systematically investigate the impact of relative abundance on dissimilarity, should be used for robust analysis of data. In combination with a null model, mechanisms of microbial community assembly can be analyzed. Here, we also introduce a new software, qdiv, which enables rapid calculations of Hill-based dissimilarity indices in combination with null models. RESULTS: Using amplicon sequencing data from two experimental systems, aerobic granular sludge (AGS) reactors and microbial fuel cells (MFC), we show that the choice of dissimilarity index can have considerable impact on results and conclusions. High dissimilarity between replicates because of random sampling effects make incidence-based indices less suited for identifying differences between groups of samples. Determining a consensus table based on count tables generated with different bioinformatic pipelines reduced the number of low-abundant, potentially spurious amplicon sequence variants (ASVs) in the data sets, which led to lower dissimilarity between replicates. Analysis with a combination of Hill-based indices and a null model allowed us to show that different ecological mechanisms acted on different fractions of the microbial communities in the experimental systems. CONCLUSIONS: Hill-based indices provide a rational framework for analysis of dissimilarity between microbial community samples. In combination with a null model, the effects of deterministic and stochastic community assembly factors on taxa of different relative abundances can be systematically investigated. Calculations of Hill-based dissimilarity indices in combination with a null model can be done in qdiv, which is freely available as a Python package (https://github.com/omvatten/qdiv). In qdiv, a consensus table can also be determined from several count tables generated with different bioinformatic pipelines. BioMed Central 2020-09-11 /pmc/articles/PMC7488682/ /pubmed/32917275 http://dx.doi.org/10.1186/s40168-020-00909-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Modin, Oskar Liébana, Raquel Saheb-Alam, Soroush Wilén, Britt-Marie Suarez, Carolina Hermansson, Malte Persson, Frank Hill-based dissimilarity indices and null models for analysis of microbial community assembly |
title | Hill-based dissimilarity indices and null models for analysis of microbial community assembly |
title_full | Hill-based dissimilarity indices and null models for analysis of microbial community assembly |
title_fullStr | Hill-based dissimilarity indices and null models for analysis of microbial community assembly |
title_full_unstemmed | Hill-based dissimilarity indices and null models for analysis of microbial community assembly |
title_short | Hill-based dissimilarity indices and null models for analysis of microbial community assembly |
title_sort | hill-based dissimilarity indices and null models for analysis of microbial community assembly |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7488682/ https://www.ncbi.nlm.nih.gov/pubmed/32917275 http://dx.doi.org/10.1186/s40168-020-00909-7 |
work_keys_str_mv | AT modinoskar hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly AT liebanaraquel hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly AT sahebalamsoroush hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly AT wilenbrittmarie hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly AT suarezcarolina hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly AT hermanssonmalte hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly AT perssonfrank hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly |