Cargando…

Hill-based dissimilarity indices and null models for analysis of microbial community assembly

BACKGROUND: High-throughput amplicon sequencing of marker genes, such as the 16S rRNA gene in Bacteria and Archaea, provides a wealth of information about the composition of microbial communities. To quantify differences between samples and draw conclusions about factors affecting community assembly...

Descripción completa

Detalles Bibliográficos
Autores principales: Modin, Oskar, Liébana, Raquel, Saheb-Alam, Soroush, Wilén, Britt-Marie, Suarez, Carolina, Hermansson, Malte, Persson, Frank
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7488682/
https://www.ncbi.nlm.nih.gov/pubmed/32917275
http://dx.doi.org/10.1186/s40168-020-00909-7
_version_ 1783581743906816000
author Modin, Oskar
Liébana, Raquel
Saheb-Alam, Soroush
Wilén, Britt-Marie
Suarez, Carolina
Hermansson, Malte
Persson, Frank
author_facet Modin, Oskar
Liébana, Raquel
Saheb-Alam, Soroush
Wilén, Britt-Marie
Suarez, Carolina
Hermansson, Malte
Persson, Frank
author_sort Modin, Oskar
collection PubMed
description BACKGROUND: High-throughput amplicon sequencing of marker genes, such as the 16S rRNA gene in Bacteria and Archaea, provides a wealth of information about the composition of microbial communities. To quantify differences between samples and draw conclusions about factors affecting community assembly, dissimilarity indices are typically used. However, results are subject to several biases, and data interpretation can be challenging. The Jaccard and Bray-Curtis indices, which are often used to quantify taxonomic dissimilarity, are not necessarily the most logical choices. Instead, we argue that Hill-based indices, which make it possible to systematically investigate the impact of relative abundance on dissimilarity, should be used for robust analysis of data. In combination with a null model, mechanisms of microbial community assembly can be analyzed. Here, we also introduce a new software, qdiv, which enables rapid calculations of Hill-based dissimilarity indices in combination with null models. RESULTS: Using amplicon sequencing data from two experimental systems, aerobic granular sludge (AGS) reactors and microbial fuel cells (MFC), we show that the choice of dissimilarity index can have considerable impact on results and conclusions. High dissimilarity between replicates because of random sampling effects make incidence-based indices less suited for identifying differences between groups of samples. Determining a consensus table based on count tables generated with different bioinformatic pipelines reduced the number of low-abundant, potentially spurious amplicon sequence variants (ASVs) in the data sets, which led to lower dissimilarity between replicates. Analysis with a combination of Hill-based indices and a null model allowed us to show that different ecological mechanisms acted on different fractions of the microbial communities in the experimental systems. CONCLUSIONS: Hill-based indices provide a rational framework for analysis of dissimilarity between microbial community samples. In combination with a null model, the effects of deterministic and stochastic community assembly factors on taxa of different relative abundances can be systematically investigated. Calculations of Hill-based dissimilarity indices in combination with a null model can be done in qdiv, which is freely available as a Python package (https://github.com/omvatten/qdiv). In qdiv, a consensus table can also be determined from several count tables generated with different bioinformatic pipelines.
format Online
Article
Text
id pubmed-7488682
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-74886822020-09-16 Hill-based dissimilarity indices and null models for analysis of microbial community assembly Modin, Oskar Liébana, Raquel Saheb-Alam, Soroush Wilén, Britt-Marie Suarez, Carolina Hermansson, Malte Persson, Frank Microbiome Methodology BACKGROUND: High-throughput amplicon sequencing of marker genes, such as the 16S rRNA gene in Bacteria and Archaea, provides a wealth of information about the composition of microbial communities. To quantify differences between samples and draw conclusions about factors affecting community assembly, dissimilarity indices are typically used. However, results are subject to several biases, and data interpretation can be challenging. The Jaccard and Bray-Curtis indices, which are often used to quantify taxonomic dissimilarity, are not necessarily the most logical choices. Instead, we argue that Hill-based indices, which make it possible to systematically investigate the impact of relative abundance on dissimilarity, should be used for robust analysis of data. In combination with a null model, mechanisms of microbial community assembly can be analyzed. Here, we also introduce a new software, qdiv, which enables rapid calculations of Hill-based dissimilarity indices in combination with null models. RESULTS: Using amplicon sequencing data from two experimental systems, aerobic granular sludge (AGS) reactors and microbial fuel cells (MFC), we show that the choice of dissimilarity index can have considerable impact on results and conclusions. High dissimilarity between replicates because of random sampling effects make incidence-based indices less suited for identifying differences between groups of samples. Determining a consensus table based on count tables generated with different bioinformatic pipelines reduced the number of low-abundant, potentially spurious amplicon sequence variants (ASVs) in the data sets, which led to lower dissimilarity between replicates. Analysis with a combination of Hill-based indices and a null model allowed us to show that different ecological mechanisms acted on different fractions of the microbial communities in the experimental systems. CONCLUSIONS: Hill-based indices provide a rational framework for analysis of dissimilarity between microbial community samples. In combination with a null model, the effects of deterministic and stochastic community assembly factors on taxa of different relative abundances can be systematically investigated. Calculations of Hill-based dissimilarity indices in combination with a null model can be done in qdiv, which is freely available as a Python package (https://github.com/omvatten/qdiv). In qdiv, a consensus table can also be determined from several count tables generated with different bioinformatic pipelines. BioMed Central 2020-09-11 /pmc/articles/PMC7488682/ /pubmed/32917275 http://dx.doi.org/10.1186/s40168-020-00909-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology
Modin, Oskar
Liébana, Raquel
Saheb-Alam, Soroush
Wilén, Britt-Marie
Suarez, Carolina
Hermansson, Malte
Persson, Frank
Hill-based dissimilarity indices and null models for analysis of microbial community assembly
title Hill-based dissimilarity indices and null models for analysis of microbial community assembly
title_full Hill-based dissimilarity indices and null models for analysis of microbial community assembly
title_fullStr Hill-based dissimilarity indices and null models for analysis of microbial community assembly
title_full_unstemmed Hill-based dissimilarity indices and null models for analysis of microbial community assembly
title_short Hill-based dissimilarity indices and null models for analysis of microbial community assembly
title_sort hill-based dissimilarity indices and null models for analysis of microbial community assembly
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7488682/
https://www.ncbi.nlm.nih.gov/pubmed/32917275
http://dx.doi.org/10.1186/s40168-020-00909-7
work_keys_str_mv AT modinoskar hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly
AT liebanaraquel hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly
AT sahebalamsoroush hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly
AT wilenbrittmarie hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly
AT suarezcarolina hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly
AT hermanssonmalte hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly
AT perssonfrank hillbaseddissimilarityindicesandnullmodelsforanalysisofmicrobialcommunityassembly