Cargando…
A unified framework for unconstrained and constrained ordination of microbiome read count data
Explorative visualization techniques provide a first summary of microbiome read count datasets through dimension reduction. A plethora of dimension reduction methods exists, but many of them focus primarily on sample ordination, failing to elucidate the role of the bacterial species. Moreover, impli...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6373939/ https://www.ncbi.nlm.nih.gov/pubmed/30759084 http://dx.doi.org/10.1371/journal.pone.0205474 |
_version_ | 1783395070998740992 |
---|---|
author | Hawinkel, Stijn Kerckhof, Frederiek-Maarten Bijnens, Luc Thas, Olivier |
author_facet | Hawinkel, Stijn Kerckhof, Frederiek-Maarten Bijnens, Luc Thas, Olivier |
author_sort | Hawinkel, Stijn |
collection | PubMed |
description | Explorative visualization techniques provide a first summary of microbiome read count datasets through dimension reduction. A plethora of dimension reduction methods exists, but many of them focus primarily on sample ordination, failing to elucidate the role of the bacterial species. Moreover, implicit but often unrealistic assumptions underlying these methods fail to account for overdispersion and differences in sequencing depth, which are two typical characteristics of sequencing data. We combine log-linear models with a dispersion estimation algorithm and flexible response function modelling into a framework for unconstrained and constrained ordination. The method is able to cope with differences in dispersion between taxa and varying sequencing depths, to yield meaningful biological patterns. Moreover, it can correct for observed technical confounders, whereas other methods are adversely affected by these artefacts. Unlike distance-based ordination methods, the assumptions underlying our method are stated explicitly and can be verified using simple diagnostics. The combination of unconstrained and constrained ordination in the same framework is unique in the field and facilitates microbiome data exploration. We illustrate the advantages of our method on simulated and real datasets, while pointing out flaws in existing methods. The algorithms for fitting and plotting are available in the R-package RCM. |
format | Online Article Text |
id | pubmed-6373939 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-63739392019-03-01 A unified framework for unconstrained and constrained ordination of microbiome read count data Hawinkel, Stijn Kerckhof, Frederiek-Maarten Bijnens, Luc Thas, Olivier PLoS One Research Article Explorative visualization techniques provide a first summary of microbiome read count datasets through dimension reduction. A plethora of dimension reduction methods exists, but many of them focus primarily on sample ordination, failing to elucidate the role of the bacterial species. Moreover, implicit but often unrealistic assumptions underlying these methods fail to account for overdispersion and differences in sequencing depth, which are two typical characteristics of sequencing data. We combine log-linear models with a dispersion estimation algorithm and flexible response function modelling into a framework for unconstrained and constrained ordination. The method is able to cope with differences in dispersion between taxa and varying sequencing depths, to yield meaningful biological patterns. Moreover, it can correct for observed technical confounders, whereas other methods are adversely affected by these artefacts. Unlike distance-based ordination methods, the assumptions underlying our method are stated explicitly and can be verified using simple diagnostics. The combination of unconstrained and constrained ordination in the same framework is unique in the field and facilitates microbiome data exploration. We illustrate the advantages of our method on simulated and real datasets, while pointing out flaws in existing methods. The algorithms for fitting and plotting are available in the R-package RCM. Public Library of Science 2019-02-13 /pmc/articles/PMC6373939/ /pubmed/30759084 http://dx.doi.org/10.1371/journal.pone.0205474 Text en © 2019 Hawinkel et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Hawinkel, Stijn Kerckhof, Frederiek-Maarten Bijnens, Luc Thas, Olivier A unified framework for unconstrained and constrained ordination of microbiome read count data |
title | A unified framework for unconstrained and constrained ordination of microbiome read count data |
title_full | A unified framework for unconstrained and constrained ordination of microbiome read count data |
title_fullStr | A unified framework for unconstrained and constrained ordination of microbiome read count data |
title_full_unstemmed | A unified framework for unconstrained and constrained ordination of microbiome read count data |
title_short | A unified framework for unconstrained and constrained ordination of microbiome read count data |
title_sort | unified framework for unconstrained and constrained ordination of microbiome read count data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6373939/ https://www.ncbi.nlm.nih.gov/pubmed/30759084 http://dx.doi.org/10.1371/journal.pone.0205474 |
work_keys_str_mv | AT hawinkelstijn aunifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata AT kerckhoffrederiekmaarten aunifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata AT bijnensluc aunifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata AT thasolivier aunifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata AT hawinkelstijn unifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata AT kerckhoffrederiekmaarten unifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata AT bijnensluc unifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata AT thasolivier unifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata |