Cargando…

A unified framework for unconstrained and constrained ordination of microbiome read count data

Explorative visualization techniques provide a first summary of microbiome read count datasets through dimension reduction. A plethora of dimension reduction methods exists, but many of them focus primarily on sample ordination, failing to elucidate the role of the bacterial species. Moreover, impli...

Descripción completa

Detalles Bibliográficos
Autores principales: Hawinkel, Stijn, Kerckhof, Frederiek-Maarten, Bijnens, Luc, Thas, Olivier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6373939/
https://www.ncbi.nlm.nih.gov/pubmed/30759084
http://dx.doi.org/10.1371/journal.pone.0205474
_version_ 1783395070998740992
author Hawinkel, Stijn
Kerckhof, Frederiek-Maarten
Bijnens, Luc
Thas, Olivier
author_facet Hawinkel, Stijn
Kerckhof, Frederiek-Maarten
Bijnens, Luc
Thas, Olivier
author_sort Hawinkel, Stijn
collection PubMed
description Explorative visualization techniques provide a first summary of microbiome read count datasets through dimension reduction. A plethora of dimension reduction methods exists, but many of them focus primarily on sample ordination, failing to elucidate the role of the bacterial species. Moreover, implicit but often unrealistic assumptions underlying these methods fail to account for overdispersion and differences in sequencing depth, which are two typical characteristics of sequencing data. We combine log-linear models with a dispersion estimation algorithm and flexible response function modelling into a framework for unconstrained and constrained ordination. The method is able to cope with differences in dispersion between taxa and varying sequencing depths, to yield meaningful biological patterns. Moreover, it can correct for observed technical confounders, whereas other methods are adversely affected by these artefacts. Unlike distance-based ordination methods, the assumptions underlying our method are stated explicitly and can be verified using simple diagnostics. The combination of unconstrained and constrained ordination in the same framework is unique in the field and facilitates microbiome data exploration. We illustrate the advantages of our method on simulated and real datasets, while pointing out flaws in existing methods. The algorithms for fitting and plotting are available in the R-package RCM.
format Online
Article
Text
id pubmed-6373939
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-63739392019-03-01 A unified framework for unconstrained and constrained ordination of microbiome read count data Hawinkel, Stijn Kerckhof, Frederiek-Maarten Bijnens, Luc Thas, Olivier PLoS One Research Article Explorative visualization techniques provide a first summary of microbiome read count datasets through dimension reduction. A plethora of dimension reduction methods exists, but many of them focus primarily on sample ordination, failing to elucidate the role of the bacterial species. Moreover, implicit but often unrealistic assumptions underlying these methods fail to account for overdispersion and differences in sequencing depth, which are two typical characteristics of sequencing data. We combine log-linear models with a dispersion estimation algorithm and flexible response function modelling into a framework for unconstrained and constrained ordination. The method is able to cope with differences in dispersion between taxa and varying sequencing depths, to yield meaningful biological patterns. Moreover, it can correct for observed technical confounders, whereas other methods are adversely affected by these artefacts. Unlike distance-based ordination methods, the assumptions underlying our method are stated explicitly and can be verified using simple diagnostics. The combination of unconstrained and constrained ordination in the same framework is unique in the field and facilitates microbiome data exploration. We illustrate the advantages of our method on simulated and real datasets, while pointing out flaws in existing methods. The algorithms for fitting and plotting are available in the R-package RCM. Public Library of Science 2019-02-13 /pmc/articles/PMC6373939/ /pubmed/30759084 http://dx.doi.org/10.1371/journal.pone.0205474 Text en © 2019 Hawinkel et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Hawinkel, Stijn
Kerckhof, Frederiek-Maarten
Bijnens, Luc
Thas, Olivier
A unified framework for unconstrained and constrained ordination of microbiome read count data
title A unified framework for unconstrained and constrained ordination of microbiome read count data
title_full A unified framework for unconstrained and constrained ordination of microbiome read count data
title_fullStr A unified framework for unconstrained and constrained ordination of microbiome read count data
title_full_unstemmed A unified framework for unconstrained and constrained ordination of microbiome read count data
title_short A unified framework for unconstrained and constrained ordination of microbiome read count data
title_sort unified framework for unconstrained and constrained ordination of microbiome read count data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6373939/
https://www.ncbi.nlm.nih.gov/pubmed/30759084
http://dx.doi.org/10.1371/journal.pone.0205474
work_keys_str_mv AT hawinkelstijn aunifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata
AT kerckhoffrederiekmaarten aunifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata
AT bijnensluc aunifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata
AT thasolivier aunifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata
AT hawinkelstijn unifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata
AT kerckhoffrederiekmaarten unifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata
AT bijnensluc unifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata
AT thasolivier unifiedframeworkforunconstrainedandconstrainedordinationofmicrobiomereadcountdata