Incorporating Phylogenetic Information in Microbiome Differential Abundance Studies Has No Effect on Detection Power and FDR Control

We consider the problem of incorporating evolutionary information (e.g., taxonomic or phylogenic trees) in the context of metagenomics differential analysis. Recent results published in the literature propose different ways to leverage the tree structure to increase the detection rate of differentia...


Bibliographic Details
Main authors: Bichat, Antoine, Plassais, Jonathan, Ambroise, Christophe, Mariadassou, Mahendra
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7174607/
https://www.ncbi.nlm.nih.gov/pubmed/32351481
http://dx.doi.org/10.3389/fmicb.2020.00649
author Bichat, Antoine
Plassais, Jonathan
Ambroise, Christophe
Mariadassou, Mahendra
collection PubMed
description We consider the problem of incorporating evolutionary information (e.g., taxonomic or phylogenic trees) in the context of metagenomics differential analysis. Recent results published in the literature propose different ways to leverage the tree structure to increase the detection rate of differentially abundant taxa. Here, we propose instead to use a different hierarchical structure, in the form of a correlation-based tree, as it may capture the structure of the data better than the phylogeny. We first show that the correlation tree and the phylogeny are significantly different before turning to the impact of tree choice on detection rates. Using synthetic data, we show that the tree does have an impact: smoothing p-values according to the phylogeny leads to detection rates equal or inferior to those obtained by smoothing according to the correlation tree. However, both trees are outperformed by the classical, non-hierarchical Benjamini–Hochberg (BH) procedure in terms of detection rates. Other procedures may use the hierarchical structure with profit but do not control the False Discovery Rate (FDR) a priori and remain inferior to a classical Benjamini–Hochberg procedure with the same nominal FDR. On real datasets, no hierarchical procedure had a significantly higher detection rate than BH. Intuition suggests that the use of hierarchical structures should increase the detection rate of differentially abundant taxa in microbiome studies. However, our results suggest that current hierarchical procedures are still inferior to standard methods and that more effective procedures remain to be invented.
format Online
Article
Text
id pubmed-7174607
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-7174607 2020-04-29 Front Microbiol Microbiology Frontiers Media S.A. 2020-04-15 /pmc/articles/PMC7174607/ /pubmed/32351481 http://dx.doi.org/10.3389/fmicb.2020.00649 Text en Copyright © 2020 Bichat, Plassais, Ambroise and Mariadassou. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title Incorporating Phylogenetic Information in Microbiome Differential Abundance Studies Has No Effect on Detection Power and FDR Control
topic Microbiology