Cargando…

Discrete False-Discovery Rate Improves Identification of Differentially Abundant Microbes

Differential abundance testing is a critical task in microbiome studies that is complicated by the sparsity of data matrices. Here we adapt for microbiome studies a solution from the field of gene expression analysis to produce a new method, discrete false-discovery rate (DS-FDR), that greatly impro...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Lingjing, Amir, Amnon, Morton, James T., Heller, Ruth, Arias-Castro, Ery, Knight, Rob
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5698492/
https://www.ncbi.nlm.nih.gov/pubmed/29181446
http://dx.doi.org/10.1128/mSystems.00092-17
_version_ 1783280777082961920
author Jiang, Lingjing
Amir, Amnon
Morton, James T.
Heller, Ruth
Arias-Castro, Ery
Knight, Rob
author_facet Jiang, Lingjing
Amir, Amnon
Morton, James T.
Heller, Ruth
Arias-Castro, Ery
Knight, Rob
author_sort Jiang, Lingjing
collection PubMed
description Differential abundance testing is a critical task in microbiome studies that is complicated by the sparsity of data matrices. Here we adapt for microbiome studies a solution from the field of gene expression analysis to produce a new method, discrete false-discovery rate (DS-FDR), that greatly improves the power to detect differential taxa by exploiting the discreteness of the data. Additionally, DS-FDR is relatively robust to the number of noninformative features, and thus removes the problem of filtering taxonomy tables by an arbitrary abundance threshold. We show by using a combination of simulations and reanalysis of nine real-world microbiome data sets that this new method outperforms existing methods at the differential abundance testing task, producing a false-discovery rate that is up to threefold more accurate, and halves the number of samples required to find a given difference (thus increasing the efficiency of microbiome experiments considerably). We therefore expect DS-FDR to be widely applied in microbiome studies. IMPORTANCE DS-FDR can achieve higher statistical power to detect significant findings in sparse and noisy microbiome data compared to the commonly used Benjamini-Hochberg procedure and other FDR-controlling procedures.
format Online
Article
Text
id pubmed-5698492
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-56984922017-11-27 Discrete False-Discovery Rate Improves Identification of Differentially Abundant Microbes Jiang, Lingjing Amir, Amnon Morton, James T. Heller, Ruth Arias-Castro, Ery Knight, Rob mSystems Research Article Differential abundance testing is a critical task in microbiome studies that is complicated by the sparsity of data matrices. Here we adapt for microbiome studies a solution from the field of gene expression analysis to produce a new method, discrete false-discovery rate (DS-FDR), that greatly improves the power to detect differential taxa by exploiting the discreteness of the data. Additionally, DS-FDR is relatively robust to the number of noninformative features, and thus removes the problem of filtering taxonomy tables by an arbitrary abundance threshold. We show by using a combination of simulations and reanalysis of nine real-world microbiome data sets that this new method outperforms existing methods at the differential abundance testing task, producing a false-discovery rate that is up to threefold more accurate, and halves the number of samples required to find a given difference (thus increasing the efficiency of microbiome experiments considerably). We therefore expect DS-FDR to be widely applied in microbiome studies. IMPORTANCE DS-FDR can achieve higher statistical power to detect significant findings in sparse and noisy microbiome data compared to the commonly used Benjamini-Hochberg procedure and other FDR-controlling procedures. American Society for Microbiology 2017-11-21 /pmc/articles/PMC5698492/ /pubmed/29181446 http://dx.doi.org/10.1128/mSystems.00092-17 Text en Copyright © 2017 Jiang et al. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Jiang, Lingjing
Amir, Amnon
Morton, James T.
Heller, Ruth
Arias-Castro, Ery
Knight, Rob
Discrete False-Discovery Rate Improves Identification of Differentially Abundant Microbes
title Discrete False-Discovery Rate Improves Identification of Differentially Abundant Microbes
title_full Discrete False-Discovery Rate Improves Identification of Differentially Abundant Microbes
title_fullStr Discrete False-Discovery Rate Improves Identification of Differentially Abundant Microbes
title_full_unstemmed Discrete False-Discovery Rate Improves Identification of Differentially Abundant Microbes
title_short Discrete False-Discovery Rate Improves Identification of Differentially Abundant Microbes
title_sort discrete false-discovery rate improves identification of differentially abundant microbes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5698492/
https://www.ncbi.nlm.nih.gov/pubmed/29181446
http://dx.doi.org/10.1128/mSystems.00092-17
work_keys_str_mv AT jianglingjing discretefalsediscoveryrateimprovesidentificationofdifferentiallyabundantmicrobes
AT amiramnon discretefalsediscoveryrateimprovesidentificationofdifferentiallyabundantmicrobes
AT mortonjamest discretefalsediscoveryrateimprovesidentificationofdifferentiallyabundantmicrobes
AT hellerruth discretefalsediscoveryrateimprovesidentificationofdifferentiallyabundantmicrobes
AT ariascastroery discretefalsediscoveryrateimprovesidentificationofdifferentiallyabundantmicrobes
AT knightrob discretefalsediscoveryrateimprovesidentificationofdifferentiallyabundantmicrobes