Cargando…

A maximum-type microbial differential abundance test with application to high-dimensional microbiome data analyses

BACKGROUND: High-throughput metagenomic sequencing technologies have shown prominent advantages over traditional pathogen detection methods, bringing great potential in clinical pathogen diagnosis and treatment of infectious diseases. Nevertheless, how to accurately detect the difference in microbio...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Zhengbang, Yu, Xiaochen, Guo, Hongping, Lee, TingFang, Hu, Jiyuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9650337/
https://www.ncbi.nlm.nih.gov/pubmed/36389165
http://dx.doi.org/10.3389/fcimb.2022.988717
_version_ 1784827993156222976
author Li, Zhengbang
Yu, Xiaochen
Guo, Hongping
Lee, TingFang
Hu, Jiyuan
author_facet Li, Zhengbang
Yu, Xiaochen
Guo, Hongping
Lee, TingFang
Hu, Jiyuan
author_sort Li, Zhengbang
collection PubMed
description BACKGROUND: High-throughput metagenomic sequencing technologies have shown prominent advantages over traditional pathogen detection methods, bringing great potential in clinical pathogen diagnosis and treatment of infectious diseases. Nevertheless, how to accurately detect the difference in microbiome profiles between treatment or disease conditions remains computationally challenging. RESULTS: In this study, we propose a novel test for identifying the difference between two high-dimensional microbiome abundance data matrices based on the centered log-ratio transformation of the microbiome compositions. The test p-value can be calculated directly with a closed-form solution from the derived asymptotic null distribution. We also investigate the asymptotic statistical power against sparse alternatives that are typically encountered in microbiome studies. The proposed test is maximum-type equal-covariance-assumption-free (MECAF), making it widely applicable to studies that compare microbiome compositions between conditions. Our simulation studies demonstrated that the proposed MECAF test achieves more desirable power than competing methods while having the type I error rate well controlled under various scenarios. The usefulness of the proposed test is further illustrated with two real microbiome data analyses. The source code of the proposed method is freely available at https://github.com/Jiyuan-NYU-Langone/MECAF. CONCLUSIONS: MECAF is a flexible differential abundance test and achieves statistical efficiency in analyzing high-throughput microbiome data. The proposed new method will allow us to efficiently discover shifts in microbiome abundances between disease and treatment conditions, broadening our understanding of the disease and ultimately improving clinical diagnosis and treatment.
format Online
Article
Text
id pubmed-9650337
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96503372022-11-15 A maximum-type microbial differential abundance test with application to high-dimensional microbiome data analyses Li, Zhengbang Yu, Xiaochen Guo, Hongping Lee, TingFang Hu, Jiyuan Front Cell Infect Microbiol Cellular and Infection Microbiology BACKGROUND: High-throughput metagenomic sequencing technologies have shown prominent advantages over traditional pathogen detection methods, bringing great potential in clinical pathogen diagnosis and treatment of infectious diseases. Nevertheless, how to accurately detect the difference in microbiome profiles between treatment or disease conditions remains computationally challenging. RESULTS: In this study, we propose a novel test for identifying the difference between two high-dimensional microbiome abundance data matrices based on the centered log-ratio transformation of the microbiome compositions. The test p-value can be calculated directly with a closed-form solution from the derived asymptotic null distribution. We also investigate the asymptotic statistical power against sparse alternatives that are typically encountered in microbiome studies. The proposed test is maximum-type equal-covariance-assumption-free (MECAF), making it widely applicable to studies that compare microbiome compositions between conditions. Our simulation studies demonstrated that the proposed MECAF test achieves more desirable power than competing methods while having the type I error rate well controlled under various scenarios. The usefulness of the proposed test is further illustrated with two real microbiome data analyses. The source code of the proposed method is freely available at https://github.com/Jiyuan-NYU-Langone/MECAF. CONCLUSIONS: MECAF is a flexible differential abundance test and achieves statistical efficiency in analyzing high-throughput microbiome data. The proposed new method will allow us to efficiently discover shifts in microbiome abundances between disease and treatment conditions, broadening our understanding of the disease and ultimately improving clinical diagnosis and treatment. Frontiers Media S.A. 2022-10-28 /pmc/articles/PMC9650337/ /pubmed/36389165 http://dx.doi.org/10.3389/fcimb.2022.988717 Text en Copyright © 2022 Li, Yu, Guo, Lee and Hu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cellular and Infection Microbiology
Li, Zhengbang
Yu, Xiaochen
Guo, Hongping
Lee, TingFang
Hu, Jiyuan
A maximum-type microbial differential abundance test with application to high-dimensional microbiome data analyses
title A maximum-type microbial differential abundance test with application to high-dimensional microbiome data analyses
title_full A maximum-type microbial differential abundance test with application to high-dimensional microbiome data analyses
title_fullStr A maximum-type microbial differential abundance test with application to high-dimensional microbiome data analyses
title_full_unstemmed A maximum-type microbial differential abundance test with application to high-dimensional microbiome data analyses
title_short A maximum-type microbial differential abundance test with application to high-dimensional microbiome data analyses
title_sort maximum-type microbial differential abundance test with application to high-dimensional microbiome data analyses
topic Cellular and Infection Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9650337/
https://www.ncbi.nlm.nih.gov/pubmed/36389165
http://dx.doi.org/10.3389/fcimb.2022.988717
work_keys_str_mv AT lizhengbang amaximumtypemicrobialdifferentialabundancetestwithapplicationtohighdimensionalmicrobiomedataanalyses
AT yuxiaochen amaximumtypemicrobialdifferentialabundancetestwithapplicationtohighdimensionalmicrobiomedataanalyses
AT guohongping amaximumtypemicrobialdifferentialabundancetestwithapplicationtohighdimensionalmicrobiomedataanalyses
AT leetingfang amaximumtypemicrobialdifferentialabundancetestwithapplicationtohighdimensionalmicrobiomedataanalyses
AT hujiyuan amaximumtypemicrobialdifferentialabundancetestwithapplicationtohighdimensionalmicrobiomedataanalyses
AT lizhengbang maximumtypemicrobialdifferentialabundancetestwithapplicationtohighdimensionalmicrobiomedataanalyses
AT yuxiaochen maximumtypemicrobialdifferentialabundancetestwithapplicationtohighdimensionalmicrobiomedataanalyses
AT guohongping maximumtypemicrobialdifferentialabundancetestwithapplicationtohighdimensionalmicrobiomedataanalyses
AT leetingfang maximumtypemicrobialdifferentialabundancetestwithapplicationtohighdimensionalmicrobiomedataanalyses
AT hujiyuan maximumtypemicrobialdifferentialabundancetestwithapplicationtohighdimensionalmicrobiomedataanalyses