Cargando…

Batch effects removal for microbiome data via conditional quantile regression

Batch effects in microbiome data arise from differential processing of specimens and can lead to spurious findings and obscure true signals. Strategies designed for genomic data to mitigate batch effects usually fail to address the zero-inflated and over-dispersed microbiome data. Most strategies ta...

Descripción completa

Detalles Bibliográficos
Autores principales: Ling, Wodan, Lu, Jiuyao, Zhao, Ni, Lulla, Anju, Plantinga, Anna M., Fu, Weijia, Zhang, Angela, Liu, Hongjiao, Song, Hoseung, Li, Zhigang, Chen, Jun, Randolph, Timothy W., Koay, Wei Li A., White, James R., Launer, Lenore J., Fodor, Anthony A., Meyer, Katie A., Wu, Michael C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9477887/
https://www.ncbi.nlm.nih.gov/pubmed/36109499
http://dx.doi.org/10.1038/s41467-022-33071-9
Descripción
Sumario:Batch effects in microbiome data arise from differential processing of specimens and can lead to spurious findings and obscure true signals. Strategies designed for genomic data to mitigate batch effects usually fail to address the zero-inflated and over-dispersed microbiome data. Most strategies tailored for microbiome data are restricted to association testing or specialized study designs, failing to allow other analytic goals or general designs. Here, we develop the Conditional Quantile Regression (ConQuR) approach to remove microbiome batch effects using a two-part quantile regression model. ConQuR is a comprehensive method that accommodates the complex distributions of microbial read counts by non-parametric modeling, and it generates batch-removed zero-inflated read counts that can be used in and benefit usual subsequent analyses. We apply ConQuR to simulated and real microbiome datasets and demonstrate its advantages in removing batch effects while preserving the signals of interest.