Cargando…

A general framework for powerful confounder adjustment in omics association studies

MOTIVATION: Genomic data are subject to various sources of confounding, such as demographic variables, biological heterogeneity, and batch effects. To identify genomic features associated with a variable of interest in the presence of confounders, the traditional approach involves fitting a confound...

Descripción completa

Detalles Bibliográficos
Autores principales: Roy, Asmita, Chen, Jun, Zhang, Xianyang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10539716/
https://www.ncbi.nlm.nih.gov/pubmed/37688561
http://dx.doi.org/10.1093/bioinformatics/btad563
_version_ 1785113562719453184
author Roy, Asmita
Chen, Jun
Zhang, Xianyang
author_facet Roy, Asmita
Chen, Jun
Zhang, Xianyang
author_sort Roy, Asmita
collection PubMed
description MOTIVATION: Genomic data are subject to various sources of confounding, such as demographic variables, biological heterogeneity, and batch effects. To identify genomic features associated with a variable of interest in the presence of confounders, the traditional approach involves fitting a confounder-adjusted regression model to each genomic feature, followed by multiplicity correction. RESULTS: This study shows that the traditional approach is suboptimal and proposes a new two-dimensional false discovery rate control framework (2DFDR+) that provides significant power improvement over the conventional method and applies to a wide range of settings. 2DFDR+ uses marginal independence test statistics as auxiliary information to filter out less promising features, and FDR control is performed based on conditional independence test statistics in the remaining features. 2DFDR+ provides (asymptotically) valid inference from samples in settings where the conditional distribution of the genomic variables given the covariate of interest and the confounders is arbitrary and completely unknown. Promising finite sample performance is demonstrated via extensive simulations and real data applications. AVAILABILITY AND IMPLEMENTATION: R codes and vignettes are available at https://github.com/asmita112358/tdfdr.np.
format Online
Article
Text
id pubmed-10539716
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105397162023-09-30 A general framework for powerful confounder adjustment in omics association studies Roy, Asmita Chen, Jun Zhang, Xianyang Bioinformatics Original Paper MOTIVATION: Genomic data are subject to various sources of confounding, such as demographic variables, biological heterogeneity, and batch effects. To identify genomic features associated with a variable of interest in the presence of confounders, the traditional approach involves fitting a confounder-adjusted regression model to each genomic feature, followed by multiplicity correction. RESULTS: This study shows that the traditional approach is suboptimal and proposes a new two-dimensional false discovery rate control framework (2DFDR+) that provides significant power improvement over the conventional method and applies to a wide range of settings. 2DFDR+ uses marginal independence test statistics as auxiliary information to filter out less promising features, and FDR control is performed based on conditional independence test statistics in the remaining features. 2DFDR+ provides (asymptotically) valid inference from samples in settings where the conditional distribution of the genomic variables given the covariate of interest and the confounders is arbitrary and completely unknown. Promising finite sample performance is demonstrated via extensive simulations and real data applications. AVAILABILITY AND IMPLEMENTATION: R codes and vignettes are available at https://github.com/asmita112358/tdfdr.np. Oxford University Press 2023-09-09 /pmc/articles/PMC10539716/ /pubmed/37688561 http://dx.doi.org/10.1093/bioinformatics/btad563 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Roy, Asmita
Chen, Jun
Zhang, Xianyang
A general framework for powerful confounder adjustment in omics association studies
title A general framework for powerful confounder adjustment in omics association studies
title_full A general framework for powerful confounder adjustment in omics association studies
title_fullStr A general framework for powerful confounder adjustment in omics association studies
title_full_unstemmed A general framework for powerful confounder adjustment in omics association studies
title_short A general framework for powerful confounder adjustment in omics association studies
title_sort general framework for powerful confounder adjustment in omics association studies
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10539716/
https://www.ncbi.nlm.nih.gov/pubmed/37688561
http://dx.doi.org/10.1093/bioinformatics/btad563
work_keys_str_mv AT royasmita ageneralframeworkforpowerfulconfounderadjustmentinomicsassociationstudies
AT chenjun ageneralframeworkforpowerfulconfounderadjustmentinomicsassociationstudies
AT zhangxianyang ageneralframeworkforpowerfulconfounderadjustmentinomicsassociationstudies
AT royasmita generalframeworkforpowerfulconfounderadjustmentinomicsassociationstudies
AT chenjun generalframeworkforpowerfulconfounderadjustmentinomicsassociationstudies
AT zhangxianyang generalframeworkforpowerfulconfounderadjustmentinomicsassociationstudies