Cargando…
Bayesian variable selection using Knockoffs with applications to genomics
Given the costliness of HIV drug therapy research, it is important not only to maximize true positive rate (TPR) by identifying which genetic markers are related to drug resistance, but also to minimize false discovery rate (FDR) by reducing the number of incorrect markers unrelated to drug resistan...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9483350/ https://www.ncbi.nlm.nih.gov/pubmed/36157067 http://dx.doi.org/10.1007/s00180-022-01283-8 |
_version_ | 1784791652276109312 |
---|---|
author | Yap, Jurel K. Gauran, Iris Ivy M. |
author_facet | Yap, Jurel K. Gauran, Iris Ivy M. |
author_sort | Yap, Jurel K. |
collection | PubMed |
description | Given the costliness of HIV drug therapy research, it is important not only to maximize true positive rate (TPR) by identifying which genetic markers are related to drug resistance, but also to minimize false discovery rate (FDR) by reducing the number of incorrect markers unrelated to drug resistance. In this study, we propose a multiple testing procedure that unifies key concepts in computational statistics, namely Model-free Knockoffs, Bayesian variable selection, and the local false discovery rate. We develop an algorithm that utilizes the augmented data-Knockoff matrix and implement Bayesian Lasso. We then identify signals using test statistics based on Markov Chain Monte Carlo outputs and local false discovery rate. We test our proposed methods against non-bayesian methods such as Benjamini–Hochberg (BHq) and Lasso regression in terms TPR and FDR. Using numerical studies, we show the proposed method yields lower FDR compared to BHq and Lasso for certain cases, such as for low and equi-dimensional cases. We also discuss an application to an HIV-1 data set, which aims to be applied analyzing genetic markers linked to drug resistant HIV in the Philippines in future work. |
format | Online Article Text |
id | pubmed-9483350 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-94833502022-09-19 Bayesian variable selection using Knockoffs with applications to genomics Yap, Jurel K. Gauran, Iris Ivy M. Comput Stat Original Paper Given the costliness of HIV drug therapy research, it is important not only to maximize true positive rate (TPR) by identifying which genetic markers are related to drug resistance, but also to minimize false discovery rate (FDR) by reducing the number of incorrect markers unrelated to drug resistance. In this study, we propose a multiple testing procedure that unifies key concepts in computational statistics, namely Model-free Knockoffs, Bayesian variable selection, and the local false discovery rate. We develop an algorithm that utilizes the augmented data-Knockoff matrix and implement Bayesian Lasso. We then identify signals using test statistics based on Markov Chain Monte Carlo outputs and local false discovery rate. We test our proposed methods against non-bayesian methods such as Benjamini–Hochberg (BHq) and Lasso regression in terms TPR and FDR. Using numerical studies, we show the proposed method yields lower FDR compared to BHq and Lasso for certain cases, such as for low and equi-dimensional cases. We also discuss an application to an HIV-1 data set, which aims to be applied analyzing genetic markers linked to drug resistant HIV in the Philippines in future work. Springer Berlin Heidelberg 2022-09-18 /pmc/articles/PMC9483350/ /pubmed/36157067 http://dx.doi.org/10.1007/s00180-022-01283-8 Text en © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Original Paper Yap, Jurel K. Gauran, Iris Ivy M. Bayesian variable selection using Knockoffs with applications to genomics |
title | Bayesian variable selection using Knockoffs with applications to genomics |
title_full | Bayesian variable selection using Knockoffs with applications to genomics |
title_fullStr | Bayesian variable selection using Knockoffs with applications to genomics |
title_full_unstemmed | Bayesian variable selection using Knockoffs with applications to genomics |
title_short | Bayesian variable selection using Knockoffs with applications to genomics |
title_sort | bayesian variable selection using knockoffs with applications to genomics |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9483350/ https://www.ncbi.nlm.nih.gov/pubmed/36157067 http://dx.doi.org/10.1007/s00180-022-01283-8 |
work_keys_str_mv | AT yapjurelk bayesianvariableselectionusingknockoffswithapplicationstogenomics AT gauranirisivym bayesianvariableselectionusingknockoffswithapplicationstogenomics |