Cargando…
A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies
Genome-wide association study (GWAS) has turned out to be an essential technology for exploring the genetic mechanism of complex traits. To reduce the complexity of computation, it is well accepted to remove unrelated single nucleotide polymorphisms (SNPs) before GWAS, e.g., by using iterative sure...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516787/ https://www.ncbi.nlm.nih.gov/pubmed/33286103 http://dx.doi.org/10.3390/e22030329 |
_version_ | 1783587081875881984 |
---|---|
author | Guo, Hongping Yu, Zuguo An, Jiyuan Han, Guosheng Ma, Yuanlin Tang, Runbin |
author_facet | Guo, Hongping Yu, Zuguo An, Jiyuan Han, Guosheng Ma, Yuanlin Tang, Runbin |
author_sort | Guo, Hongping |
collection | PubMed |
description | Genome-wide association study (GWAS) has turned out to be an essential technology for exploring the genetic mechanism of complex traits. To reduce the complexity of computation, it is well accepted to remove unrelated single nucleotide polymorphisms (SNPs) before GWAS, e.g., by using iterative sure independence screening expectation-maximization Bayesian Lasso (ISIS EM-BLASSO) method. In this work, a modified version of ISIS EM-BLASSO is proposed, which reduces the number of SNPs by a screening methodology based on Pearson correlation and mutual information, then estimates the effects via EM-Bayesian Lasso (EM-BLASSO), and finally detects the true quantitative trait nucleotides (QTNs) through likelihood ratio test. We call our method a two-stage mutual information based Bayesian Lasso (MBLASSO). Under three simulation scenarios, MBLASSO improves the statistical power and retains the higher effect estimation accuracy when comparing with three other algorithms. Moreover, MBLASSO performs best on model fitting, the accuracy of detected associations is the highest, and 21 genes can only be detected by MBLASSO in Arabidopsis thaliana datasets. |
format | Online Article Text |
id | pubmed-7516787 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75167872020-11-09 A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies Guo, Hongping Yu, Zuguo An, Jiyuan Han, Guosheng Ma, Yuanlin Tang, Runbin Entropy (Basel) Article Genome-wide association study (GWAS) has turned out to be an essential technology for exploring the genetic mechanism of complex traits. To reduce the complexity of computation, it is well accepted to remove unrelated single nucleotide polymorphisms (SNPs) before GWAS, e.g., by using iterative sure independence screening expectation-maximization Bayesian Lasso (ISIS EM-BLASSO) method. In this work, a modified version of ISIS EM-BLASSO is proposed, which reduces the number of SNPs by a screening methodology based on Pearson correlation and mutual information, then estimates the effects via EM-Bayesian Lasso (EM-BLASSO), and finally detects the true quantitative trait nucleotides (QTNs) through likelihood ratio test. We call our method a two-stage mutual information based Bayesian Lasso (MBLASSO). Under three simulation scenarios, MBLASSO improves the statistical power and retains the higher effect estimation accuracy when comparing with three other algorithms. Moreover, MBLASSO performs best on model fitting, the accuracy of detected associations is the highest, and 21 genes can only be detected by MBLASSO in Arabidopsis thaliana datasets. MDPI 2020-03-13 /pmc/articles/PMC7516787/ /pubmed/33286103 http://dx.doi.org/10.3390/e22030329 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Guo, Hongping Yu, Zuguo An, Jiyuan Han, Guosheng Ma, Yuanlin Tang, Runbin A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies |
title | A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies |
title_full | A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies |
title_fullStr | A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies |
title_full_unstemmed | A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies |
title_short | A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies |
title_sort | two-stage mutual information based bayesian lasso algorithm for multi-locus genome-wide association studies |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516787/ https://www.ncbi.nlm.nih.gov/pubmed/33286103 http://dx.doi.org/10.3390/e22030329 |
work_keys_str_mv | AT guohongping atwostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies AT yuzuguo atwostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies AT anjiyuan atwostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies AT hanguosheng atwostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies AT mayuanlin atwostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies AT tangrunbin atwostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies AT guohongping twostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies AT yuzuguo twostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies AT anjiyuan twostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies AT hanguosheng twostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies AT mayuanlin twostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies AT tangrunbin twostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies |