Cargando…

A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies

Genome-wide association study (GWAS) has turned out to be an essential technology for exploring the genetic mechanism of complex traits. To reduce the complexity of computation, it is well accepted to remove unrelated single nucleotide polymorphisms (SNPs) before GWAS, e.g., by using iterative sure...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Hongping, Yu, Zuguo, An, Jiyuan, Han, Guosheng, Ma, Yuanlin, Tang, Runbin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516787/
https://www.ncbi.nlm.nih.gov/pubmed/33286103
http://dx.doi.org/10.3390/e22030329
_version_ 1783587081875881984
author Guo, Hongping
Yu, Zuguo
An, Jiyuan
Han, Guosheng
Ma, Yuanlin
Tang, Runbin
author_facet Guo, Hongping
Yu, Zuguo
An, Jiyuan
Han, Guosheng
Ma, Yuanlin
Tang, Runbin
author_sort Guo, Hongping
collection PubMed
description Genome-wide association study (GWAS) has turned out to be an essential technology for exploring the genetic mechanism of complex traits. To reduce the complexity of computation, it is well accepted to remove unrelated single nucleotide polymorphisms (SNPs) before GWAS, e.g., by using iterative sure independence screening expectation-maximization Bayesian Lasso (ISIS EM-BLASSO) method. In this work, a modified version of ISIS EM-BLASSO is proposed, which reduces the number of SNPs by a screening methodology based on Pearson correlation and mutual information, then estimates the effects via EM-Bayesian Lasso (EM-BLASSO), and finally detects the true quantitative trait nucleotides (QTNs) through likelihood ratio test. We call our method a two-stage mutual information based Bayesian Lasso (MBLASSO). Under three simulation scenarios, MBLASSO improves the statistical power and retains the higher effect estimation accuracy when comparing with three other algorithms. Moreover, MBLASSO performs best on model fitting, the accuracy of detected associations is the highest, and 21 genes can only be detected by MBLASSO in Arabidopsis thaliana datasets.
format Online
Article
Text
id pubmed-7516787
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75167872020-11-09 A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies Guo, Hongping Yu, Zuguo An, Jiyuan Han, Guosheng Ma, Yuanlin Tang, Runbin Entropy (Basel) Article Genome-wide association study (GWAS) has turned out to be an essential technology for exploring the genetic mechanism of complex traits. To reduce the complexity of computation, it is well accepted to remove unrelated single nucleotide polymorphisms (SNPs) before GWAS, e.g., by using iterative sure independence screening expectation-maximization Bayesian Lasso (ISIS EM-BLASSO) method. In this work, a modified version of ISIS EM-BLASSO is proposed, which reduces the number of SNPs by a screening methodology based on Pearson correlation and mutual information, then estimates the effects via EM-Bayesian Lasso (EM-BLASSO), and finally detects the true quantitative trait nucleotides (QTNs) through likelihood ratio test. We call our method a two-stage mutual information based Bayesian Lasso (MBLASSO). Under three simulation scenarios, MBLASSO improves the statistical power and retains the higher effect estimation accuracy when comparing with three other algorithms. Moreover, MBLASSO performs best on model fitting, the accuracy of detected associations is the highest, and 21 genes can only be detected by MBLASSO in Arabidopsis thaliana datasets. MDPI 2020-03-13 /pmc/articles/PMC7516787/ /pubmed/33286103 http://dx.doi.org/10.3390/e22030329 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Guo, Hongping
Yu, Zuguo
An, Jiyuan
Han, Guosheng
Ma, Yuanlin
Tang, Runbin
A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies
title A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies
title_full A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies
title_fullStr A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies
title_full_unstemmed A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies
title_short A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies
title_sort two-stage mutual information based bayesian lasso algorithm for multi-locus genome-wide association studies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516787/
https://www.ncbi.nlm.nih.gov/pubmed/33286103
http://dx.doi.org/10.3390/e22030329
work_keys_str_mv AT guohongping atwostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies
AT yuzuguo atwostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies
AT anjiyuan atwostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies
AT hanguosheng atwostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies
AT mayuanlin atwostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies
AT tangrunbin atwostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies
AT guohongping twostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies
AT yuzuguo twostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies
AT anjiyuan twostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies
AT hanguosheng twostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies
AT mayuanlin twostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies
AT tangrunbin twostagemutualinformationbasedbayesianlassoalgorithmformultilocusgenomewideassociationstudies