Cargando…

pLARmEB: integration of least angle regression with empirical Bayes for multilocus genome-wide association studies

Multilocus genome-wide association studies (GWAS) have become the state-of-the-art procedure to identify quantitative trait nucleotides (QTNs) associated with complex traits. However, implementation of multilocus model in GWAS is still difficult. In this study, we integrated least angle regression w...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, J, Feng, J-Y, Ni, Y-L, Wen, Y-J, Niu, Y, Tamba, C L, Yue, C, Song, Q, Zhang, Y-M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5436030/
https://www.ncbi.nlm.nih.gov/pubmed/28295030
http://dx.doi.org/10.1038/hdy.2017.8
_version_ 1783237328953671680
author Zhang, J
Feng, J-Y
Ni, Y-L
Wen, Y-J
Niu, Y
Tamba, C L
Yue, C
Song, Q
Zhang, Y-M
author_facet Zhang, J
Feng, J-Y
Ni, Y-L
Wen, Y-J
Niu, Y
Tamba, C L
Yue, C
Song, Q
Zhang, Y-M
author_sort Zhang, J
collection PubMed
description Multilocus genome-wide association studies (GWAS) have become the state-of-the-art procedure to identify quantitative trait nucleotides (QTNs) associated with complex traits. However, implementation of multilocus model in GWAS is still difficult. In this study, we integrated least angle regression with empirical Bayes to perform multilocus GWAS under polygenic background control. We used an algorithm of model transformation that whitened the covariance matrix of the polygenic matrix K and environmental noise. Markers on one chromosome were included simultaneously in a multilocus model and least angle regression was used to select the most potentially associated single-nucleotide polymorphisms (SNPs), whereas the markers on the other chromosomes were used to calculate kinship matrix as polygenic background control. The selected SNPs in multilocus model were further detected for their association with the trait by empirical Bayes and likelihood ratio test. We herein refer to this method as the pLARmEB (polygenic-background-control-based least angle regression plus empirical Bayes). Results from simulation studies showed that pLARmEB was more powerful in QTN detection and more accurate in QTN effect estimation, had less false positive rate and required less computing time than Bayesian hierarchical generalized linear model, efficient mixed model association (EMMA) and least angle regression plus empirical Bayes. pLARmEB, multilocus random-SNP-effect mixed linear model and fast multilocus random-SNP-effect EMMA methods had almost equal power of QTN detection in simulation experiments. However, only pLARmEB identified 48 previously reported genes for 7 flowering time-related traits in Arabidopsis thaliana.
format Online
Article
Text
id pubmed-5436030
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-54360302017-06-01 pLARmEB: integration of least angle regression with empirical Bayes for multilocus genome-wide association studies Zhang, J Feng, J-Y Ni, Y-L Wen, Y-J Niu, Y Tamba, C L Yue, C Song, Q Zhang, Y-M Heredity (Edinb) Original Article Multilocus genome-wide association studies (GWAS) have become the state-of-the-art procedure to identify quantitative trait nucleotides (QTNs) associated with complex traits. However, implementation of multilocus model in GWAS is still difficult. In this study, we integrated least angle regression with empirical Bayes to perform multilocus GWAS under polygenic background control. We used an algorithm of model transformation that whitened the covariance matrix of the polygenic matrix K and environmental noise. Markers on one chromosome were included simultaneously in a multilocus model and least angle regression was used to select the most potentially associated single-nucleotide polymorphisms (SNPs), whereas the markers on the other chromosomes were used to calculate kinship matrix as polygenic background control. The selected SNPs in multilocus model were further detected for their association with the trait by empirical Bayes and likelihood ratio test. We herein refer to this method as the pLARmEB (polygenic-background-control-based least angle regression plus empirical Bayes). Results from simulation studies showed that pLARmEB was more powerful in QTN detection and more accurate in QTN effect estimation, had less false positive rate and required less computing time than Bayesian hierarchical generalized linear model, efficient mixed model association (EMMA) and least angle regression plus empirical Bayes. pLARmEB, multilocus random-SNP-effect mixed linear model and fast multilocus random-SNP-effect EMMA methods had almost equal power of QTN detection in simulation experiments. However, only pLARmEB identified 48 previously reported genes for 7 flowering time-related traits in Arabidopsis thaliana. Nature Publishing Group 2017-06 2017-03-15 /pmc/articles/PMC5436030/ /pubmed/28295030 http://dx.doi.org/10.1038/hdy.2017.8 Text en Copyright © 2017 The Author(s) http://creativecommons.org/licenses/by-nc-sa/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/
spellingShingle Original Article
Zhang, J
Feng, J-Y
Ni, Y-L
Wen, Y-J
Niu, Y
Tamba, C L
Yue, C
Song, Q
Zhang, Y-M
pLARmEB: integration of least angle regression with empirical Bayes for multilocus genome-wide association studies
title pLARmEB: integration of least angle regression with empirical Bayes for multilocus genome-wide association studies
title_full pLARmEB: integration of least angle regression with empirical Bayes for multilocus genome-wide association studies
title_fullStr pLARmEB: integration of least angle regression with empirical Bayes for multilocus genome-wide association studies
title_full_unstemmed pLARmEB: integration of least angle regression with empirical Bayes for multilocus genome-wide association studies
title_short pLARmEB: integration of least angle regression with empirical Bayes for multilocus genome-wide association studies
title_sort plarmeb: integration of least angle regression with empirical bayes for multilocus genome-wide association studies
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5436030/
https://www.ncbi.nlm.nih.gov/pubmed/28295030
http://dx.doi.org/10.1038/hdy.2017.8
work_keys_str_mv AT zhangj plarmebintegrationofleastangleregressionwithempiricalbayesformultilocusgenomewideassociationstudies
AT fengjy plarmebintegrationofleastangleregressionwithempiricalbayesformultilocusgenomewideassociationstudies
AT niyl plarmebintegrationofleastangleregressionwithempiricalbayesformultilocusgenomewideassociationstudies
AT wenyj plarmebintegrationofleastangleregressionwithempiricalbayesformultilocusgenomewideassociationstudies
AT niuy plarmebintegrationofleastangleregressionwithempiricalbayesformultilocusgenomewideassociationstudies
AT tambacl plarmebintegrationofleastangleregressionwithempiricalbayesformultilocusgenomewideassociationstudies
AT yuec plarmebintegrationofleastangleregressionwithempiricalbayesformultilocusgenomewideassociationstudies
AT songq plarmebintegrationofleastangleregressionwithempiricalbayesformultilocusgenomewideassociationstudies
AT zhangym plarmebintegrationofleastangleregressionwithempiricalbayesformultilocusgenomewideassociationstudies