Cargando…
An Efficient Score Test Integrated with Empirical Bayes for Genome-Wide Association Studies
Many methods used in multi-locus genome-wide association studies (GWAS) have been developed to improve statistical power. However, most existing multi-locus methods are not quicker than single-locus methods. To address this concern, we proposed a fast score test integrated with Empirical Bayes (Scor...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8517403/ https://www.ncbi.nlm.nih.gov/pubmed/34659362 http://dx.doi.org/10.3389/fgene.2021.742752 |
_version_ | 1784584009279340544 |
---|---|
author | Xiao, Jing Zhou, Yang He, Shu Ren, Wen-Long |
author_facet | Xiao, Jing Zhou, Yang He, Shu Ren, Wen-Long |
author_sort | Xiao, Jing |
collection | PubMed |
description | Many methods used in multi-locus genome-wide association studies (GWAS) have been developed to improve statistical power. However, most existing multi-locus methods are not quicker than single-locus methods. To address this concern, we proposed a fast score test integrated with Empirical Bayes (ScoreEB) for multi-locus GWAS. Firstly, a score test was conducted for each single nucleotide polymorphism (SNP) under a linear mixed model (LMM) framework, taking into account the genetic relatedness and population structure. Then, all of the potentially associated SNPs were selected with a less stringent criterion. Finally, Empirical Bayes in a multi-locus model was performed for all of the selected SNPs to identify the true quantitative trait nucleotide (QTN). Our new method ScoreEB adopts the similar strategy of multi-locus random-SNP-effect mixed linear model (mrMLM) and fast multi-locus random-SNP-effect EMMA (FASTmrEMMA), and the only difference is that we use the score test to select all the potentially associated markers. Monte Carlo simulation studies demonstrate that ScoreEB significantly improved the computational efficiency compared with the popular methods mrMLM, FASTmrEMMA, iterative modified-sure independence screening EM-Bayesian lasso (ISIS EM-BLASSO), hybrid of restricted and penalized maximum likelihood (HRePML) and genome-wide efficient mixed model association (GEMMA). In addition, ScoreEB remained accurate in QTN effect estimation and effectively controlled false positive rate. Subsequently, ScoreEB was applied to re-analyze quantitative traits in plants and animals. The results show that ScoreEB not only can detect previously reported genes, but also can mine new genes. |
format | Online Article Text |
id | pubmed-8517403 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-85174032021-10-16 An Efficient Score Test Integrated with Empirical Bayes for Genome-Wide Association Studies Xiao, Jing Zhou, Yang He, Shu Ren, Wen-Long Front Genet Genetics Many methods used in multi-locus genome-wide association studies (GWAS) have been developed to improve statistical power. However, most existing multi-locus methods are not quicker than single-locus methods. To address this concern, we proposed a fast score test integrated with Empirical Bayes (ScoreEB) for multi-locus GWAS. Firstly, a score test was conducted for each single nucleotide polymorphism (SNP) under a linear mixed model (LMM) framework, taking into account the genetic relatedness and population structure. Then, all of the potentially associated SNPs were selected with a less stringent criterion. Finally, Empirical Bayes in a multi-locus model was performed for all of the selected SNPs to identify the true quantitative trait nucleotide (QTN). Our new method ScoreEB adopts the similar strategy of multi-locus random-SNP-effect mixed linear model (mrMLM) and fast multi-locus random-SNP-effect EMMA (FASTmrEMMA), and the only difference is that we use the score test to select all the potentially associated markers. Monte Carlo simulation studies demonstrate that ScoreEB significantly improved the computational efficiency compared with the popular methods mrMLM, FASTmrEMMA, iterative modified-sure independence screening EM-Bayesian lasso (ISIS EM-BLASSO), hybrid of restricted and penalized maximum likelihood (HRePML) and genome-wide efficient mixed model association (GEMMA). In addition, ScoreEB remained accurate in QTN effect estimation and effectively controlled false positive rate. Subsequently, ScoreEB was applied to re-analyze quantitative traits in plants and animals. The results show that ScoreEB not only can detect previously reported genes, but also can mine new genes. Frontiers Media S.A. 2021-10-01 /pmc/articles/PMC8517403/ /pubmed/34659362 http://dx.doi.org/10.3389/fgene.2021.742752 Text en Copyright © 2021 Xiao, Zhou, He and Ren. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Xiao, Jing Zhou, Yang He, Shu Ren, Wen-Long An Efficient Score Test Integrated with Empirical Bayes for Genome-Wide Association Studies |
title | An Efficient Score Test Integrated with Empirical Bayes for Genome-Wide Association Studies |
title_full | An Efficient Score Test Integrated with Empirical Bayes for Genome-Wide Association Studies |
title_fullStr | An Efficient Score Test Integrated with Empirical Bayes for Genome-Wide Association Studies |
title_full_unstemmed | An Efficient Score Test Integrated with Empirical Bayes for Genome-Wide Association Studies |
title_short | An Efficient Score Test Integrated with Empirical Bayes for Genome-Wide Association Studies |
title_sort | efficient score test integrated with empirical bayes for genome-wide association studies |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8517403/ https://www.ncbi.nlm.nih.gov/pubmed/34659362 http://dx.doi.org/10.3389/fgene.2021.742752 |
work_keys_str_mv | AT xiaojing anefficientscoretestintegratedwithempiricalbayesforgenomewideassociationstudies AT zhouyang anefficientscoretestintegratedwithempiricalbayesforgenomewideassociationstudies AT heshu anefficientscoretestintegratedwithempiricalbayesforgenomewideassociationstudies AT renwenlong anefficientscoretestintegratedwithempiricalbayesforgenomewideassociationstudies AT xiaojing efficientscoretestintegratedwithempiricalbayesforgenomewideassociationstudies AT zhouyang efficientscoretestintegratedwithempiricalbayesforgenomewideassociationstudies AT heshu efficientscoretestintegratedwithempiricalbayesforgenomewideassociationstudies AT renwenlong efficientscoretestintegratedwithempiricalbayesforgenomewideassociationstudies |