Cargando…
Hybrid of Restricted and Penalized Maximum Likelihood Method for Efficient Genome-Wide Association Study
In genome-wide association studies, linear mixed models (LMMs) have been widely used to explore the molecular mechanism of complex traits. However, typical association approaches suffer from several important drawbacks: estimation of variance components in LMMs with large scale individuals is comput...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7692801/ https://www.ncbi.nlm.nih.gov/pubmed/33138126 http://dx.doi.org/10.3390/genes11111286 |
_version_ | 1783614596718788608 |
---|---|
author | Ren, Wenlong Liang, Zhikai He, Shu Xiao, Jing |
author_facet | Ren, Wenlong Liang, Zhikai He, Shu Xiao, Jing |
author_sort | Ren, Wenlong |
collection | PubMed |
description | In genome-wide association studies, linear mixed models (LMMs) have been widely used to explore the molecular mechanism of complex traits. However, typical association approaches suffer from several important drawbacks: estimation of variance components in LMMs with large scale individuals is computationally slow; single-locus model is unsatisfactory to handle complex confounding and causes loss of statistical power. To address these issues, we propose an efficient two-stage method based on hybrid of restricted and penalized maximum likelihood, named HRePML. Firstly, we performed restricted maximum likelihood (REML) on single-locus LMM to remove unrelated markers, where spectral decomposition on covariance matrix was used to fast estimate variance components. Secondly, we carried out penalized maximum likelihood (PML) on multi-locus LMM for markers with reasonably large effects. To validate the effectiveness of HRePML, we conducted a series of simulation studies and real data analyses. As a result, our method always had the highest average statistical power compared with multi-locus mixed-model (MLMM), fixed and random model circulating probability unification (FarmCPU), and genome-wide efficient mixed model association (GEMMA). More importantly, HRePML can provide higher accuracy estimation of marker effects. HRePML also identifies 41 previous reported genes associated with development traits in Arabidopsis, which is more than was detected by the other methods. |
format | Online Article Text |
id | pubmed-7692801 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-76928012020-11-28 Hybrid of Restricted and Penalized Maximum Likelihood Method for Efficient Genome-Wide Association Study Ren, Wenlong Liang, Zhikai He, Shu Xiao, Jing Genes (Basel) Article In genome-wide association studies, linear mixed models (LMMs) have been widely used to explore the molecular mechanism of complex traits. However, typical association approaches suffer from several important drawbacks: estimation of variance components in LMMs with large scale individuals is computationally slow; single-locus model is unsatisfactory to handle complex confounding and causes loss of statistical power. To address these issues, we propose an efficient two-stage method based on hybrid of restricted and penalized maximum likelihood, named HRePML. Firstly, we performed restricted maximum likelihood (REML) on single-locus LMM to remove unrelated markers, where spectral decomposition on covariance matrix was used to fast estimate variance components. Secondly, we carried out penalized maximum likelihood (PML) on multi-locus LMM for markers with reasonably large effects. To validate the effectiveness of HRePML, we conducted a series of simulation studies and real data analyses. As a result, our method always had the highest average statistical power compared with multi-locus mixed-model (MLMM), fixed and random model circulating probability unification (FarmCPU), and genome-wide efficient mixed model association (GEMMA). More importantly, HRePML can provide higher accuracy estimation of marker effects. HRePML also identifies 41 previous reported genes associated with development traits in Arabidopsis, which is more than was detected by the other methods. MDPI 2020-10-29 /pmc/articles/PMC7692801/ /pubmed/33138126 http://dx.doi.org/10.3390/genes11111286 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ren, Wenlong Liang, Zhikai He, Shu Xiao, Jing Hybrid of Restricted and Penalized Maximum Likelihood Method for Efficient Genome-Wide Association Study |
title | Hybrid of Restricted and Penalized Maximum Likelihood Method for Efficient Genome-Wide Association Study |
title_full | Hybrid of Restricted and Penalized Maximum Likelihood Method for Efficient Genome-Wide Association Study |
title_fullStr | Hybrid of Restricted and Penalized Maximum Likelihood Method for Efficient Genome-Wide Association Study |
title_full_unstemmed | Hybrid of Restricted and Penalized Maximum Likelihood Method for Efficient Genome-Wide Association Study |
title_short | Hybrid of Restricted and Penalized Maximum Likelihood Method for Efficient Genome-Wide Association Study |
title_sort | hybrid of restricted and penalized maximum likelihood method for efficient genome-wide association study |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7692801/ https://www.ncbi.nlm.nih.gov/pubmed/33138126 http://dx.doi.org/10.3390/genes11111286 |
work_keys_str_mv | AT renwenlong hybridofrestrictedandpenalizedmaximumlikelihoodmethodforefficientgenomewideassociationstudy AT liangzhikai hybridofrestrictedandpenalizedmaximumlikelihoodmethodforefficientgenomewideassociationstudy AT heshu hybridofrestrictedandpenalizedmaximumlikelihoodmethodforefficientgenomewideassociationstudy AT xiaojing hybridofrestrictedandpenalizedmaximumlikelihoodmethodforefficientgenomewideassociationstudy |