Cargando…

Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology

Genome-wide association studies (GWAS) have been widely used in genetic dissection of complex traits. However, common methods are all based on a fixed-SNP-effect mixed linear model (MLM) and single marker analysis, such as efficient mixed model analysis (EMMA). These methods require Bonferroni corre...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Shi-Bo, Feng, Jian-Ying, Ren, Wen-Long, Huang, Bo, Zhou, Ling, Wen, Yang-Jun, Zhang, Jin, Dunwell, Jim M., Xu, Shizhong, Zhang, Yuan-Ming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4726296/
https://www.ncbi.nlm.nih.gov/pubmed/26787347
http://dx.doi.org/10.1038/srep19444
_version_ 1782411791644491776
author Wang, Shi-Bo
Feng, Jian-Ying
Ren, Wen-Long
Huang, Bo
Zhou, Ling
Wen, Yang-Jun
Zhang, Jin
Dunwell, Jim M.
Xu, Shizhong
Zhang, Yuan-Ming
author_facet Wang, Shi-Bo
Feng, Jian-Ying
Ren, Wen-Long
Huang, Bo
Zhou, Ling
Wen, Yang-Jun
Zhang, Jin
Dunwell, Jim M.
Xu, Shizhong
Zhang, Yuan-Ming
author_sort Wang, Shi-Bo
collection PubMed
description Genome-wide association studies (GWAS) have been widely used in genetic dissection of complex traits. However, common methods are all based on a fixed-SNP-effect mixed linear model (MLM) and single marker analysis, such as efficient mixed model analysis (EMMA). These methods require Bonferroni correction for multiple tests, which often is too conservative when the number of markers is extremely large. To address this concern, we proposed a random-SNP-effect MLM (RMLM) and a multi-locus RMLM (MRMLM) for GWAS. The RMLM simply treats the SNP-effect as random, but it allows a modified Bonferroni correction to be used to calculate the threshold p value for significance tests. The MRMLM is a multi-locus model including markers selected from the RMLM method with a less stringent selection criterion. Due to the multi-locus nature, no multiple test correction is needed. Simulation studies show that the MRMLM is more powerful in QTN detection and more accurate in QTN effect estimation than the RMLM, which in turn is more powerful and accurate than the EMMA. To demonstrate the new methods, we analyzed six flowering time related traits in Arabidopsis thaliana and detected more genes than previous reported using the EMMA. Therefore, the MRMLM provides an alternative for multi-locus GWAS.
format Online
Article
Text
id pubmed-4726296
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-47262962016-01-27 Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology Wang, Shi-Bo Feng, Jian-Ying Ren, Wen-Long Huang, Bo Zhou, Ling Wen, Yang-Jun Zhang, Jin Dunwell, Jim M. Xu, Shizhong Zhang, Yuan-Ming Sci Rep Article Genome-wide association studies (GWAS) have been widely used in genetic dissection of complex traits. However, common methods are all based on a fixed-SNP-effect mixed linear model (MLM) and single marker analysis, such as efficient mixed model analysis (EMMA). These methods require Bonferroni correction for multiple tests, which often is too conservative when the number of markers is extremely large. To address this concern, we proposed a random-SNP-effect MLM (RMLM) and a multi-locus RMLM (MRMLM) for GWAS. The RMLM simply treats the SNP-effect as random, but it allows a modified Bonferroni correction to be used to calculate the threshold p value for significance tests. The MRMLM is a multi-locus model including markers selected from the RMLM method with a less stringent selection criterion. Due to the multi-locus nature, no multiple test correction is needed. Simulation studies show that the MRMLM is more powerful in QTN detection and more accurate in QTN effect estimation than the RMLM, which in turn is more powerful and accurate than the EMMA. To demonstrate the new methods, we analyzed six flowering time related traits in Arabidopsis thaliana and detected more genes than previous reported using the EMMA. Therefore, the MRMLM provides an alternative for multi-locus GWAS. Nature Publishing Group 2016-01-20 /pmc/articles/PMC4726296/ /pubmed/26787347 http://dx.doi.org/10.1038/srep19444 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Wang, Shi-Bo
Feng, Jian-Ying
Ren, Wen-Long
Huang, Bo
Zhou, Ling
Wen, Yang-Jun
Zhang, Jin
Dunwell, Jim M.
Xu, Shizhong
Zhang, Yuan-Ming
Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology
title Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology
title_full Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology
title_fullStr Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology
title_full_unstemmed Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology
title_short Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology
title_sort improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4726296/
https://www.ncbi.nlm.nih.gov/pubmed/26787347
http://dx.doi.org/10.1038/srep19444
work_keys_str_mv AT wangshibo improvingpowerandaccuracyofgenomewideassociationstudiesviaamultilocusmixedlinearmodelmethodology
AT fengjianying improvingpowerandaccuracyofgenomewideassociationstudiesviaamultilocusmixedlinearmodelmethodology
AT renwenlong improvingpowerandaccuracyofgenomewideassociationstudiesviaamultilocusmixedlinearmodelmethodology
AT huangbo improvingpowerandaccuracyofgenomewideassociationstudiesviaamultilocusmixedlinearmodelmethodology
AT zhouling improvingpowerandaccuracyofgenomewideassociationstudiesviaamultilocusmixedlinearmodelmethodology
AT wenyangjun improvingpowerandaccuracyofgenomewideassociationstudiesviaamultilocusmixedlinearmodelmethodology
AT zhangjin improvingpowerandaccuracyofgenomewideassociationstudiesviaamultilocusmixedlinearmodelmethodology
AT dunwelljimm improvingpowerandaccuracyofgenomewideassociationstudiesviaamultilocusmixedlinearmodelmethodology
AT xushizhong improvingpowerandaccuracyofgenomewideassociationstudiesviaamultilocusmixedlinearmodelmethodology
AT zhangyuanming improvingpowerandaccuracyofgenomewideassociationstudiesviaamultilocusmixedlinearmodelmethodology