Cargando…

A Multiple-SNP Approach for Genome-Wide Association Study of Milk Production Traits in Chinese Holstein Cattle

The multiple-SNP analysis has been studied by many researchers, in which the effects of multiple SNPs are simultaneously estimated and tested in a multiple linear regression. The multiple-SNP association analysis usually has higher power and lower false-positive rate for detecting causative SNP(s) t...

Descripción completa

Detalles Bibliográficos
Autores principales: Fang, Ming, Fu, Weixuan, Jiang, Dan, Zhang, Qin, Sun, Dongxiao, Ding, Xiangdong, Liu, Jianfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4141689/
https://www.ncbi.nlm.nih.gov/pubmed/25148050
http://dx.doi.org/10.1371/journal.pone.0099544
_version_ 1782331672721620992
author Fang, Ming
Fu, Weixuan
Jiang, Dan
Zhang, Qin
Sun, Dongxiao
Ding, Xiangdong
Liu, Jianfeng
author_facet Fang, Ming
Fu, Weixuan
Jiang, Dan
Zhang, Qin
Sun, Dongxiao
Ding, Xiangdong
Liu, Jianfeng
author_sort Fang, Ming
collection PubMed
description The multiple-SNP analysis has been studied by many researchers, in which the effects of multiple SNPs are simultaneously estimated and tested in a multiple linear regression. The multiple-SNP association analysis usually has higher power and lower false-positive rate for detecting causative SNP(s) than single marker analysis (SMA). Several methods have been proposed to simultaneously estimate and test multiple SNP effects. In this research, a fast method called MEML (Mixed model based Expectation-Maximization Lasso algorithm) was developed for simultaneously estimate of multiple SNP effects. An improved Lasso prior was assigned to SNP effects which were estimated by searching the maximum joint posterior mode. The residual polygenic effect was included in the model to absorb many tiny SNP effects, which is treated as missing data in our EM algorithm. A series of simulation experiments were conducted to validate the proposed method, and the results showed that compared with SMMA, the new method can dramatically decrease the false-positive rate. The new method was also applied to the 50k SNP-panel dataset for genome-wide association study of milk production traits in Chinese Holstein cattle. Totally, 39 significant SNPs and their nearby 25 genes were found. The number of significant SNPs is remarkably fewer than that by SMMA which found 105 significant SNPs. Among 39 significant SNPs, 8 were also found by SMMA and several well-known QTLs or genes were confirmed again; furthermore, we also got some positional candidate gene with potential function of effecting milk production traits. These novel findings in our research should be valuable for further investigation.
format Online
Article
Text
id pubmed-4141689
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41416892014-08-25 A Multiple-SNP Approach for Genome-Wide Association Study of Milk Production Traits in Chinese Holstein Cattle Fang, Ming Fu, Weixuan Jiang, Dan Zhang, Qin Sun, Dongxiao Ding, Xiangdong Liu, Jianfeng PLoS One Research Article The multiple-SNP analysis has been studied by many researchers, in which the effects of multiple SNPs are simultaneously estimated and tested in a multiple linear regression. The multiple-SNP association analysis usually has higher power and lower false-positive rate for detecting causative SNP(s) than single marker analysis (SMA). Several methods have been proposed to simultaneously estimate and test multiple SNP effects. In this research, a fast method called MEML (Mixed model based Expectation-Maximization Lasso algorithm) was developed for simultaneously estimate of multiple SNP effects. An improved Lasso prior was assigned to SNP effects which were estimated by searching the maximum joint posterior mode. The residual polygenic effect was included in the model to absorb many tiny SNP effects, which is treated as missing data in our EM algorithm. A series of simulation experiments were conducted to validate the proposed method, and the results showed that compared with SMMA, the new method can dramatically decrease the false-positive rate. The new method was also applied to the 50k SNP-panel dataset for genome-wide association study of milk production traits in Chinese Holstein cattle. Totally, 39 significant SNPs and their nearby 25 genes were found. The number of significant SNPs is remarkably fewer than that by SMMA which found 105 significant SNPs. Among 39 significant SNPs, 8 were also found by SMMA and several well-known QTLs or genes were confirmed again; furthermore, we also got some positional candidate gene with potential function of effecting milk production traits. These novel findings in our research should be valuable for further investigation. Public Library of Science 2014-08-22 /pmc/articles/PMC4141689/ /pubmed/25148050 http://dx.doi.org/10.1371/journal.pone.0099544 Text en © 2014 Fang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Fang, Ming
Fu, Weixuan
Jiang, Dan
Zhang, Qin
Sun, Dongxiao
Ding, Xiangdong
Liu, Jianfeng
A Multiple-SNP Approach for Genome-Wide Association Study of Milk Production Traits in Chinese Holstein Cattle
title A Multiple-SNP Approach for Genome-Wide Association Study of Milk Production Traits in Chinese Holstein Cattle
title_full A Multiple-SNP Approach for Genome-Wide Association Study of Milk Production Traits in Chinese Holstein Cattle
title_fullStr A Multiple-SNP Approach for Genome-Wide Association Study of Milk Production Traits in Chinese Holstein Cattle
title_full_unstemmed A Multiple-SNP Approach for Genome-Wide Association Study of Milk Production Traits in Chinese Holstein Cattle
title_short A Multiple-SNP Approach for Genome-Wide Association Study of Milk Production Traits in Chinese Holstein Cattle
title_sort multiple-snp approach for genome-wide association study of milk production traits in chinese holstein cattle
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4141689/
https://www.ncbi.nlm.nih.gov/pubmed/25148050
http://dx.doi.org/10.1371/journal.pone.0099544
work_keys_str_mv AT fangming amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT fuweixuan amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT jiangdan amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT zhangqin amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT sundongxiao amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT dingxiangdong amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT liujianfeng amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT fangming multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT fuweixuan multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT jiangdan multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT zhangqin multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT sundongxiao multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT dingxiangdong multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT liujianfeng multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle