Cargando…

An Improved Bayesian Shrinkage Regression Algorithm for Genomic Selection

Currently a hot topic, genomic selection (GS) has consistently provided powerful support for breeding studies and achieved more comprehensive and reliable selection in animal and plant breeding. GS estimates the effects of all single nucleotide polymorphisms (SNPs) and thereby predicts the genomic e...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Jin, Li, Ling, Lv, Mingming, Wang, Yidi, Qiu, Wenzhe, An, Yuan, Zhang, Ye, Wan, Yuxuan, Xu, Yu, Chen, Juncong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9778053/
https://www.ncbi.nlm.nih.gov/pubmed/36553460
http://dx.doi.org/10.3390/genes13122193
_version_ 1784856261679906816
author Zhang, Jin
Li, Ling
Lv, Mingming
Wang, Yidi
Qiu, Wenzhe
An, Yuan
Zhang, Ye
Wan, Yuxuan
Xu, Yu
Chen, Juncong
author_facet Zhang, Jin
Li, Ling
Lv, Mingming
Wang, Yidi
Qiu, Wenzhe
An, Yuan
Zhang, Ye
Wan, Yuxuan
Xu, Yu
Chen, Juncong
author_sort Zhang, Jin
collection PubMed
description Currently a hot topic, genomic selection (GS) has consistently provided powerful support for breeding studies and achieved more comprehensive and reliable selection in animal and plant breeding. GS estimates the effects of all single nucleotide polymorphisms (SNPs) and thereby predicts the genomic estimation of breeding value (GEBV), accelerating breeding progress and overcoming the limitations of conventional breeding. The successful application of GS primarily depends on the accuracy of the GEBV. Adopting appropriate advanced algorithms to improve the accuracy of the GEBV is time-saving and efficient for breeders, and the available algorithms can be further improved in the big data era. In this study, we develop a new algorithm under the Bayesian Shrinkage Regression (BSR, which is called BayesA) framework, an improved expectation-maximization algorithm for BayesA (emBAI). The emBAI algorithm first corrects the polygenic and environmental noise and then calculates the GEBV by emBayesA. We conduct two simulation experiments and a real dataset analysis for flowering time-related Arabidopsis phenotypes to validate the new algorithm. Compared to established methods, emBAI is more powerful in terms of prediction accuracy, mean square error (MSE), mean absolute error (MAE), the area under the receiver operating characteristic curve (AUC) and correlation of prediction in simulation studies. In addition, emBAI performs well under the increasing genetic background. The analysis of the Arabidopsis real dataset further illustrates the benefits of emBAI for genomic prediction according to prediction accuracy, MSE, MAE and correlation of prediction. Furthermore, the new method shows the advantages of significant loci detection and effect coefficient estimation, which are confirmed by The Arabidopsis Information Resource (TAIR) gene bank. In conclusion, the emBAI algorithm provides powerful support for GS in high-dimensional genomic datasets.
format Online
Article
Text
id pubmed-9778053
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-97780532022-12-23 An Improved Bayesian Shrinkage Regression Algorithm for Genomic Selection Zhang, Jin Li, Ling Lv, Mingming Wang, Yidi Qiu, Wenzhe An, Yuan Zhang, Ye Wan, Yuxuan Xu, Yu Chen, Juncong Genes (Basel) Article Currently a hot topic, genomic selection (GS) has consistently provided powerful support for breeding studies and achieved more comprehensive and reliable selection in animal and plant breeding. GS estimates the effects of all single nucleotide polymorphisms (SNPs) and thereby predicts the genomic estimation of breeding value (GEBV), accelerating breeding progress and overcoming the limitations of conventional breeding. The successful application of GS primarily depends on the accuracy of the GEBV. Adopting appropriate advanced algorithms to improve the accuracy of the GEBV is time-saving and efficient for breeders, and the available algorithms can be further improved in the big data era. In this study, we develop a new algorithm under the Bayesian Shrinkage Regression (BSR, which is called BayesA) framework, an improved expectation-maximization algorithm for BayesA (emBAI). The emBAI algorithm first corrects the polygenic and environmental noise and then calculates the GEBV by emBayesA. We conduct two simulation experiments and a real dataset analysis for flowering time-related Arabidopsis phenotypes to validate the new algorithm. Compared to established methods, emBAI is more powerful in terms of prediction accuracy, mean square error (MSE), mean absolute error (MAE), the area under the receiver operating characteristic curve (AUC) and correlation of prediction in simulation studies. In addition, emBAI performs well under the increasing genetic background. The analysis of the Arabidopsis real dataset further illustrates the benefits of emBAI for genomic prediction according to prediction accuracy, MSE, MAE and correlation of prediction. Furthermore, the new method shows the advantages of significant loci detection and effect coefficient estimation, which are confirmed by The Arabidopsis Information Resource (TAIR) gene bank. In conclusion, the emBAI algorithm provides powerful support for GS in high-dimensional genomic datasets. MDPI 2022-11-23 /pmc/articles/PMC9778053/ /pubmed/36553460 http://dx.doi.org/10.3390/genes13122193 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Jin
Li, Ling
Lv, Mingming
Wang, Yidi
Qiu, Wenzhe
An, Yuan
Zhang, Ye
Wan, Yuxuan
Xu, Yu
Chen, Juncong
An Improved Bayesian Shrinkage Regression Algorithm for Genomic Selection
title An Improved Bayesian Shrinkage Regression Algorithm for Genomic Selection
title_full An Improved Bayesian Shrinkage Regression Algorithm for Genomic Selection
title_fullStr An Improved Bayesian Shrinkage Regression Algorithm for Genomic Selection
title_full_unstemmed An Improved Bayesian Shrinkage Regression Algorithm for Genomic Selection
title_short An Improved Bayesian Shrinkage Regression Algorithm for Genomic Selection
title_sort improved bayesian shrinkage regression algorithm for genomic selection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9778053/
https://www.ncbi.nlm.nih.gov/pubmed/36553460
http://dx.doi.org/10.3390/genes13122193
work_keys_str_mv AT zhangjin animprovedbayesianshrinkageregressionalgorithmforgenomicselection
AT liling animprovedbayesianshrinkageregressionalgorithmforgenomicselection
AT lvmingming animprovedbayesianshrinkageregressionalgorithmforgenomicselection
AT wangyidi animprovedbayesianshrinkageregressionalgorithmforgenomicselection
AT qiuwenzhe animprovedbayesianshrinkageregressionalgorithmforgenomicselection
AT anyuan animprovedbayesianshrinkageregressionalgorithmforgenomicselection
AT zhangye animprovedbayesianshrinkageregressionalgorithmforgenomicselection
AT wanyuxuan animprovedbayesianshrinkageregressionalgorithmforgenomicselection
AT xuyu animprovedbayesianshrinkageregressionalgorithmforgenomicselection
AT chenjuncong animprovedbayesianshrinkageregressionalgorithmforgenomicselection
AT zhangjin improvedbayesianshrinkageregressionalgorithmforgenomicselection
AT liling improvedbayesianshrinkageregressionalgorithmforgenomicselection
AT lvmingming improvedbayesianshrinkageregressionalgorithmforgenomicselection
AT wangyidi improvedbayesianshrinkageregressionalgorithmforgenomicselection
AT qiuwenzhe improvedbayesianshrinkageregressionalgorithmforgenomicselection
AT anyuan improvedbayesianshrinkageregressionalgorithmforgenomicselection
AT zhangye improvedbayesianshrinkageregressionalgorithmforgenomicselection
AT wanyuxuan improvedbayesianshrinkageregressionalgorithmforgenomicselection
AT xuyu improvedbayesianshrinkageregressionalgorithmforgenomicselection
AT chenjuncong improvedbayesianshrinkageregressionalgorithmforgenomicselection