Cargando…

FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm

Genomic selection is an approach to select elite breeding stock based on the use of dense genetic markers and that has led to the development of various models to derive a predictive equation. However, the current genomic selection software faces several issues such as low prediction accuracy, low c...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Wenwu, Liu, Xiaodong, Liao, Mingfu, Xiao, Shijun, Zheng, Min, Yao, Tianxiong, Chen, Zuoquan, Huang, Lusheng, Zhang, Zhiyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8637923/
https://www.ncbi.nlm.nih.gov/pubmed/34868200
http://dx.doi.org/10.3389/fgene.2021.721600
_version_ 1784608846845575168
author Xu, Wenwu
Liu, Xiaodong
Liao, Mingfu
Xiao, Shijun
Zheng, Min
Yao, Tianxiong
Chen, Zuoquan
Huang, Lusheng
Zhang, Zhiyan
author_facet Xu, Wenwu
Liu, Xiaodong
Liao, Mingfu
Xiao, Shijun
Zheng, Min
Yao, Tianxiong
Chen, Zuoquan
Huang, Lusheng
Zhang, Zhiyan
author_sort Xu, Wenwu
collection PubMed
description Genomic selection is an approach to select elite breeding stock based on the use of dense genetic markers and that has led to the development of various models to derive a predictive equation. However, the current genomic selection software faces several issues such as low prediction accuracy, low computational efficiency, or an inability to handle large-scale sample data. We report the development of a genomic prediction model named FMixFN with four zero-mean normal distributions as the prior distributions to optimize the predictive ability and computing efficiency. The variance of the prior distributions in our model is precisely determined based on an F2 population, and genomic estimated breeding values (GEBV) can be obtained accurately and quickly in combination with an iterative conditional expectation algorithm. We demonstrated that FMixFN improves computational efficiency and predictive ability compared to other methods, such as GBLUP, SSgblup, MIX, BayesR, BayesA, and BayesB. Most importantly, FMixFN may handle large-scale sample data, and thus should be able to meet the needs of large breeding companies or combined breeding schedules. Our study developed a Bayes genomic selection model called FMixFN, which combines stable predictive ability and high computational efficiency, and is a big data-oriented genomic selection model that has potential in the future. The FMixFN method can be freely accessed at https://zenodo.org/record/5560913 (DOI: 10.5281/zenodo.5560913).
format Online
Article
Text
id pubmed-8637923
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86379232021-12-03 FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm Xu, Wenwu Liu, Xiaodong Liao, Mingfu Xiao, Shijun Zheng, Min Yao, Tianxiong Chen, Zuoquan Huang, Lusheng Zhang, Zhiyan Front Genet Genetics Genomic selection is an approach to select elite breeding stock based on the use of dense genetic markers and that has led to the development of various models to derive a predictive equation. However, the current genomic selection software faces several issues such as low prediction accuracy, low computational efficiency, or an inability to handle large-scale sample data. We report the development of a genomic prediction model named FMixFN with four zero-mean normal distributions as the prior distributions to optimize the predictive ability and computing efficiency. The variance of the prior distributions in our model is precisely determined based on an F2 population, and genomic estimated breeding values (GEBV) can be obtained accurately and quickly in combination with an iterative conditional expectation algorithm. We demonstrated that FMixFN improves computational efficiency and predictive ability compared to other methods, such as GBLUP, SSgblup, MIX, BayesR, BayesA, and BayesB. Most importantly, FMixFN may handle large-scale sample data, and thus should be able to meet the needs of large breeding companies or combined breeding schedules. Our study developed a Bayes genomic selection model called FMixFN, which combines stable predictive ability and high computational efficiency, and is a big data-oriented genomic selection model that has potential in the future. The FMixFN method can be freely accessed at https://zenodo.org/record/5560913 (DOI: 10.5281/zenodo.5560913). Frontiers Media S.A. 2021-11-18 /pmc/articles/PMC8637923/ /pubmed/34868200 http://dx.doi.org/10.3389/fgene.2021.721600 Text en Copyright © 2021 Xu, Liu, Liao, Xiao, Zheng, Yao, Chen, Huang and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Xu, Wenwu
Liu, Xiaodong
Liao, Mingfu
Xiao, Shijun
Zheng, Min
Yao, Tianxiong
Chen, Zuoquan
Huang, Lusheng
Zhang, Zhiyan
FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm
title FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm
title_full FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm
title_fullStr FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm
title_full_unstemmed FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm
title_short FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm
title_sort fmixfn: a fast big data-oriented genomic selection model based on an iterative conditional expectation algorithm
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8637923/
https://www.ncbi.nlm.nih.gov/pubmed/34868200
http://dx.doi.org/10.3389/fgene.2021.721600
work_keys_str_mv AT xuwenwu fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm
AT liuxiaodong fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm
AT liaomingfu fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm
AT xiaoshijun fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm
AT zhengmin fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm
AT yaotianxiong fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm
AT chenzuoquan fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm
AT huanglusheng fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm
AT zhangzhiyan fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm