Cargando…
FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm
Genomic selection is an approach to select elite breeding stock based on the use of dense genetic markers and that has led to the development of various models to derive a predictive equation. However, the current genomic selection software faces several issues such as low prediction accuracy, low c...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8637923/ https://www.ncbi.nlm.nih.gov/pubmed/34868200 http://dx.doi.org/10.3389/fgene.2021.721600 |
_version_ | 1784608846845575168 |
---|---|
author | Xu, Wenwu Liu, Xiaodong Liao, Mingfu Xiao, Shijun Zheng, Min Yao, Tianxiong Chen, Zuoquan Huang, Lusheng Zhang, Zhiyan |
author_facet | Xu, Wenwu Liu, Xiaodong Liao, Mingfu Xiao, Shijun Zheng, Min Yao, Tianxiong Chen, Zuoquan Huang, Lusheng Zhang, Zhiyan |
author_sort | Xu, Wenwu |
collection | PubMed |
description | Genomic selection is an approach to select elite breeding stock based on the use of dense genetic markers and that has led to the development of various models to derive a predictive equation. However, the current genomic selection software faces several issues such as low prediction accuracy, low computational efficiency, or an inability to handle large-scale sample data. We report the development of a genomic prediction model named FMixFN with four zero-mean normal distributions as the prior distributions to optimize the predictive ability and computing efficiency. The variance of the prior distributions in our model is precisely determined based on an F2 population, and genomic estimated breeding values (GEBV) can be obtained accurately and quickly in combination with an iterative conditional expectation algorithm. We demonstrated that FMixFN improves computational efficiency and predictive ability compared to other methods, such as GBLUP, SSgblup, MIX, BayesR, BayesA, and BayesB. Most importantly, FMixFN may handle large-scale sample data, and thus should be able to meet the needs of large breeding companies or combined breeding schedules. Our study developed a Bayes genomic selection model called FMixFN, which combines stable predictive ability and high computational efficiency, and is a big data-oriented genomic selection model that has potential in the future. The FMixFN method can be freely accessed at https://zenodo.org/record/5560913 (DOI: 10.5281/zenodo.5560913). |
format | Online Article Text |
id | pubmed-8637923 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-86379232021-12-03 FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm Xu, Wenwu Liu, Xiaodong Liao, Mingfu Xiao, Shijun Zheng, Min Yao, Tianxiong Chen, Zuoquan Huang, Lusheng Zhang, Zhiyan Front Genet Genetics Genomic selection is an approach to select elite breeding stock based on the use of dense genetic markers and that has led to the development of various models to derive a predictive equation. However, the current genomic selection software faces several issues such as low prediction accuracy, low computational efficiency, or an inability to handle large-scale sample data. We report the development of a genomic prediction model named FMixFN with four zero-mean normal distributions as the prior distributions to optimize the predictive ability and computing efficiency. The variance of the prior distributions in our model is precisely determined based on an F2 population, and genomic estimated breeding values (GEBV) can be obtained accurately and quickly in combination with an iterative conditional expectation algorithm. We demonstrated that FMixFN improves computational efficiency and predictive ability compared to other methods, such as GBLUP, SSgblup, MIX, BayesR, BayesA, and BayesB. Most importantly, FMixFN may handle large-scale sample data, and thus should be able to meet the needs of large breeding companies or combined breeding schedules. Our study developed a Bayes genomic selection model called FMixFN, which combines stable predictive ability and high computational efficiency, and is a big data-oriented genomic selection model that has potential in the future. The FMixFN method can be freely accessed at https://zenodo.org/record/5560913 (DOI: 10.5281/zenodo.5560913). Frontiers Media S.A. 2021-11-18 /pmc/articles/PMC8637923/ /pubmed/34868200 http://dx.doi.org/10.3389/fgene.2021.721600 Text en Copyright © 2021 Xu, Liu, Liao, Xiao, Zheng, Yao, Chen, Huang and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Xu, Wenwu Liu, Xiaodong Liao, Mingfu Xiao, Shijun Zheng, Min Yao, Tianxiong Chen, Zuoquan Huang, Lusheng Zhang, Zhiyan FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm |
title | FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm |
title_full | FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm |
title_fullStr | FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm |
title_full_unstemmed | FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm |
title_short | FMixFN: A Fast Big Data-Oriented Genomic Selection Model Based on an Iterative Conditional Expectation algorithm |
title_sort | fmixfn: a fast big data-oriented genomic selection model based on an iterative conditional expectation algorithm |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8637923/ https://www.ncbi.nlm.nih.gov/pubmed/34868200 http://dx.doi.org/10.3389/fgene.2021.721600 |
work_keys_str_mv | AT xuwenwu fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm AT liuxiaodong fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm AT liaomingfu fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm AT xiaoshijun fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm AT zhengmin fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm AT yaotianxiong fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm AT chenzuoquan fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm AT huanglusheng fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm AT zhangzhiyan fmixfnafastbigdataorientedgenomicselectionmodelbasedonaniterativeconditionalexpectationalgorithm |