Cargando…

G2P Provides an Integrative Environment for Multi-model genomic selection analysis to improve genotype-to-phenotype prediction

Genotype-to-phenotype (G2P) prediction has become a mainstream paradigm to facilitate genomic selection (GS)-assisted breeding in the seed industry. Many methods have been introduced for building GS models, but their prediction precision may vary depending on species and specific traits. Therefore,...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Qian, Jiang, Shan, Li, Tong, Qiu, Zhixu, Yan, Jun, Fu, Ran, Ma, Chuang, Wang, Xiangfeng, Jiang, Shuqin, Cheng, Qian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10437076/
https://www.ncbi.nlm.nih.gov/pubmed/37600179
http://dx.doi.org/10.3389/fpls.2023.1207139
_version_ 1785092430725382144
author Wang, Qian
Jiang, Shan
Li, Tong
Qiu, Zhixu
Yan, Jun
Fu, Ran
Ma, Chuang
Wang, Xiangfeng
Jiang, Shuqin
Cheng, Qian
author_facet Wang, Qian
Jiang, Shan
Li, Tong
Qiu, Zhixu
Yan, Jun
Fu, Ran
Ma, Chuang
Wang, Xiangfeng
Jiang, Shuqin
Cheng, Qian
author_sort Wang, Qian
collection PubMed
description Genotype-to-phenotype (G2P) prediction has become a mainstream paradigm to facilitate genomic selection (GS)-assisted breeding in the seed industry. Many methods have been introduced for building GS models, but their prediction precision may vary depending on species and specific traits. Therefore, evaluation of multiple models and selection of the appropriate one is crucial to effective GS analysis. Here, we present the G2P container developed for the Singularity platform, which not only contains a library of 16 state-of-the-art GS models and 13 evaluation metrics. G2P works as an integrative environment offering comprehensive, unbiased evaluation analyses of the 16 GS models, which may be run in parallel on high-performance computing clusters. Based on the evaluation outcome, G2P performs auto-ensemble algorithms that not only can automatically select the most precise models but also can integrate prediction results from multiple models. This functionality should further improve the precision of G2P prediction. Another noteworthy function is the refinement design of the training set, in which G2P optimizes the training set based on the genetic diversity analysis of a studied population. Although the training samples in the optimized set are fewer than in the original set, the prediction precision is almost equivalent to that obtained when using the whole set. This functionality is quite useful in practice, as it reduces the cost of phenotyping when constructing training population. The G2P container and source codes are freely accessible at https://g2p-env.github.io/.
format Online
Article
Text
id pubmed-10437076
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-104370762023-08-19 G2P Provides an Integrative Environment for Multi-model genomic selection analysis to improve genotype-to-phenotype prediction Wang, Qian Jiang, Shan Li, Tong Qiu, Zhixu Yan, Jun Fu, Ran Ma, Chuang Wang, Xiangfeng Jiang, Shuqin Cheng, Qian Front Plant Sci Plant Science Genotype-to-phenotype (G2P) prediction has become a mainstream paradigm to facilitate genomic selection (GS)-assisted breeding in the seed industry. Many methods have been introduced for building GS models, but their prediction precision may vary depending on species and specific traits. Therefore, evaluation of multiple models and selection of the appropriate one is crucial to effective GS analysis. Here, we present the G2P container developed for the Singularity platform, which not only contains a library of 16 state-of-the-art GS models and 13 evaluation metrics. G2P works as an integrative environment offering comprehensive, unbiased evaluation analyses of the 16 GS models, which may be run in parallel on high-performance computing clusters. Based on the evaluation outcome, G2P performs auto-ensemble algorithms that not only can automatically select the most precise models but also can integrate prediction results from multiple models. This functionality should further improve the precision of G2P prediction. Another noteworthy function is the refinement design of the training set, in which G2P optimizes the training set based on the genetic diversity analysis of a studied population. Although the training samples in the optimized set are fewer than in the original set, the prediction precision is almost equivalent to that obtained when using the whole set. This functionality is quite useful in practice, as it reduces the cost of phenotyping when constructing training population. The G2P container and source codes are freely accessible at https://g2p-env.github.io/. Frontiers Media S.A. 2023-08-04 /pmc/articles/PMC10437076/ /pubmed/37600179 http://dx.doi.org/10.3389/fpls.2023.1207139 Text en Copyright © 2023 Wang, Jiang, Li, Qiu, Yan, Fu, Ma, Wang, Jiang and Cheng https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Wang, Qian
Jiang, Shan
Li, Tong
Qiu, Zhixu
Yan, Jun
Fu, Ran
Ma, Chuang
Wang, Xiangfeng
Jiang, Shuqin
Cheng, Qian
G2P Provides an Integrative Environment for Multi-model genomic selection analysis to improve genotype-to-phenotype prediction
title G2P Provides an Integrative Environment for Multi-model genomic selection analysis to improve genotype-to-phenotype prediction
title_full G2P Provides an Integrative Environment for Multi-model genomic selection analysis to improve genotype-to-phenotype prediction
title_fullStr G2P Provides an Integrative Environment for Multi-model genomic selection analysis to improve genotype-to-phenotype prediction
title_full_unstemmed G2P Provides an Integrative Environment for Multi-model genomic selection analysis to improve genotype-to-phenotype prediction
title_short G2P Provides an Integrative Environment for Multi-model genomic selection analysis to improve genotype-to-phenotype prediction
title_sort g2p provides an integrative environment for multi-model genomic selection analysis to improve genotype-to-phenotype prediction
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10437076/
https://www.ncbi.nlm.nih.gov/pubmed/37600179
http://dx.doi.org/10.3389/fpls.2023.1207139
work_keys_str_mv AT wangqian g2pprovidesanintegrativeenvironmentformultimodelgenomicselectionanalysistoimprovegenotypetophenotypeprediction
AT jiangshan g2pprovidesanintegrativeenvironmentformultimodelgenomicselectionanalysistoimprovegenotypetophenotypeprediction
AT litong g2pprovidesanintegrativeenvironmentformultimodelgenomicselectionanalysistoimprovegenotypetophenotypeprediction
AT qiuzhixu g2pprovidesanintegrativeenvironmentformultimodelgenomicselectionanalysistoimprovegenotypetophenotypeprediction
AT yanjun g2pprovidesanintegrativeenvironmentformultimodelgenomicselectionanalysistoimprovegenotypetophenotypeprediction
AT furan g2pprovidesanintegrativeenvironmentformultimodelgenomicselectionanalysistoimprovegenotypetophenotypeprediction
AT machuang g2pprovidesanintegrativeenvironmentformultimodelgenomicselectionanalysistoimprovegenotypetophenotypeprediction
AT wangxiangfeng g2pprovidesanintegrativeenvironmentformultimodelgenomicselectionanalysistoimprovegenotypetophenotypeprediction
AT jiangshuqin g2pprovidesanintegrativeenvironmentformultimodelgenomicselectionanalysistoimprovegenotypetophenotypeprediction
AT chengqian g2pprovidesanintegrativeenvironmentformultimodelgenomicselectionanalysistoimprovegenotypetophenotypeprediction