Cargando…
(Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers
This study investigates nonlinear kernels for multitrait (MT) genomic prediction using support vector regression (SVR) models. We assessed the predictive ability delivered by single-trait (ST) and MT models for 2 carcass traits (CT1 and CT2) measured in purebred broiler chickens. The MT models also...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411556/ https://www.ncbi.nlm.nih.gov/pubmed/37216670 http://dx.doi.org/10.1093/g3journal/jkad109 |
_version_ | 1785086691097182208 |
---|---|
author | Alves, Anderson Antonio Carvalho Fernandes, Arthur Francisco Araujo Lopes, Fernando Brito Breen, Vivian Hawken, Rachel Gianola, Daniel Rosa, Guilherme Jordão de Magalhães |
author_facet | Alves, Anderson Antonio Carvalho Fernandes, Arthur Francisco Araujo Lopes, Fernando Brito Breen, Vivian Hawken, Rachel Gianola, Daniel Rosa, Guilherme Jordão de Magalhães |
author_sort | Alves, Anderson Antonio Carvalho |
collection | PubMed |
description | This study investigates nonlinear kernels for multitrait (MT) genomic prediction using support vector regression (SVR) models. We assessed the predictive ability delivered by single-trait (ST) and MT models for 2 carcass traits (CT1 and CT2) measured in purebred broiler chickens. The MT models also included information on indicator traits measured in vivo [Growth and feed efficiency trait (FE)]. We proposed an approach termed (quasi) multitask SVR (QMTSVR), with hyperparameter optimization performed via genetic algorithm. ST and MT Bayesian shrinkage and variable selection models [genomic best linear unbiased predictor (GBLUP), BayesC (BC), and reproducing kernel Hilbert space (RKHS) regression] were employed as benchmarks. MT models were trained using 2 validation designs (CV1 and CV2), which differ if the information on secondary traits is available in the testing set. Models’ predictive ability was assessed with prediction accuracy (ACC; i.e. the correlation between predicted and observed values, divided by the square root of phenotype accuracy), standardized root-mean-squared error (RMSE*), and inflation factor (b). To account for potential bias in CV2-style predictions, we also computed a parametric estimate of accuracy ([Formula: see text]). Predictive ability metrics varied according to trait, model, and validation design (CV1 or CV2), ranging from 0.71 to 0.84 for ACC, 0.78 to 0.92 for RMSE*, and between 0.82 and 1.34 for b. The highest ACC and smallest RMSE* were achieved with QMTSVR-CV2 in both traits. We observed that for CT1, model/validation design selection was sensitive to the choice of accuracy metric (ACC or ACC(par)). Nonetheless, the higher predictive accuracy of QMTSVR over MTGBLUP and MTBC was replicated across accuracy metrics, besides the similar performance between the proposed method and the MTRKHS model. Results showed that the proposed approach is competitive with conventional MT Bayesian regression models using either Gaussian or spike–slab multivariate priors. |
format | Online Article Text |
id | pubmed-10411556 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-104115562023-08-10 (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers Alves, Anderson Antonio Carvalho Fernandes, Arthur Francisco Araujo Lopes, Fernando Brito Breen, Vivian Hawken, Rachel Gianola, Daniel Rosa, Guilherme Jordão de Magalhães G3 (Bethesda) Genomic Prediction This study investigates nonlinear kernels for multitrait (MT) genomic prediction using support vector regression (SVR) models. We assessed the predictive ability delivered by single-trait (ST) and MT models for 2 carcass traits (CT1 and CT2) measured in purebred broiler chickens. The MT models also included information on indicator traits measured in vivo [Growth and feed efficiency trait (FE)]. We proposed an approach termed (quasi) multitask SVR (QMTSVR), with hyperparameter optimization performed via genetic algorithm. ST and MT Bayesian shrinkage and variable selection models [genomic best linear unbiased predictor (GBLUP), BayesC (BC), and reproducing kernel Hilbert space (RKHS) regression] were employed as benchmarks. MT models were trained using 2 validation designs (CV1 and CV2), which differ if the information on secondary traits is available in the testing set. Models’ predictive ability was assessed with prediction accuracy (ACC; i.e. the correlation between predicted and observed values, divided by the square root of phenotype accuracy), standardized root-mean-squared error (RMSE*), and inflation factor (b). To account for potential bias in CV2-style predictions, we also computed a parametric estimate of accuracy ([Formula: see text]). Predictive ability metrics varied according to trait, model, and validation design (CV1 or CV2), ranging from 0.71 to 0.84 for ACC, 0.78 to 0.92 for RMSE*, and between 0.82 and 1.34 for b. The highest ACC and smallest RMSE* were achieved with QMTSVR-CV2 in both traits. We observed that for CT1, model/validation design selection was sensitive to the choice of accuracy metric (ACC or ACC(par)). Nonetheless, the higher predictive accuracy of QMTSVR over MTGBLUP and MTBC was replicated across accuracy metrics, besides the similar performance between the proposed method and the MTRKHS model. Results showed that the proposed approach is competitive with conventional MT Bayesian regression models using either Gaussian or spike–slab multivariate priors. Oxford University Press 2023-05-22 /pmc/articles/PMC10411556/ /pubmed/37216670 http://dx.doi.org/10.1093/g3journal/jkad109 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of The Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Genomic Prediction Alves, Anderson Antonio Carvalho Fernandes, Arthur Francisco Araujo Lopes, Fernando Brito Breen, Vivian Hawken, Rachel Gianola, Daniel Rosa, Guilherme Jordão de Magalhães (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers |
title | (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers |
title_full | (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers |
title_fullStr | (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers |
title_full_unstemmed | (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers |
title_short | (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers |
title_sort | (quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers |
topic | Genomic Prediction |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411556/ https://www.ncbi.nlm.nih.gov/pubmed/37216670 http://dx.doi.org/10.1093/g3journal/jkad109 |
work_keys_str_mv | AT alvesandersonantoniocarvalho quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers AT fernandesarthurfranciscoaraujo quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers AT lopesfernandobrito quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers AT breenvivian quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers AT hawkenrachel quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers AT gianoladaniel quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers AT rosaguilhermejordaodemagalhaes quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers |