Cargando…

(Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers

This study investigates nonlinear kernels for multitrait (MT) genomic prediction using support vector regression (SVR) models. We assessed the predictive ability delivered by single-trait (ST) and MT models for 2 carcass traits (CT1 and CT2) measured in purebred broiler chickens. The MT models also...

Descripción completa

Detalles Bibliográficos
Autores principales: Alves, Anderson Antonio Carvalho, Fernandes, Arthur Francisco Araujo, Lopes, Fernando Brito, Breen, Vivian, Hawken, Rachel, Gianola, Daniel, Rosa, Guilherme Jordão de Magalhães
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411556/
https://www.ncbi.nlm.nih.gov/pubmed/37216670
http://dx.doi.org/10.1093/g3journal/jkad109
_version_ 1785086691097182208
author Alves, Anderson Antonio Carvalho
Fernandes, Arthur Francisco Araujo
Lopes, Fernando Brito
Breen, Vivian
Hawken, Rachel
Gianola, Daniel
Rosa, Guilherme Jordão de Magalhães
author_facet Alves, Anderson Antonio Carvalho
Fernandes, Arthur Francisco Araujo
Lopes, Fernando Brito
Breen, Vivian
Hawken, Rachel
Gianola, Daniel
Rosa, Guilherme Jordão de Magalhães
author_sort Alves, Anderson Antonio Carvalho
collection PubMed
description This study investigates nonlinear kernels for multitrait (MT) genomic prediction using support vector regression (SVR) models. We assessed the predictive ability delivered by single-trait (ST) and MT models for 2 carcass traits (CT1 and CT2) measured in purebred broiler chickens. The MT models also included information on indicator traits measured in vivo [Growth and feed efficiency trait (FE)]. We proposed an approach termed (quasi) multitask SVR (QMTSVR), with hyperparameter optimization performed via genetic algorithm. ST and MT Bayesian shrinkage and variable selection models [genomic best linear unbiased predictor (GBLUP), BayesC (BC), and reproducing kernel Hilbert space (RKHS) regression] were employed as benchmarks. MT models were trained using 2 validation designs (CV1 and CV2), which differ if the information on secondary traits is available in the testing set. Models’ predictive ability was assessed with prediction accuracy (ACC; i.e. the correlation between predicted and observed values, divided by the square root of phenotype accuracy), standardized root-mean-squared error (RMSE*), and inflation factor (b). To account for potential bias in CV2-style predictions, we also computed a parametric estimate of accuracy ([Formula: see text]). Predictive ability metrics varied according to trait, model, and validation design (CV1 or CV2), ranging from 0.71 to 0.84 for ACC, 0.78 to 0.92 for RMSE*, and between 0.82 and 1.34 for b. The highest ACC and smallest RMSE* were achieved with QMTSVR-CV2 in both traits. We observed that for CT1, model/validation design selection was sensitive to the choice of accuracy metric (ACC or ACC(par)). Nonetheless, the higher predictive accuracy of QMTSVR over MTGBLUP and MTBC was replicated across accuracy metrics, besides the similar performance between the proposed method and the MTRKHS model. Results showed that the proposed approach is competitive with conventional MT Bayesian regression models using either Gaussian or spike–slab multivariate priors.
format Online
Article
Text
id pubmed-10411556
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-104115562023-08-10 (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers Alves, Anderson Antonio Carvalho Fernandes, Arthur Francisco Araujo Lopes, Fernando Brito Breen, Vivian Hawken, Rachel Gianola, Daniel Rosa, Guilherme Jordão de Magalhães G3 (Bethesda) Genomic Prediction This study investigates nonlinear kernels for multitrait (MT) genomic prediction using support vector regression (SVR) models. We assessed the predictive ability delivered by single-trait (ST) and MT models for 2 carcass traits (CT1 and CT2) measured in purebred broiler chickens. The MT models also included information on indicator traits measured in vivo [Growth and feed efficiency trait (FE)]. We proposed an approach termed (quasi) multitask SVR (QMTSVR), with hyperparameter optimization performed via genetic algorithm. ST and MT Bayesian shrinkage and variable selection models [genomic best linear unbiased predictor (GBLUP), BayesC (BC), and reproducing kernel Hilbert space (RKHS) regression] were employed as benchmarks. MT models were trained using 2 validation designs (CV1 and CV2), which differ if the information on secondary traits is available in the testing set. Models’ predictive ability was assessed with prediction accuracy (ACC; i.e. the correlation between predicted and observed values, divided by the square root of phenotype accuracy), standardized root-mean-squared error (RMSE*), and inflation factor (b). To account for potential bias in CV2-style predictions, we also computed a parametric estimate of accuracy ([Formula: see text]). Predictive ability metrics varied according to trait, model, and validation design (CV1 or CV2), ranging from 0.71 to 0.84 for ACC, 0.78 to 0.92 for RMSE*, and between 0.82 and 1.34 for b. The highest ACC and smallest RMSE* were achieved with QMTSVR-CV2 in both traits. We observed that for CT1, model/validation design selection was sensitive to the choice of accuracy metric (ACC or ACC(par)). Nonetheless, the higher predictive accuracy of QMTSVR over MTGBLUP and MTBC was replicated across accuracy metrics, besides the similar performance between the proposed method and the MTRKHS model. Results showed that the proposed approach is competitive with conventional MT Bayesian regression models using either Gaussian or spike–slab multivariate priors. Oxford University Press 2023-05-22 /pmc/articles/PMC10411556/ /pubmed/37216670 http://dx.doi.org/10.1093/g3journal/jkad109 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of The Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genomic Prediction
Alves, Anderson Antonio Carvalho
Fernandes, Arthur Francisco Araujo
Lopes, Fernando Brito
Breen, Vivian
Hawken, Rachel
Gianola, Daniel
Rosa, Guilherme Jordão de Magalhães
(Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers
title (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers
title_full (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers
title_fullStr (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers
title_full_unstemmed (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers
title_short (Quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers
title_sort (quasi) multitask support vector regression with heuristic hyperparameter optimization for whole-genome prediction of complex traits: a case study with carcass traits in broilers
topic Genomic Prediction
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411556/
https://www.ncbi.nlm.nih.gov/pubmed/37216670
http://dx.doi.org/10.1093/g3journal/jkad109
work_keys_str_mv AT alvesandersonantoniocarvalho quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers
AT fernandesarthurfranciscoaraujo quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers
AT lopesfernandobrito quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers
AT breenvivian quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers
AT hawkenrachel quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers
AT gianoladaniel quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers
AT rosaguilhermejordaodemagalhaes quasimultitasksupportvectorregressionwithheuristichyperparameteroptimizationforwholegenomepredictionofcomplextraitsacasestudywithcarcasstraitsinbroilers