Cargando…

AUTALASSO: an automatic adaptive LASSO for genome-wide prediction

BACKGROUND: Genome-wide prediction has become the method of choice in animal and plant breeding. Prediction of breeding values and phenotypes are routinely performed using large genomic data sets with number of markers on the order of several thousands to millions. The number of evaluated individual...

Descripción completa

Detalles Bibliográficos
Autores principales: Waldmann, Patrik, Ferenčaković, Maja, Mészáros, Gábor, Khayatzadeh, Negar, Curik, Ino, Sölkner, Johann
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6444607/
https://www.ncbi.nlm.nih.gov/pubmed/30940067
http://dx.doi.org/10.1186/s12859-019-2743-3
_version_ 1783408057832701952
author Waldmann, Patrik
Ferenčaković, Maja
Mészáros, Gábor
Khayatzadeh, Negar
Curik, Ino
Sölkner, Johann
author_facet Waldmann, Patrik
Ferenčaković, Maja
Mészáros, Gábor
Khayatzadeh, Negar
Curik, Ino
Sölkner, Johann
author_sort Waldmann, Patrik
collection PubMed
description BACKGROUND: Genome-wide prediction has become the method of choice in animal and plant breeding. Prediction of breeding values and phenotypes are routinely performed using large genomic data sets with number of markers on the order of several thousands to millions. The number of evaluated individuals is usually smaller which results in problems where model sparsity is of major concern. The LASSO technique has proven to be very well-suited for sparse problems often providing excellent prediction accuracy. Several computationally efficient LASSO algorithms have been developed, but optimization of hyper-parameters can be demanding. RESULTS: We have developed a novel automatic adaptive LASSO (AUTALASSO) based on the alternating direction method of multipliers (ADMM) optimization algorithm. The two major hyper-parameters of ADMM are the learning rate and the regularization factor. The learning rate is automatically tuned with line search and the regularization factor optimized using Golden section search. Results show that AUTALASSO provides superior prediction accuracy when evaluated on simulated and real bull data compared to the adaptive LASSO, LASSO and ridge regression implemented in the popular glmnet software. CONCLUSIONS: The AUTALASSO provides a very flexible and computationally efficient approach to GWP, especially when it is important to obtain high prediction accuracy and genetic gain. The AUTALASSO also has the capability to perform GWAS of both additive and dominance effects with smaller prediction error than the ordinary LASSO.
format Online
Article
Text
id pubmed-6444607
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64446072019-04-11 AUTALASSO: an automatic adaptive LASSO for genome-wide prediction Waldmann, Patrik Ferenčaković, Maja Mészáros, Gábor Khayatzadeh, Negar Curik, Ino Sölkner, Johann BMC Bioinformatics Methodology Article BACKGROUND: Genome-wide prediction has become the method of choice in animal and plant breeding. Prediction of breeding values and phenotypes are routinely performed using large genomic data sets with number of markers on the order of several thousands to millions. The number of evaluated individuals is usually smaller which results in problems where model sparsity is of major concern. The LASSO technique has proven to be very well-suited for sparse problems often providing excellent prediction accuracy. Several computationally efficient LASSO algorithms have been developed, but optimization of hyper-parameters can be demanding. RESULTS: We have developed a novel automatic adaptive LASSO (AUTALASSO) based on the alternating direction method of multipliers (ADMM) optimization algorithm. The two major hyper-parameters of ADMM are the learning rate and the regularization factor. The learning rate is automatically tuned with line search and the regularization factor optimized using Golden section search. Results show that AUTALASSO provides superior prediction accuracy when evaluated on simulated and real bull data compared to the adaptive LASSO, LASSO and ridge regression implemented in the popular glmnet software. CONCLUSIONS: The AUTALASSO provides a very flexible and computationally efficient approach to GWP, especially when it is important to obtain high prediction accuracy and genetic gain. The AUTALASSO also has the capability to perform GWAS of both additive and dominance effects with smaller prediction error than the ordinary LASSO. BioMed Central 2019-04-02 /pmc/articles/PMC6444607/ /pubmed/30940067 http://dx.doi.org/10.1186/s12859-019-2743-3 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Waldmann, Patrik
Ferenčaković, Maja
Mészáros, Gábor
Khayatzadeh, Negar
Curik, Ino
Sölkner, Johann
AUTALASSO: an automatic adaptive LASSO for genome-wide prediction
title AUTALASSO: an automatic adaptive LASSO for genome-wide prediction
title_full AUTALASSO: an automatic adaptive LASSO for genome-wide prediction
title_fullStr AUTALASSO: an automatic adaptive LASSO for genome-wide prediction
title_full_unstemmed AUTALASSO: an automatic adaptive LASSO for genome-wide prediction
title_short AUTALASSO: an automatic adaptive LASSO for genome-wide prediction
title_sort autalasso: an automatic adaptive lasso for genome-wide prediction
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6444607/
https://www.ncbi.nlm.nih.gov/pubmed/30940067
http://dx.doi.org/10.1186/s12859-019-2743-3
work_keys_str_mv AT waldmannpatrik autalassoanautomaticadaptivelassoforgenomewideprediction
AT ferencakovicmaja autalassoanautomaticadaptivelassoforgenomewideprediction
AT meszarosgabor autalassoanautomaticadaptivelassoforgenomewideprediction
AT khayatzadehnegar autalassoanautomaticadaptivelassoforgenomewideprediction
AT curikino autalassoanautomaticadaptivelassoforgenomewideprediction
AT solknerjohann autalassoanautomaticadaptivelassoforgenomewideprediction