Cargando…
AUTALASSO: an automatic adaptive LASSO for genome-wide prediction
BACKGROUND: Genome-wide prediction has become the method of choice in animal and plant breeding. Prediction of breeding values and phenotypes are routinely performed using large genomic data sets with number of markers on the order of several thousands to millions. The number of evaluated individual...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6444607/ https://www.ncbi.nlm.nih.gov/pubmed/30940067 http://dx.doi.org/10.1186/s12859-019-2743-3 |
_version_ | 1783408057832701952 |
---|---|
author | Waldmann, Patrik Ferenčaković, Maja Mészáros, Gábor Khayatzadeh, Negar Curik, Ino Sölkner, Johann |
author_facet | Waldmann, Patrik Ferenčaković, Maja Mészáros, Gábor Khayatzadeh, Negar Curik, Ino Sölkner, Johann |
author_sort | Waldmann, Patrik |
collection | PubMed |
description | BACKGROUND: Genome-wide prediction has become the method of choice in animal and plant breeding. Prediction of breeding values and phenotypes are routinely performed using large genomic data sets with number of markers on the order of several thousands to millions. The number of evaluated individuals is usually smaller which results in problems where model sparsity is of major concern. The LASSO technique has proven to be very well-suited for sparse problems often providing excellent prediction accuracy. Several computationally efficient LASSO algorithms have been developed, but optimization of hyper-parameters can be demanding. RESULTS: We have developed a novel automatic adaptive LASSO (AUTALASSO) based on the alternating direction method of multipliers (ADMM) optimization algorithm. The two major hyper-parameters of ADMM are the learning rate and the regularization factor. The learning rate is automatically tuned with line search and the regularization factor optimized using Golden section search. Results show that AUTALASSO provides superior prediction accuracy when evaluated on simulated and real bull data compared to the adaptive LASSO, LASSO and ridge regression implemented in the popular glmnet software. CONCLUSIONS: The AUTALASSO provides a very flexible and computationally efficient approach to GWP, especially when it is important to obtain high prediction accuracy and genetic gain. The AUTALASSO also has the capability to perform GWAS of both additive and dominance effects with smaller prediction error than the ordinary LASSO. |
format | Online Article Text |
id | pubmed-6444607 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-64446072019-04-11 AUTALASSO: an automatic adaptive LASSO for genome-wide prediction Waldmann, Patrik Ferenčaković, Maja Mészáros, Gábor Khayatzadeh, Negar Curik, Ino Sölkner, Johann BMC Bioinformatics Methodology Article BACKGROUND: Genome-wide prediction has become the method of choice in animal and plant breeding. Prediction of breeding values and phenotypes are routinely performed using large genomic data sets with number of markers on the order of several thousands to millions. The number of evaluated individuals is usually smaller which results in problems where model sparsity is of major concern. The LASSO technique has proven to be very well-suited for sparse problems often providing excellent prediction accuracy. Several computationally efficient LASSO algorithms have been developed, but optimization of hyper-parameters can be demanding. RESULTS: We have developed a novel automatic adaptive LASSO (AUTALASSO) based on the alternating direction method of multipliers (ADMM) optimization algorithm. The two major hyper-parameters of ADMM are the learning rate and the regularization factor. The learning rate is automatically tuned with line search and the regularization factor optimized using Golden section search. Results show that AUTALASSO provides superior prediction accuracy when evaluated on simulated and real bull data compared to the adaptive LASSO, LASSO and ridge regression implemented in the popular glmnet software. CONCLUSIONS: The AUTALASSO provides a very flexible and computationally efficient approach to GWP, especially when it is important to obtain high prediction accuracy and genetic gain. The AUTALASSO also has the capability to perform GWAS of both additive and dominance effects with smaller prediction error than the ordinary LASSO. BioMed Central 2019-04-02 /pmc/articles/PMC6444607/ /pubmed/30940067 http://dx.doi.org/10.1186/s12859-019-2743-3 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Waldmann, Patrik Ferenčaković, Maja Mészáros, Gábor Khayatzadeh, Negar Curik, Ino Sölkner, Johann AUTALASSO: an automatic adaptive LASSO for genome-wide prediction |
title | AUTALASSO: an automatic adaptive LASSO for genome-wide prediction |
title_full | AUTALASSO: an automatic adaptive LASSO for genome-wide prediction |
title_fullStr | AUTALASSO: an automatic adaptive LASSO for genome-wide prediction |
title_full_unstemmed | AUTALASSO: an automatic adaptive LASSO for genome-wide prediction |
title_short | AUTALASSO: an automatic adaptive LASSO for genome-wide prediction |
title_sort | autalasso: an automatic adaptive lasso for genome-wide prediction |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6444607/ https://www.ncbi.nlm.nih.gov/pubmed/30940067 http://dx.doi.org/10.1186/s12859-019-2743-3 |
work_keys_str_mv | AT waldmannpatrik autalassoanautomaticadaptivelassoforgenomewideprediction AT ferencakovicmaja autalassoanautomaticadaptivelassoforgenomewideprediction AT meszarosgabor autalassoanautomaticadaptivelassoforgenomewideprediction AT khayatzadehnegar autalassoanautomaticadaptivelassoforgenomewideprediction AT curikino autalassoanautomaticadaptivelassoforgenomewideprediction AT solknerjohann autalassoanautomaticadaptivelassoforgenomewideprediction |