Cargando…
Fingerprint resampling: A generic method for efficient resampling
In resampling methods, such as bootstrapping or cross validation, a very similar computational problem (usually an optimization procedure) is solved over and over again for a set of very similar data sets. If it is computationally burdensome to solve this computational problem once, the whole resamp...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4657057/ https://www.ncbi.nlm.nih.gov/pubmed/26597870 http://dx.doi.org/10.1038/srep16970 |
_version_ | 1782402326789619712 |
---|---|
author | Mestdagh, Merijn Verdonck, Stijn Duisters, Kevin Tuerlinckx, Francis |
author_facet | Mestdagh, Merijn Verdonck, Stijn Duisters, Kevin Tuerlinckx, Francis |
author_sort | Mestdagh, Merijn |
collection | PubMed |
description | In resampling methods, such as bootstrapping or cross validation, a very similar computational problem (usually an optimization procedure) is solved over and over again for a set of very similar data sets. If it is computationally burdensome to solve this computational problem once, the whole resampling method can become unfeasible. However, because the computational problems and data sets are so similar, the speed of the resampling method may be increased by taking advantage of these similarities in method and data. As a generic solution, we propose to learn the relation between the resampled data sets and their corresponding optima. Using this learned knowledge, we are then able to predict the optima associated with new resampled data sets. First, these predicted optima are used as starting values for the optimization process. Once the predictions become accurate enough, the optimization process may even be omitted completely, thereby greatly decreasing the computational burden. The suggested method is validated using two simple problems (where the results can be verified analytically) and two real-life problems (i.e., the bootstrap of a mixed model and a generalized extreme value distribution). The proposed method led on average to a tenfold increase in speed of the resampling method. |
format | Online Article Text |
id | pubmed-4657057 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-46570572015-11-30 Fingerprint resampling: A generic method for efficient resampling Mestdagh, Merijn Verdonck, Stijn Duisters, Kevin Tuerlinckx, Francis Sci Rep Article In resampling methods, such as bootstrapping or cross validation, a very similar computational problem (usually an optimization procedure) is solved over and over again for a set of very similar data sets. If it is computationally burdensome to solve this computational problem once, the whole resampling method can become unfeasible. However, because the computational problems and data sets are so similar, the speed of the resampling method may be increased by taking advantage of these similarities in method and data. As a generic solution, we propose to learn the relation between the resampled data sets and their corresponding optima. Using this learned knowledge, we are then able to predict the optima associated with new resampled data sets. First, these predicted optima are used as starting values for the optimization process. Once the predictions become accurate enough, the optimization process may even be omitted completely, thereby greatly decreasing the computational burden. The suggested method is validated using two simple problems (where the results can be verified analytically) and two real-life problems (i.e., the bootstrap of a mixed model and a generalized extreme value distribution). The proposed method led on average to a tenfold increase in speed of the resampling method. Nature Publishing Group 2015-11-24 /pmc/articles/PMC4657057/ /pubmed/26597870 http://dx.doi.org/10.1038/srep16970 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Mestdagh, Merijn Verdonck, Stijn Duisters, Kevin Tuerlinckx, Francis Fingerprint resampling: A generic method for efficient resampling |
title | Fingerprint resampling: A generic method for efficient resampling |
title_full | Fingerprint resampling: A generic method for efficient resampling |
title_fullStr | Fingerprint resampling: A generic method for efficient resampling |
title_full_unstemmed | Fingerprint resampling: A generic method for efficient resampling |
title_short | Fingerprint resampling: A generic method for efficient resampling |
title_sort | fingerprint resampling: a generic method for efficient resampling |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4657057/ https://www.ncbi.nlm.nih.gov/pubmed/26597870 http://dx.doi.org/10.1038/srep16970 |
work_keys_str_mv | AT mestdaghmerijn fingerprintresamplingagenericmethodforefficientresampling AT verdonckstijn fingerprintresamplingagenericmethodforefficientresampling AT duisterskevin fingerprintresamplingagenericmethodforefficientresampling AT tuerlinckxfrancis fingerprintresamplingagenericmethodforefficientresampling |