Cargando…
Splitting Gaussian processes for computationally-efficient regression
Gaussian processes offer a flexible kernel method for regression. While Gaussian processes have many useful theoretical properties and have proven practically useful, they suffer from poor scaling in the number of observations. In particular, the cubic time complexity of updating standard Gaussian p...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8384217/ https://www.ncbi.nlm.nih.gov/pubmed/34428233 http://dx.doi.org/10.1371/journal.pone.0256470 |
_version_ | 1783741874278760448 |
---|---|
author | Terry, Nick Choe, Youngjun |
author_facet | Terry, Nick Choe, Youngjun |
author_sort | Terry, Nick |
collection | PubMed |
description | Gaussian processes offer a flexible kernel method for regression. While Gaussian processes have many useful theoretical properties and have proven practically useful, they suffer from poor scaling in the number of observations. In particular, the cubic time complexity of updating standard Gaussian process models can be a limiting factor in applications. We propose an algorithm for sequentially partitioning the input space and fitting a localized Gaussian process to each disjoint region. The algorithm is shown to have superior time and space complexity to existing methods, and its sequential nature allows the model to be updated efficiently. The algorithm constructs a model for which the time complexity of updating is tightly bounded above by a pre-specified parameter. To the best of our knowledge, the model is the first local Gaussian process regression model to achieve linear memory complexity. Theoretical continuity properties of the model are proven. We demonstrate the efficacy of the resulting model on several multi-dimensional regression tasks. |
format | Online Article Text |
id | pubmed-8384217 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-83842172021-08-25 Splitting Gaussian processes for computationally-efficient regression Terry, Nick Choe, Youngjun PLoS One Research Article Gaussian processes offer a flexible kernel method for regression. While Gaussian processes have many useful theoretical properties and have proven practically useful, they suffer from poor scaling in the number of observations. In particular, the cubic time complexity of updating standard Gaussian process models can be a limiting factor in applications. We propose an algorithm for sequentially partitioning the input space and fitting a localized Gaussian process to each disjoint region. The algorithm is shown to have superior time and space complexity to existing methods, and its sequential nature allows the model to be updated efficiently. The algorithm constructs a model for which the time complexity of updating is tightly bounded above by a pre-specified parameter. To the best of our knowledge, the model is the first local Gaussian process regression model to achieve linear memory complexity. Theoretical continuity properties of the model are proven. We demonstrate the efficacy of the resulting model on several multi-dimensional regression tasks. Public Library of Science 2021-08-24 /pmc/articles/PMC8384217/ /pubmed/34428233 http://dx.doi.org/10.1371/journal.pone.0256470 Text en © 2021 Terry, Choe https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Terry, Nick Choe, Youngjun Splitting Gaussian processes for computationally-efficient regression |
title | Splitting Gaussian processes for computationally-efficient regression |
title_full | Splitting Gaussian processes for computationally-efficient regression |
title_fullStr | Splitting Gaussian processes for computationally-efficient regression |
title_full_unstemmed | Splitting Gaussian processes for computationally-efficient regression |
title_short | Splitting Gaussian processes for computationally-efficient regression |
title_sort | splitting gaussian processes for computationally-efficient regression |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8384217/ https://www.ncbi.nlm.nih.gov/pubmed/34428233 http://dx.doi.org/10.1371/journal.pone.0256470 |
work_keys_str_mv | AT terrynick splittinggaussianprocessesforcomputationallyefficientregression AT choeyoungjun splittinggaussianprocessesforcomputationallyefficientregression |