Cargando…

Splitting Gaussian processes for computationally-efficient regression

Gaussian processes offer a flexible kernel method for regression. While Gaussian processes have many useful theoretical properties and have proven practically useful, they suffer from poor scaling in the number of observations. In particular, the cubic time complexity of updating standard Gaussian p...

Descripción completa

Detalles Bibliográficos
Autores principales: Terry, Nick, Choe, Youngjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8384217/
https://www.ncbi.nlm.nih.gov/pubmed/34428233
http://dx.doi.org/10.1371/journal.pone.0256470
_version_ 1783741874278760448
author Terry, Nick
Choe, Youngjun
author_facet Terry, Nick
Choe, Youngjun
author_sort Terry, Nick
collection PubMed
description Gaussian processes offer a flexible kernel method for regression. While Gaussian processes have many useful theoretical properties and have proven practically useful, they suffer from poor scaling in the number of observations. In particular, the cubic time complexity of updating standard Gaussian process models can be a limiting factor in applications. We propose an algorithm for sequentially partitioning the input space and fitting a localized Gaussian process to each disjoint region. The algorithm is shown to have superior time and space complexity to existing methods, and its sequential nature allows the model to be updated efficiently. The algorithm constructs a model for which the time complexity of updating is tightly bounded above by a pre-specified parameter. To the best of our knowledge, the model is the first local Gaussian process regression model to achieve linear memory complexity. Theoretical continuity properties of the model are proven. We demonstrate the efficacy of the resulting model on several multi-dimensional regression tasks.
format Online
Article
Text
id pubmed-8384217
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83842172021-08-25 Splitting Gaussian processes for computationally-efficient regression Terry, Nick Choe, Youngjun PLoS One Research Article Gaussian processes offer a flexible kernel method for regression. While Gaussian processes have many useful theoretical properties and have proven practically useful, they suffer from poor scaling in the number of observations. In particular, the cubic time complexity of updating standard Gaussian process models can be a limiting factor in applications. We propose an algorithm for sequentially partitioning the input space and fitting a localized Gaussian process to each disjoint region. The algorithm is shown to have superior time and space complexity to existing methods, and its sequential nature allows the model to be updated efficiently. The algorithm constructs a model for which the time complexity of updating is tightly bounded above by a pre-specified parameter. To the best of our knowledge, the model is the first local Gaussian process regression model to achieve linear memory complexity. Theoretical continuity properties of the model are proven. We demonstrate the efficacy of the resulting model on several multi-dimensional regression tasks. Public Library of Science 2021-08-24 /pmc/articles/PMC8384217/ /pubmed/34428233 http://dx.doi.org/10.1371/journal.pone.0256470 Text en © 2021 Terry, Choe https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Terry, Nick
Choe, Youngjun
Splitting Gaussian processes for computationally-efficient regression
title Splitting Gaussian processes for computationally-efficient regression
title_full Splitting Gaussian processes for computationally-efficient regression
title_fullStr Splitting Gaussian processes for computationally-efficient regression
title_full_unstemmed Splitting Gaussian processes for computationally-efficient regression
title_short Splitting Gaussian processes for computationally-efficient regression
title_sort splitting gaussian processes for computationally-efficient regression
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8384217/
https://www.ncbi.nlm.nih.gov/pubmed/34428233
http://dx.doi.org/10.1371/journal.pone.0256470
work_keys_str_mv AT terrynick splittinggaussianprocessesforcomputationallyefficientregression
AT choeyoungjun splittinggaussianprocessesforcomputationallyefficientregression