Cargando…

Fast methods for training Gaussian processes on large datasets

Gaussian process regression (GPR) is a non-parametric Bayesian technique for interpolating or fitting data. The main barrier to further uptake of this powerful tool rests in the computational costs associated with the matrices which arise when dealing with large datasets. Here, we derive some simple...

Descripción completa

Detalles Bibliográficos
Autores principales: Moore, C. J., Chua, A. J. K., Berry, C. P. L., Gair, J. R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4892455/
https://www.ncbi.nlm.nih.gov/pubmed/27293793
http://dx.doi.org/10.1098/rsos.160125
Descripción
Sumario:Gaussian process regression (GPR) is a non-parametric Bayesian technique for interpolating or fitting data. The main barrier to further uptake of this powerful tool rests in the computational costs associated with the matrices which arise when dealing with large datasets. Here, we derive some simple results which we have found useful for speeding up the learning stage in the GPR algorithm, and especially for performing Bayesian model comparison between different covariance functions. We apply our techniques to both synthetic and real data and quantify the speed-up relative to using nested sampling to numerically evaluate model evidences.