Cargando…

Spatial Multivariate Trees for Big Data Bayesian Regression

High resolution geospatial data are challenging because standard geostatistical models based on Gaussian processes are known to not scale to large data sizes. While progress has been made towards methods that can be computed more efficiently, considerably less attention has been devoted to methods f...

Descripción completa

Detalles Bibliográficos
Autores principales: Peruzzi, Michele, Dunson, David B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9311452/
https://www.ncbi.nlm.nih.gov/pubmed/35891979
_version_ 1784753600156663808
author Peruzzi, Michele
Dunson, David B.
author_facet Peruzzi, Michele
Dunson, David B.
author_sort Peruzzi, Michele
collection PubMed
description High resolution geospatial data are challenging because standard geostatistical models based on Gaussian processes are known to not scale to large data sizes. While progress has been made towards methods that can be computed more efficiently, considerably less attention has been devoted to methods for large scale data that allow the description of complex relationships between several outcomes recorded at high resolutions by different sensors. Our Bayesian multivariate regression models based on spatial multivariate trees (SpamTrees) achieve scalability via conditional independence assumptions on latent random effects following a treed directed acyclic graph. Information-theoretic arguments and considerations on computational efficiency guide the construction of the tree and the related efficient sampling algorithms in imbalanced multivariate settings. In addition to simulated data examples, we illustrate SpamTrees using a large climate data set which combines satellite data with land-based station data. Software and source code are available on CRAN at https://CRAN.R-project.org/package=spamtree.
format Online
Article
Text
id pubmed-9311452
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-93114522022-07-25 Spatial Multivariate Trees for Big Data Bayesian Regression Peruzzi, Michele Dunson, David B. J Mach Learn Res Article High resolution geospatial data are challenging because standard geostatistical models based on Gaussian processes are known to not scale to large data sizes. While progress has been made towards methods that can be computed more efficiently, considerably less attention has been devoted to methods for large scale data that allow the description of complex relationships between several outcomes recorded at high resolutions by different sensors. Our Bayesian multivariate regression models based on spatial multivariate trees (SpamTrees) achieve scalability via conditional independence assumptions on latent random effects following a treed directed acyclic graph. Information-theoretic arguments and considerations on computational efficiency guide the construction of the tree and the related efficient sampling algorithms in imbalanced multivariate settings. In addition to simulated data examples, we illustrate SpamTrees using a large climate data set which combines satellite data with land-based station data. Software and source code are available on CRAN at https://CRAN.R-project.org/package=spamtree. 2022 /pmc/articles/PMC9311452/ /pubmed/35891979 Text en https://creativecommons.org/licenses/by/4.0/License: CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. Attribution requirements are provided at http://jmlr.org/papers/v23/20-1361.html.
spellingShingle Article
Peruzzi, Michele
Dunson, David B.
Spatial Multivariate Trees for Big Data Bayesian Regression
title Spatial Multivariate Trees for Big Data Bayesian Regression
title_full Spatial Multivariate Trees for Big Data Bayesian Regression
title_fullStr Spatial Multivariate Trees for Big Data Bayesian Regression
title_full_unstemmed Spatial Multivariate Trees for Big Data Bayesian Regression
title_short Spatial Multivariate Trees for Big Data Bayesian Regression
title_sort spatial multivariate trees for big data bayesian regression
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9311452/
https://www.ncbi.nlm.nih.gov/pubmed/35891979
work_keys_str_mv AT peruzzimichele spatialmultivariatetreesforbigdatabayesianregression
AT dunsondavidb spatialmultivariatetreesforbigdatabayesianregression