CFE-CMStatistics 2025: Start Registration
View Submission - CFE-CMStatistics 2025
A1256
Title: REX-SUB: A scalable subsampling strategy for modeling large spatial datasets Authors:  Nicholas Rios - George Mason University (United States)
Seiyon Lee - George Mason University (United States) [presenting]
Abstract: Recent advances in data collection technologies have led to the emergence of massive spatial datasets, with measurements obtained at millions of spatial locations. Geostatistical models typically employ Gaussian processes (GPs) to capture spatial dependence, but standard GP fitting becomes prohibitive at such scales. A promising solution is optimal subsampling, where a subset of locations is selected that optimizes a criterion. The aim is to propose a randomized exchange algorithm for subsampling (REX-SUB), which efficiently selects small subsamples that minimize prediction errors in the fitted spatial GP models. To further improve computational efficiency, a scalable Vecchia approximation is embedded to GP's joint likelihood, which takes advantage of sparsity in the precision matrix to enable fast inference on the selected subsamples. Through a simulation study and an application to a remotely sensed precipitable water dataset, it is shown that REX-SUB yields lower mean squared prediction errors and interval scores compared to competing subsampling strategies.