A Case Study Competition Among Methods for Analyzing Large Spatial Data

Matthew J. Heaton, Abhirup Datta, Andrew O. Finley, Reinhard Furrer, Joseph Guinness, Rajarshi Guhaniyogi, Florian Gerber, Robert B. Gramacy, Dorit Hammerling, Matthias Katzfuss, Finn Lindgren, Douglas W. Nychka, Furong Sun, Andrew Zammit-Mangion

Research output: Contribution to journalArticlepeer-review

66 Scopus citations

Abstract

The Gaussian process is an indispensable tool for spatial data analysts. The onset of the “big data” era, however, has lead to the traditional Gaussian process being computationally infeasible for modern spatial data. As such, various alternatives to the full Gaussian process that are more amenable to handling big spatial data have been proposed. These modern methods often exploit low-rank structures and/or multi-core and multi-threaded computing environments to facilitate computation. This study provides, first, an introductory overview of several methods for analyzing large spatial data. Second, this study describes the results of a predictive competition among the described methods as implemented by different groups with strong expertise in the methodology. Specifically, each research group was provided with two training datasets (one simulated and one observed) along with a set of prediction locations. Each group then wrote their own implementation of their method to produce predictions at the given location and each was subsequently run on a common computing environment. The methods were then compared in terms of various predictive diagnostics. Supplementary materials regarding implementation details of the methods and code are available for this article online.

Original languageEnglish (US)
Pages (from-to)398-425
Number of pages28
JournalJournal of Agricultural, Biological, and Environmental Statistics
Volume24
Issue number3
DOIs
StatePublished - Sep 15 2019
Externally publishedYes

Keywords

  • Big data
  • Gaussian process
  • Low-rank approximation
  • Parallel computing

ASJC Scopus subject areas

  • Statistics and Probability
  • General Environmental Science
  • Agricultural and Biological Sciences (miscellaneous)
  • General Agricultural and Biological Sciences
  • Statistics, Probability and Uncertainty
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'A Case Study Competition Among Methods for Analyzing Large Spatial Data'. Together they form a unique fingerprint.

Cite this