Title: Exact Gaussian Processes on a Million Data Points

Mar 19, 2019

Ke Alexander Wang, Geoff Pleiss, Jacob R. Gardner, Stephen Tyree, Kilian Q. Weinberger, Andrew Gordon Wilson

Figures 1-4 for Exact Gaussian Processes on a Million Data Points

Abstract: Gaussian processes (GPs) are flexible models with state-of-the-art performance on many impactful applications. However, computational constraints with standard inference procedures have limited exact GPs to problems with fewer than about ten thousand training points, necessitating approximations for larger datasets. In this paper, we develop a scalable approach for exact GPs that leverages multi-GPU parallelization and methods like linear conjugate gradients, accessing the kernel matrix only through matrix multiplication. By partitioning and distributing kernel matrix multiplies, we demonstrate that an exact GP can be trained on over a million points in 3 days using 8 GPUs and can compute predictive means and variances in under a second using 1 GPU at test time. Moreover, we perform the first-ever comparison of exact GPs against state-of-the-art scalable approximations on large-scale regression datasets with $10^4$ to $10^6$ data points, showing dramatic performance improvements.
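
To make the abstract's central idea concrete, here is a minimal single-process NumPy sketch of solving $(K + \sigma^2 I)\,\alpha = y$ with linear conjugate gradients while touching the kernel matrix only through row-partitioned matrix-vector products. It is an illustration under stated assumptions, not the authors' implementation: the RBF kernel, the function names (`rbf_rows`, `partitioned_matvec`, `conjugate_gradients`), the block size, and the noise value are all hypothetical choices, and the multi-GPU distribution described in the paper is replaced by a plain Python loop over row blocks.

```python
import numpy as np


def rbf_rows(X_block, X, lengthscale=1.0):
    """RBF kernel between a block of points and all training points.

    The RBF kernel is an assumption made for this sketch.
    """
    sq_dists = (
        np.sum(X_block**2, axis=1)[:, None]
        + np.sum(X**2, axis=1)[None, :]
        - 2.0 * X_block @ X.T
    )
    # Clamp tiny negative values caused by floating-point rounding.
    return np.exp(-0.5 * np.maximum(sq_dists, 0.0) / lengthscale**2)


def partitioned_matvec(X, v, noise=0.1, block_size=256, lengthscale=1.0):
    """Compute (K + noise * I) @ v one row block at a time.

    Only a (block_size x n) slice of K is materialized at any moment,
    so the full n x n kernel matrix is never stored.
    """
    out = np.empty_like(v)
    for start in range(0, X.shape[0], block_size):
        stop = min(start + block_size, X.shape[0])
        out[start:stop] = rbf_rows(X[start:stop], X, lengthscale) @ v
    return out + noise * v


def conjugate_gradients(matvec, b, tol=1e-8, max_iter=1000):
    """Plain (unpreconditioned) CG for a symmetric positive definite system."""
    x = np.zeros_like(b)
    r = b.copy()  # residual b - A @ x for the initial guess x = 0
    p = r.copy()  # first search direction
    rs = r @ r
    for _ in range(max_iter):
        Ap = matvec(p)
        step = rs / (p @ Ap)
        x += step * p
        r -= step * Ap
        rs_new = r @ r
        if np.sqrt(rs_new) < tol:
            break
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x


# Usage on small synthetic data (hypothetical sizes, chosen for speed).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=200)
alpha = conjugate_gradients(
    lambda v: partitioned_matvec(X, v, noise=0.1, block_size=64), y
)
```

Because each row block's product is independent of the others, the loop in `partitioned_matvec` is the step that could be spread across devices; this sketch runs it sequentially.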
