Guassian Processes







Resources

  • Guassian Processes - mathematical monk@youtube on guassian processes (also follows on through 10 parts).



Guassian Process

Definition

For any set , a Guassian Process (GP) on  is a  set of random variables  such that  is multivariate gaussian.

Examples

From ML 19.1 (mathematicalmonk).

Trivial Example

Here we have d random variables, and any subset of these can form a perfectly valid multivariate guassian.

Random Lines

For any  randomly sampled form the guassian, the resulting  is a line. Hence, random lines.

Then forming given , we have

which we know is a multivariate guassian distribution because multivariate guassian distributions are closed under affine transforms (i.e. linear). It has zero mean and covariance:

This is equivalent to saying the guassian process has a covariance function:

Random Planes

 

Given data .

 

Then bayesian linear regression models the  with random variables  which are assumed to be independent given some parameter vector . The distribution we assign the random variables here is of course guassian:

 

 

or equivalently:

 

 

The parameter vector we assume to have a prior of the form:

 

 

Note that  is a multivariate guassian. So for any , if we consider  (or ), this is then just a linear combination of univariate guassians which from the closure property of guassians  (Guassian Distributions:2) implies that  is also guassian, and more specifically, univariate guassian.

 

CLAIM is a guassian process on

 

To see this, take any . Now since we know  is univariate,

 

which is an affine transform of a multivariate guassian, so it too must be a multivariate guassian. Hence  is indeed a guassian process. The mean of the guassian process:

and the covariance of the guassian process comes from:

where we have made use of the equation for the covariance (?) in breaking it down and substituting back for  at the end.  This is exactly the kernel from guassian process examples of linear planes.


Existence of Guassian Processes

For any set , any mean function  ,and any kernel (covariance) function , there exists a Guassian Process  on  such that , and covariance .

Examples

  • Random Lines/Planes : 
  • Brownian Motion :  
  • Smooth Function : 

The first function is curious - the singular value decomposition for the guassian generator (?) permits only the very first element of  to contribute to the random vector. This causes all remaining elements to remain locked into the random lines. For the others, the covariance rules effect the sampled random vector by weighting covariant values closer or farther from each other differently (when covariances are higher for close points, and almost zero for far points, this intuitively contributes to continuity).  More details in mathematical monk's video  ML 19.3.

Stationary and Non-Stationary GPs

A kernel function  which is only dependent on , i.e. , is called a stationary guassian process. This induces similar behaviour of the GP everywhere over the entire domain. If instead, you wish for different behaviour (e.g. different smoothness properties in different subsets of the domain), then a nonstationary guassian process is required. These are generally far more computationally heavy.