Guassian Processes

Guassian Processes







Resources

  • Guassian Processes - mathematical monk@youtube on guassian processes (also follows on through 10 parts).



Guassian Process

Definition

For any set 

Loading
, a Guassian Process (GP) on 
Loading
 is a  set of random variables 
Loading
 such that 
Loading
 is multivariate gaussian.

Examples

From ML 19.1 (mathematicalmonk).

Trivial Example

Loading

Here we have d random variables, and any subset of these can form a perfectly valid multivariate guassian.

Random Lines

Loading

For any 

Loading
 randomly sampled form the guassian, the resulting 
Loading
 is a line. Hence, random lines.

Then forming given 

Loading
, we have

Loading

which we know is a multivariate guassian distribution because multivariate guassian distributions are closed under affine transforms (i.e. linear). It has zero mean and covariance:

Loading

This is equivalent to saying the guassian process has a covariance function:

Loading

Random Planes

 

Given data 

Loading
Loading
.

 

Then bayesian linear regression models the 

Loading
 with random variables 
Loading
 which are assumed to be independent given some parameter vector 
Loading
. The distribution we assign the random variables here is of course guassian:

 

Loading

 

or equivalently:

 

Loading

 

The parameter vector we assume to have a prior of the form:

 

Loading

 

Note that 

Loading
 is a multivariate guassian. So for any 
Loading
, if we consider 
Loading
 (or 
Loading
), this is then just a linear combination of univariate guassians which from the closure property of guassians  
Loading
implies that 
Loading
 is also guassian, and more specifically, univariate guassian.

 

CLAIM

Loading
is a guassian process on
Loading

 

To see this, take any 

Loading
. Now since we know 
Loading
 is univariate,

 

Loading

which is an affine transform of a multivariate guassian, so it too must be a multivariate guassian. Hence 

Loading
 is indeed a guassian process. The mean of the guassian process:

Loading

and the covariance of the guassian process comes from:

Loading

where we have made use of the equation for the covariance 

Loading
 in breaking it down and substituting back for 
Loading
 at the end.  This is exactly the kernel from guassian process examples of linear planes.


Existence of Guassian Processes

For any set 

Loading
, any mean function 
Loading
 ,and any kernel (covariance) function 
Loading
, there exists a Guassian Process 
Loading
 on 
Loading
 such that 
Loading
, and covariance 
Loading
.

Examples

  • Random Lines/Planes : 
    Loading
  • Brownian Motion :  
    Loading
  • Smooth Function : 
    Loading

The first function is curious - the singular value decomposition for the guassian generator 

Loading
 permits only the very first element of 
Loading
 to contribute to the random vector. This causes all remaining elements to remain locked into the random lines. For the others, the covariance rules effect the sampled random vector by weighting covariant values closer or farther from each other differently (when covariances are higher for close points, and almost zero for far points, this intuitively contributes to continuity).  More details in mathematical monk's video  ML 19.3.

Stationary and Non-Stationary GPs

A kernel function 

Loading
 which is only dependent on 
Loading
, i.e. 
Loading
, is called a stationary guassian process. This induces similar behaviour of the GP everywhere over the entire domain. If instead, you wish for different behaviour (e.g. different smoothness properties in different subsets of the domain), then a nonstationary guassian process is required. These are generally far more computationally heavy.