Resources
- Guassian Processes - mathematical monk@youtube on guassian processes (also follows on through 10 parts).
Guassian Process
Definition
For any set
Loading
, a
Guassian Process (GP) on
Loading
is a set of random variables
Loading
such that
Loading
is multivariate gaussian.
Examples
From ML 19.1 (mathematicalmonk).
Trivial Example
Loading
Here we have d random variables, and any subset of these can form a perfectly valid multivariate guassian.
Random Lines
Loading
For any
Loading
randomly sampled form the guassian, the resulting
Loading
is a line. Hence, random lines.
Then forming given
Loading
, we have
Loading
which we know is a multivariate guassian distribution because multivariate guassian distributions are closed under affine transforms (i.e. linear). It has zero mean and covariance:
Loading
This is equivalent to saying the guassian process has a covariance function:
Loading
Random Planes
Given data
Loading
,
Loading
.
Then bayesian linear regression models the
Loading
with random variables
Loading
which are assumed to be independent given some parameter vector
Loading
. The distribution we assign the random variables here is of course guassian:
Loading
or equivalently:
Loading
The parameter vector we assume to have a prior of the form:
Loading
Note that
Loading
is a multivariate guassian. So for any
Loading
, if we consider
Loading
(or
Loading
), this is then just a linear combination of univariate guassians which from the closure property of guassians
Loading
implies that
Loading
is also guassian, and more specifically, univariate guassian.
To see this, take any
Loading
. Now since we know
Loading
is univariate,
Loading
which is an affine transform of a multivariate guassian, so it too must be a multivariate guassian. Hence
Loading
is indeed a guassian process. The mean of the guassian process:
Loading
and the covariance of the guassian process comes from:
Loading
where we have made use of the equation for the covariance
Loading
in breaking it down and substituting back for
Loading
at the end. This is exactly the kernel from
guassian process examples of linear planes.
Existence of Guassian Processes
For any set
Loading
, any mean function
Loading
,and any kernel (covariance) function
Loading
, there exists a
Guassian Process Loading
on
Loading
such that
Loading
, and covariance
Loading
.
Examples
- Random Lines/Planes :
Loading
- Brownian Motion :
Loading
- Smooth Function :
Loading
The first function is curious - the singular value decomposition for the guassian generator
Loading
permits only the very first element of
Loading
to contribute to the random vector. This causes all remaining elements to remain locked into the random lines. For the others, the covariance rules effect the sampled random vector by weighting covariant values closer or farther from each other differently (when covariances are higher for close points, and almost zero for far points, this intuitively contributes to continuity). More details in mathematical monk's video
ML 19.3.
Stationary and Non-Stationary GPs
A kernel function
Loading
which is only dependent on
Loading
, i.e.
Loading
, is called a
stationary guassian process. This induces similar behaviour of the GP everywhere over the entire domain. If instead, you wish for different behaviour (e.g. different smoothness properties in different subsets of the domain), then a
nonstationary guassian process is required. These are generally far more computationally heavy.