be the 3-dimensional spatial coordinate system and t the temporal dimension. We consider the image time series I as a discretely sampled spatio-temporal signal of dimensions , where N is the dimension of the sampling grid on a single spatial axis, and is the number of time points1. In the following sections we represent the image time-series I as a single dimensional array of dimensions . We model the image time series I(u, t) as a realization of a latent spatio-temporal process f(u, t) with additive noise:
(1)
This is a valid modeling assumption when the temporal properties of the signal are similar across space; for instance, when analyzing within-subject time series of brain MRIs in AD the expected pathological change rates are generally mild and slowly varying across the brain. Second, a central assumption made in this paper is that the spatial dependencies of the signal are local, i.e. that the image intensities are smoothly varying and correlated within a spatial neighborhood of radius . We note that our assumptions about separability and stationarity are compatible with the spatio-temporal correlation models commonly assumed by registration-based approaches.
A reasonable choice for such a local spatial covariance structure is a negative squared exponential model , where is the global spatial amplitude parameter, and is the length-scale of the Gaussian spatial neighborhood. We observe that such a covariance structure is stationary with respect to the space parameters. Furthermore we can exploit the separability properties of the negative exponential function to note that given two separate spatial locations and we have
For this reason the covariance matrix can be further decomposed as the Kronecker product of covariance matrices of 1-dimensional processes: . We observe that the model is here conveniently represented by the product of independent covariances of significantly smaller size, and is completely identified by the spatial, temporal and noise parameters. In particular the proposed model is flexible with respect to the temporal covariance matrix , which can be expressed in terms of complex mixed-effects structure, and can account for covariates and different progression models. For instance, in this work the matrix is first specified in order to model the temporal progression observed in time series of images (Sect. 4), and then is used to model the influence of anatomical, genetic, clinical, and sociodemographic covariates on individual atrophy rates modelled by non-linear registration (Sect. 5).
3 Inference in Gaussian Processes with Kronecker Structure
The GP-based generative model with Kronecker covariance structure outlined in this work provides a powerful and efficient framework for prediction using image time series. Here we provide the main results concerning the marginal likelihood computation, the hyper-parameter optimization and the posterior prediction.
Let and be the eigenvectors and eigenvalues associated to the one-dimensional spatial and temporal covariance matrices and . This eigendecomposition problem can be easily and efficiently solved beforehand offline. We further introduce the shortform notation .
Log-Marginal Likelihood. The marginal likelihood of the model (1) is the following:
with , , and where is the matricization of I into a 2 dimensional matrix of dimension , and and are the eigenvalues of respectively and . The computation of the vector requires the storage and multiplication of matrices of relatively small sizes, respectively , and . The product can be finally computed as the solution of the linear system , which is straightforward since is diagonal.
(2)
Hyperparameter Optimization. The derivative of the log-likelihood (2) with respect to the model parameters is:
It can be shown that formula (3) can be efficiently computed with respect to each model parameters. For instance, the gradient with respect to the noise parameter can be expressed in the form:
Prediction. A major strength of a GP framework for image time series is that it easily enables probabilistic predictions based on given observations. The proposed generative model allows us to consider the predictive distributions of the latent spatio-temporal process at any testing locations and timepoints . Given image time series I(u, t), we now aim at predicting the image at testing coordinates . Let us define the cross-covariance matrix of training and testing data, and the covariance evaluated on the new coordinates. The joint GP model of training and testing data is:
and it can be easily shown that the posterior distribution of conditioned on the observed time series I and parameters is [11]:
From the practical perspective, we notice that by definition the new covariance matrices still have a Kronecker product form: , and . The predicted mean at coordinates is then
(3)
(4)
(5)
(6)