Mixed-Effects Model with Time Reparametrization for Longitudinal Univariate Manifold-Valued Data

($$A\beta _{1-42}$$) peptide, increasing CSF tau protein and neurodegeneration, characterized by cerebral atrophy and neuron loss. In order to increase the chances of success of a given treatment, the need for an early diagnosis and the need to understand the progression of the disease are crucial.


Longitudinal data, which consists in repeated observations of a given individual or group of individuals over time, allows to study the progression of a disease and the statistical analysis of longitudinal data may provide useful informations to help diagnose AD in its early phase. Mixed effects models provide a rich framework to analyze longitudinal measurements. The fixed (and respectively random) effects allow to describe the model at the group (respectively subject) level and the distribution of the random effects accounts for inter-subject variability. A particular case of the linear mixed-effects models introduced by Laird and Ware in [5] can be used to model a linear relationship between univariate longitudinal measurement and the times of observation : the random slope and intercept model. If $$y_{i,j}$$ denotes the j-th observation of the i-th subject associated to time $$t_{i,j}$$, the random slope and intercept model would write : $$y_{i,j} = (\overline{a}+a_{i})(t_{i,j}-t_{0}) + (\overline{b}+b_{i}) + \varepsilon _{i,j}$$, where $$\varepsilon _{i,j}$$ denotes a Gaussian noise with zero mean. The fixed effects $$\overline{a}$$ and $$\overline{b}$$ respectively correspond to the slope and intercept of an average trajectory $$\overline{D}(t) = \overline{a}(t-t_{0}) + \overline{b}$$ whereas $$t_{0}$$ plays the role of a reference time. The subject-specific slope $$a_{i}$$ and $$b_{i}$$ correspond to the slope and intercept of the trajectory of the i-th individual.

If the longitudinal observations arise, for example, from developmental studies, breeding studies or pharmacological studies, there is a natural choice for the baseline time $$t_{0}$$, which may represent the time of birth, beginning of the study or time at which a certain drug has been administered. In those situations, the baseline time is known and is therefore not estimated along with the parameters of the model. However, in studies on neurodegenerative diseases, such as Alzheimer’s disease, there may be no natural choice for the reference time. As pointed out by Yang et al. in [4] and Delor et al. in [3], the disease-onset time is probably different for every individual and, at a given age, two individuals may be at very different stages of disease progression. When the baseline (or disease-onset) time is unknown, one may consider it as a fixed effect of the given model. In [2], the authors proposed an interesting mixed-effect model for scalar measurements but the model did not account for the difference in stages progression among individuals. Yang et al. [4] and Delor et al. [3] addressed this issue by including time shifts in their models but Yang et al. did not estimate the time shifts in a statistical framework. In Delor et al., the observations of a given individual were shifted in time before the estimation of a disease-onset time. In both situations, the reference time was not estimated. In [8], Singh et al. proposed an interesting model for longitudinal manifold-valued observations using parallel transport. However, if the model is written for univariate longitudinal data, the model does include a random effect to account for the variability in speed of disease progression.

When the time reference time $$t_{0}$$ is considered unknown, the linear model given above is not identifiable and there is a relation between the distribution of the intercept random effect $$b_{i}$$ and the fixed effect $$t_{0}$$, which makes the interpretation of the distribution of the intercept random effect $$b_{i}$$ difficult.

To address the issues we pointed out, we propose to replace the intercept random effect $$b_{i}$$, which characterizes the distribution of the measurement values at time $$t_{0}$$, by a time shift $$\tau _{i}$$ which measures the delay (or advance) in disease progression. We therefore consider the model : $$y_{i,j} = v_{0}\alpha _{i}(t_{i,j} - t_{0} - \tau _{i}) + \varepsilon _{i,j}$$ where $$\alpha _{i}$$ is a random effect log-normally distributed with mean equal to 1 and $$\tau _{i}$$ is the time shift random effect normally distributed with mean 0. The fixed effect $$v_{0}$$ correspond to the slope of the mean trajectory and is the fixed effect associated to $$\alpha _{i}$$ whereas $$t_{0}$$ is now the fixed effect associated to the random effect $$\tau _{i}$$. If D denotes the straight line $$D(t) = v_{0}(t-t_{0})$$, the model writes : $$y_{i,j} = D \big ( \alpha _{i}(t_{i,j} - t_{0} - \tau _{i}) + t_{0} \big ) + \varepsilon _{i,j}$$. As a consequence, the random effects $$\alpha _{i}$$ and $$\tau _{i}$$ allow to define a subject-specific affine reparametrization of the straight line D, which can be thought of as an average trajectory. The acceleration factor $$\alpha _{i}$$ and the time shift $$\tau _{i}$$ of the affine subject-specific reparametrization provide informations on the dynamical evolution of a given subject with respect to the average model: the acceleration factor will determine whether an individual is progressing faster than the individual trajectory. The time shift determines whether the individual is lagging behind or ahead of the average model. Therefore, the random effects of the model will allow to determine, among a group of individuals, who are fast or slow progressing individuals and whether a given individual might develop the disease earlier (early-onset) or later (late-onset) than the estimated reference time $$t_{0}$$.

Moreover, since the subject-specific trajectories are affine reparametrizations of an average trajectory, this model can easily be generalized to longitudinal measurements on a Riemannian manifold. The average trajectory may be replaced with a geodesic of the manifold and the subject-specific trajectories will remain geodesics on the manifold. As a matter of fact, longitudinal measurements arising from neuroimaging, neuropsychological tests or clinical examinations may belong to nonlinear spaces. The scores to a neuropsychological test often belong to a bounded interval. Riemannian manifolds offer a very flexible framework which includes the previous examples. The generalization of this statistical model to Riemannian manifold-valued observations could be used either to analyze shapes, cortical thickness measurements or neuropsychological assessment scores. After a proper renormalization, the ADAS-Cog 13 scores belong to the open unit inverval (0, 1) which, equipped with a specific metric, can be seen as a univariate Riemannian manifold. For this metric, the geodesics are logistic curves. The models presented in [2] and [3] do not generalize to manifold-valued data. In [9], Lorenzi et al. used Riemannian manifold techniques to estimate a model of the brain’s normal ageing from healthy individuals T1 MR scans. The model was used to compute a time shift, called morphological age shift, which corresponds to the actual anatomical age of the subject with respect to an estimated average age for healthy subjects. However, the subject-specific time shifts were not estimated as parameters of a statistical model. In [7], Datar et al. generalized a particular linear mixed-effects model to longitudinal shape analysis but the proposed model did not include time shifts to account for the variability in stages of disease progression among the population. Even though the model is used to analyze shapes (given by a set of points), the model is not described in a Riemannian manifold setting.

The work presented herein will generalize the model written above to a Riemannian manifold included in $$\mathbb {R}$$. In Sect. 2, we will introduce our model and we will present two particular cases of this model : the straight lines model and the logistic curves model. We also explain how the parameters of the model are estimated. In Sect. 3, the straight lines model and logistic curves model were used to analyze longitudinal ADAS-Cog and cortical thickness measurements from the ADNI database. We show that the estimated random effects of the model allowed to distinguish between slow versus fast progressing individuals and early versus late-onset individuals.



2 A Mixed-Effects Model with Time Reparametrization for Manifold-Valued Observations



2.1 Model Description


Let us assume that we observe p different individuals. For each individual, we have $$n_{i}$$ observations, obtained at times $$t_{i,1} < \ldots < t_{i,n_{i}}$$ and the observations of the i-th individual are denoted by $$y_{i,1}, \ldots , y_{i,n_{i}}$$. We assume that each observation $$y_{i,j}$$ is a point on a one-dimensional Riemannian manifold (Mg) included in $$\mathbb {R}$$. In addition to this, we assume that the geodesics of M are defined on the entire real line.

Let $$p_{0}$$ be a point in M, $$t_{0}$$ in $$\mathbb {R}$$ and $$v_{0} \in T_{p_{0}}M$$, a tangent vector to M at the point $$p_{0}$$. The triplet $$(p_{0},t_{0},v_{0})$$ allows to define an average trajectory $$\gamma _{p_{0},t_{0},v_{0}}$$, the geodesic of M which passes through the point $$p_{0} \in M$$ at time $$t_{0}$$ and with the velocity $$v_{0} \in T_{p_{0}}M \simeq \mathbb {R}$$. This average trajectory summarises the progression of all the individuals and could be interpreted as the evolution of an “average subject”. While the average subject passes through the point $$p_{0}$$ at time $$t_{0}$$ with the velocity $$v_{0}$$, we assume that the trajectory of the i-th subject is the geodesic $$\gamma _{i}$$ which passes through the point $$p_{0}$$ at time $$t_{0}+\tau _{i}$$ with velocity $$\alpha _{i}v_{0}$$. The parameters $$(p_{0},t_{0},v_{0})$$ will be the fixed effects of the model whereas $$\alpha _{i}$$ and $$\tau _{i}$$ will be the random effects of the model. The point $$p_{0}$$ in M can be understood as an observation point. A simplistic interpretation of the role played by $$\alpha _{i}$$ and $$\tau _{i}$$ is the following : an observer standing at the point $$p_{0}$$ in M and aware that the average subject passes through the point at the time $$t_{0}$$, with velocity $$v_{0}$$, will be able to tell whether the i-th subject passes through this point earlier or later than the average subject and whether he passes faster or slower. More precisely, $$\tau _{i}$$ is a time shift which will provide an information on whether the progression of the i-th trajectory is ahead of the average trajectory or not and $$\alpha _{i}$$ provides the information on whether the i-th subject is evolving faster or slower than the average subject. As a consequence, the random effects $$\alpha _{i}$$ and $$\tau _{i}$$ take into account the fact that two different subjects may be at very different stages of disease progression and allow for inter-subjects comparaisons. The description above makes the assumption that every subject-specific trajectory of disease progression will eventually reach a common value $$p_{0}$$. This assumption does not make sense in a higher-dimensional Riemannian manifold. In order to generalize the proposed model to a higher-dimensional setting, one would need to add another random effect, associated to the fixed effect $$p_{0}$$.

Only gold members can continue reading. Log In or Register to continue

Stay updated, free articles. Join our Telegram channel

Sep 16, 2016 | Posted by in GENERAL RADIOLOGY | Comments Off on Mixed-Effects Model with Time Reparametrization for Longitudinal Univariate Manifold-Valued Data

Full access? Get Clinical Tree

Get Clinical Tree app for offline access