-amyloid deposition begins in Alzheimer’s disease (AD) years before the onset of any clinical symptoms. It is therefore important to determine the temporal trajectories of amyloid deposition in these earliest stages in order to better understand their associations with progression to AD. A method for estimating the temporal trajectories of voxelwise amyloid as measured using longitudinal positron emission tomography (PET) imaging is presented. The method involves the estimation of a score for each subject visit based on the PET data that reflects their amyloid progression. This amyloid progression score allows subjects with similar progressions to be aligned and analyzed together. The estimation of the progression scores and the amyloid trajectory parameters are performed using an expectation-maximization algorithm. The correlations among the voxel measures of amyloid are modeled to reflect the spatial nature of PET images. Simulation results show that model parameters are captured well at a variety of noise and spatial correlation levels. The method is applied to longitudinal amyloid imaging data considering each cerebral hemisphere separately. The results are consistent across the hemispheres and agree with a global index of brain amyloid known as mean cortical DVR. Unlike mean cortical DVR, which depends on a priori defined regions, the progression score extracted by the method is data-driven and does not make assumptions about regional longitudinal changes. Compared to regressing on age at each voxel, the longitudinal trajectory slopes estimated using the proposed method show better localized longitudinal changes.
Cortical -amyloid deposition is a neuropathological hallmark associated with Alzheimer’s disease (AD), and begins years before any cognitive symptoms of AD are evident [8]. Studying within-subject longitudinal changes is limited by the number of follow-up visits. The relatively short span of longitudinal positron emission tomograpy (PET) studies of amyloid deposition compared to its hypothesized timeline makes it difficult to extensively study the longitudinal brain amyloid changes that occur in the preclinical stages of AD.
It is possible to “stitch” data across subjects in order to obtain temporal biomarker trajectories that fit an underlying model. This is the premise of the Disease Progression Score method, which has been applied to studying changes in cognitive and biological markers related to Alzheimer’s disease [3, 9]. It is assumed that there is an underlying progression score (PS) for each subject visit that is a linear transform of the subject’s age, and given this PS, it is possible to place biomarker measurements across a group of subjects onto a common timeline. The linear transformation of age removes across-subject variability in baseline biomarker measures as well as in their rates of longitudinal progression. Each biomarker is associated with a parametric trajectory as a function of PS, whose parameters are estimated along with the PS for each subject.
Other methods for analyzing trajectories of biomarkers have been proposed. One approach involves fitting a piecewise linear model to longitudinal data assuming that each biomarker becomes abnormal a certain number of years before clinical diagnosis, and this duration is estimated for each biomarker to yield the longitudinal trajectories as a function of time to diagnosis [13]. In another approach, an event-based probabilistic framework is used to determine the ordering of changes in longitudinal biomarker measures as well as the appropriate thresholds for separating normal from abnormal measures [14]. The second method is agnostic to clinical diagnosis, but does not allow for the characterization of longitudinal biomarker trajectories except for determining the ordering of the change points. The first method does delineate the longitudinal trajectories for each biomarker, but requires knowledge of clinical diagnosis and therefore is not suitable for studying the earliest changes in healthy individuals.
Here, we adopt the disease progression score principle, but make two substantial innovations. First, the voxelwise PET measures constitute the biomarkers in the model. Since voxelwise PET measures have an underlying spatial correlation, we incorporate the modeling of the spatial correlations among the biomarker error terms and estimate the spatial correlations along with the subject and trajectory parameters. Modeling spatial correlations makes the inference of the subject-specific progression scores less susceptible to the inherent correlations among the voxels. Second, instead of using an alternating least-squares approach for parameter estimation, we formulate the model fitting as an expectation-maximization (EM) algorithm, which guarantees optimality and convergence. In experiments using this approach, we first show using simulated data that the model parameters are estimated accurately, then apply the method to each cerebral hemisphere separately using distribution volume ratio (DVR) images derived from Pittsburgh compound B (PiB) PET imaging, which show the distribution of amyloid. Models fitted using data for 75 participants with a total of 271 visits reveal that the precuneus and frontal lobes show the greatest longitudinal increases in amyloid, with smaller increases in lateral temporal and temporoparietal regions, and minimal increases in the occipital lobe and the sensorimotor strip. Results are consistent across the two hemispheres, and the estimated PS agrees with a widely used global index of brain amyloid known as mean cortical DVR.
The progression score for subject i at visit j is assumed to be a linear transformation of age :
where are assumed to be uniformly distributed random variables on a fixed and large domain . The collection of K biomarker measurements (the intensities at each PiB-PET DVR voxel reflecting amyloid levels) make up the vector . The longitudinal trajectories associated with these biomarkers are assumed have a linear form parameterized by vectors and :
Here, is the observation noise. The covariance matrix R is assumed to be of the form , where is a positive scalar and C is a correlation matrix parameterized by . The parameters make up .
(1)
(2)
Let be the vector consisting of all biomarker measurements stacked across all visits of subject i and let . We then define and , where n is the number of subjects. The pair is the complete data. The complete log-likelihood for the model, ignoring the constants, is given by
where and .
(3)
The observations include biomarker measurements at each visit. The hidden variables are the subject-specific parameters and . The unknown parameters are .
E-Step: Let be the previous estimate of the parameters. The expectation step involves evaluating an expression for
Note that
where
, and is a multivariate Gaussian density with mean and covariance . For ease of notation, we have used to denote stacked across the visits of subject i and to denote diagonally stacked matrices across the visits of subject i. is obtained by diagonally stacking across the visits of subject i.
(4)
(5)
(6)
(7)
The uniform prior assumption on allows us to write , where c is a constant. Rewriting the expectation using Bayes’ rule and plugging in the result from (6) yields
If is large enough, we can approximate the expectation as an infinite domain integration over to obtain
where .
(8)
(9)
(10)
M-Step: Let be the function obtained at the end of the E-step. We set the derivatives and equal to 0 and solve for the parameters to obtain
(11)