where X is an m-by-n ( is a column vector of length n, and s () are identical and independently distributed (i.i.d.) variables following a Gaussian distribution . In research applications, Y may be an fMRI time serials at a voxel location and X may be the design matrix of functional tasks, or in a tensor-based morphometry (TBM) study Y may be subjects’ Jacobian determinant at a voxel location and X may be a factor matrix of age, gender or disease states, etc. Please note:
We assume that X is full column-rank so that the inverse of exists.
We assume that .
The maximum-likelihood and unbiased estimate of is
The residuals and the unbiased estimate of are
Let us focus on the probabilistic and geometric properties of .
1.
It follows a -variance isotropic multi-variate Gaussian distribution embedded in the null space of X’s columns. More specifically, there exists an m-by-df matrix Z which satisfies and , such that and .
2.
It is independent of .
3.
Its normalized vector uniformly distributes on a unit hyper-sphere in the null space of X’s columns, independent of and .
Property 1 is the most insightful and it easily derives properties 2 and 3. We outline its proof as follows:
1.
Because X is a full column-rank m-by-n matrix, its column null space has dimensions.
2.
Define Z as an m-by-df matrix whose columns are a set orthonormal bases of the null space of X’s columns. By definition, Z satisfies and .
3.
Define , then this df-element random vector follows a Gaussian distribution , because:
, as a linear combination of , follows a multi-variate Gaussian distribution;
The expected value of is ;
The variance of is .
4.
Z also satisfies because:
Both and equal ;
is a full-rank m-by-m matrix so its inverse exists;
Both and equal .
5.
equals because
2.2 Weighted FDR in Volume
Let denote the detected region, the underlying truth, and the volume of a region. The volume-based FDR is defined as follows
Genovese, Lazar, and Nichols [9] defined the volumetric measure in the image space, and consequently it can be translated as the number of voxels in voxel-based analysis. Benjamini and Hochberg’s step-up procedure [2] was applied to control the FDR. The step-up procedure finds
where q is the user specified FDR level, is the k-th smallest voxel p-value, and N is the number of voxels. This step-up procedure is able to handle positive dependence among tests, as Benjamini and Yekutieli discussed in [4]. For more general dependence among tests, please refer to [4].
(1)
Benjamini and Hochberg (1997) [3] upgraded it to a weighted version whose FDR and control procedure are
where is the weight associated with a voxel.
(2)