In models of discrete spatial variation, the geographical space under study is regarded as a fixed set of spatial sampling units, typically defined by a partitioning of a continuous region into politically defined sub-regions.
In the following example, the region is the north of England and the sub-regions are counties.
Models for discrete spatial variation are usually defined in terms of their so-called full conditional distributions, incorporating notions of “local” dependence between spatial units.
More formally, if denote a set of outcome variables associated with each of spatial units, the model is specified by the univariate distributions
where a multivariate distribution
Note that a mutually consistent specification of the full conditionals involves non-obvious constraints on the allowable forms of distribution, which are set out in the celebrated Hammersley-Clifford Theorem (Besag (1974)).
Typically, simplifying assumptions are made so that only a few of the terms in the conditioning set play any part. A neighbourhood structure needs to be defined. For example, we may assume that there is correlation between a county and its adjacent counties only, ignoring the rest.
The following shows one example of how this might be done.
Lip cancer in Scotland; this example was originally analysed in Clayton and Kaldor (1987), with further comment and analysis in Clayton and Bernardinelli (1992) and in Breslow and Clayton (1993a).
spatial units are the counties of Scotland;
the response from each county is , the total number of cases during the years 1975-1980 inclusive;
let denote the risk for county , and the size of the population at risk
a natural model to fit to the data is that
an available covariate is , the percentage of the population of county who are engaged in agriculture, fishing or forestry
stronger predictors of lip cancer would be tobacco and alcohol consumption, but these are not available
To model residual spatial variation in risk, after adjusting for the available covariate, we assume that
where the are spatially correlated random effects that follow a discrete spatial variation model in which:
two counties are neighbours if they share a common boundary
the full conditionals of county depend only on the neighbours of county
neighbours where
mean of from counties which are neighbours of county ;
, where number of neighbours of county
Note that this specification (an example of a conditional autoregressive (CAR) model) corresponds to an improper joint distribution for , with joint pdf
where indicates that counties and are neighbours.