5 Discrete Spatial Variation

5.2 Including both spatial and non-spatial variation

When there is a ‘natural’ statistical model for the data, such as the Poisson model for count data or the binomial for binary data, it is often the case that:

  • after adjusting for all known covariates, the residual variation is bigger than can be explained by the assumed distributional model

  • but this extra-variation may or may not be spatially structured

A cautious strategy is to contemplate a model with both spatial and non-spatial sources of extra-variation.

5.2.1 Monte Carlo test for residual spatial variation

A simple test for the presence of residual spatial variation for area-level data is as follows. Given residuals Ti:

  1. 1.

    For each ij, calculate Tij*=Ti-Tj and uij*=ui-uj, where ui is the centroid of region i.

  2. 2.

    Calculate the sample correlation c*=Corr(u*,T*).

  3. 3.

    Randomly reallocate the Ti values to the sampled areas, and repeat steps 1 and 2 to find the sample correlation c.

  4. 4.

    Carry out step 3 a total of m-1 times (a large number).

  5. 5.

    Calculate a p-value for a test of residual spatial correlation as (n*+1)/m, where n* is the number of c values from the sample that are greater than or equal to c*.

Note that there are considerable drawbacks to this method, including the lack of power to detect residual spatial correlation in small samples and the inability to detect directional effects. However, a small p-value is usually a clear indication that there is residual spatial correlation present, which may necessitate further modelling.

Example 5.2.

Lip cancer in Scotland (continued)

Replace the previous model for risk by

logRi=α+βxi+Ui+Si

where

  • Si again follow a discrete spatial variation model, and

  • Ui are mutually independent N(0,ν2) without any spatial structure.

The following table of results is adapted from Breslow and Clayton (1993a). Three different models are considered:

  1. 1.

    Poisson regression (include neither Ui nor Si terms)

  2. 2.

    non-spatial extra-Poisson variation (include Ui)

  3. 3.

    spatial extra-Poisson variation (include Si)

Model Estimates ± standard errors
α β×10 σ ν
1 -0.54±0.07 0.74±0.06
2 -0.44±0.16 0.68±0.14 0.60±0.08
3 -0.18±0.12 0.35±0.12 0.73±0.13
  1. 1.

    Poisson regression model seriously underestimates standard errors of α and β;

  2. 2.

    non-spatial extra-Poisson variation model fixes up standard errors, point estimates only change slightly;

  3. 3.

    spatial extra-Poisson variation model gives comparable standard errors to model 2 but materially changes point estimates.

To assess the goodness of fit of each model and compare between different models standard diagnostic tests for Poisson regression models can be used. See for example Section 4.3.