7 Continuous Spatial Variation

7.5 Generalized linear geostatistical models

A different way to think about the geostatistical model is in its conditional form:

Yi|S(x),μ(x)N{μ(xi)+S(xi),τ2}

This led Diggle et al. (1998) to embed the model within the class of generalized linear mixed models (Breslow and Clayton, 1993b), to allow for example:

  • Poisson log-linear geostatistical models

    S(x) SGP{m,σ2,ρ(u)},m=0
    αi = k=1pzk(xi)βk+S(xi)
    Yi|S(xi),β1,,βp Poisson{exp(αi)}
  • Binary logistic-linear geostatistical models

    S(x) SGP{m,σ2,ρ(u)},m=0
    αi = k=1pzk(xi)βk+S(xi)
    Yi|S(xi),β1,,βp Bernoulli(exp(αi)1+exp(αi))
Example 7.7.

Childhood malaria in Gambia

A survey was conducted in village communities throughout Gambia (Diggle et al., 2002), see Figure 7.6.

Figure 7.6: Link, Caption: The Gambia is a long, narrow country on the west coast of Africa. This plot shows the border of the country and also the locations (as black hollow circles) of village communities in Gambia from which malaria samples, and other local information were collected.

For each child in the survey, the following information was recorded:

  • Yij= presence/absence of malarial parasites in blood-sample, for jth child in ith village

  • age, sex, bednet use

  • satellite-derived vegetation green-ness index

A logistic regression model, allowing for residual spatial variation and residual non-spatial variation between villages, is:

logit{(Yij=1)}=k=1pzijkβk+Ui+S(xi)

where UiN(0,ν2) and S(x) is a zero-mean stationary Gaussian process on 2.

Results:

  • inclusion of either Ui or S(xi) term materially affects inferences about βk

  • map of U^i against village locations suggests strong spatial structure

  • map of S^(x) shows spatial smoothing of unexplained spatial variation in risk

Figure 7.7: First Link, Second Link, Caption: Top: a map of the Gambia, green areas here have a higher density of vegetation, something that mosquitos require as a source of food. Bottom (pair of plots): Showing spatial predictions of the probability that a malaria blood sample will be positive; the country has been split in half in order to be able to zoom in more effectively. The more westerly area, of the country shown in the topmost of these two plots, is the coastal region of the Gambia, the large black area is the Atlantic Ocean. In these two plots, blue areas are areas of low risk and red areas are areas of high risk, with the intensity of the colour being in some sense proportional to the amount of risk.

Further extensions of spatial geostatistical models are often required for all but the simplest problems. The assumption of stationarity is often inappropriate, and may datasets suitable for geostatistical analysis also have a temporal dimension, for example in environmental and healthcare monitoring. Much recent research in the subject has focused on developing suitable methods to extend geostatistical methods into the spatio-temporal domain.