4 Linear models for non-stationary and seasonal time series.

4.1

The term non-stationary covers a host of stochastic processes. The simplest forms arise when either the mean of the series varies over time, or when the process is a random walk.

4.2 Random walk models.

The variations observed in many series can be explained as the cumulative sum of independent variables from an IID(μ, σ²) distribution, the two parameters being the mean and variance of the distribution. Reverting now to calling the observed series x_t: if the mean μ is zero the series is called a simple random walk and the independent variables are white noise e_t:
4.2.1

x_t = x_{t-1} + e_t.

A non-zero mean μ represents a mean increase per unit time, and the model is then called a random walk with drift. To see whether this model is appropriate for a time series x_t, the first differences of the series:
4.2.2

x_t − x_{t-1} = (1 − B)x_t = ∇x_t

are calculated, and should appear to be white noise e_t, or μ + e_t if μ ≠ 0.
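This diagnostic is easy to check by simulation: generate a random walk with drift and confirm that its first differences look like IID(μ, σ²) noise. The sketch below is a minimal illustration; the seed, drift, standard deviation and series length are arbitrary choices, not values from the text.

```python
import numpy as np

rng = np.random.default_rng(0)           # arbitrary seed

mu, sigma, n = 0.5, 1.0, 10_000          # assumed drift, noise s.d., length
e = rng.normal(mu, sigma, size=n)        # IID(mu, sigma^2) increments
x = np.cumsum(e)                         # random walk with drift: x_t = x_{t-1} + e_t

d = np.diff(x)                           # first differences (1 - B)x_t
print(round(d.mean(), 2), round(d.std(), 2))   # close to mu and sigma

# the differences should also be serially uncorrelated (white noise)
r1 = np.corrcoef(d[:-1], d[1:])[0, 1]    # sample lag-1 autocorrelation
```

With ten thousand observations the sample mean and standard deviation of the differences recover μ and σ closely, and the lag-1 autocorrelation is near zero, as white noise requires.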

The typical appearance of a simple random walk is difficult to appreciate intuitively. The series is best predicted by its most recent level and is equally likely to move up or down, yet the eye persists in seeing patterns of changing trends and cycles. It may be considered a borderline case of the AR(1) model as ϕ → 1. Figure 24 shows a real series of daily interest rates and an artificially generated random walk; the similarities in behaviour are quite striking. Random walk models are widely used in financial time series analysis, typically after taking logarithms of the original series. An added feature, though, is that the variance of the white noise disturbance (or returns) e_t, which is called the volatility of the series, may change slowly over time. It then becomes important to model the process e_t.
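A minimal sketch of this idea, assuming a hypothetical price series whose logarithm follows a simple random walk: the log returns recover the disturbances e_t, and a rolling standard deviation gives a crude estimate of the volatility. The window length and noise level below are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)           # arbitrary seed

# hypothetical price series: random walk in the logs, as in the text
log_p = np.cumsum(rng.normal(0.0, 0.01, size=500))
price = np.exp(log_p)

returns = np.diff(np.log(price))         # e_t: the white-noise disturbances

# crude volatility estimate: rolling standard deviation of the returns
window = 50                              # assumed window length
vol = np.array([returns[i - window:i].std()
                for i in range(window, len(returns))])
```

Here the rolling estimate hovers around the true disturbance standard deviation of 0.01; a series with genuinely changing volatility would show the estimate drifting over time instead.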

Figure 24: Series of daily interest rates and an artificial random walk series.

4.3 The exponentially weighted moving average, EWMA, predictor.

Forecasting is a topic yet to be covered, but it is one of the main application areas of time series modelling. One of the earliest and simplest forecasting schemes, apart from fitting trend lines, was the EWMA predictor. It does not assume any particular structure for the observations and can be used routinely on any data, though it was particularly designed to predict future values of a series with a fluctuating level. The compromise between averaging many values, to get a good estimate of the level, and using only a few values, so as to estimate the recent level rather than an out-of-date one, was achieved by a choice of discounting factor θ applied to past values. The average x̄_t used at time t to predict x_{t+1} was therefore of the theoretical form:
4.3.1

x̄_t = (1 − θ)(x_t + θx_{t-1} + θ²x_{t-2} + ⋯),

where the factor (1-θ) is present so that the weights sum to one, giving a true average. The practical point is that this average could easily be calculated by a recursive or updating formula:
4.3.2

x̄_t = (1 − θ)x_t + θx̄_{t-1}.

As each new ‘day’ arrived with its most recent record x_t, the old EWMA x̄_{t-1} was updated by this to give the new EWMA. These last two equations are conveniently written in operator notation:
4.3.3

x̄_t = (1 − θ)(1 − θB)^{-1} x_t and (1 − θB)x̄_t = (1 − θ)x_t.
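The equivalence of the discounted-average and recursive forms can be checked numerically. The sketch below assumes an arbitrary input series and the common convention of starting the recursion at the first observation; the start-up effect decays like θ^t, so by the end of the series the two forms agree to within rounding.

```python
import numpy as np

rng = np.random.default_rng(2)  # arbitrary seed
x = rng.normal(size=200)        # any series will do; the EWMA assumes no structure
theta = 0.66                    # discount factor, the value used later in the text

# recursive form: xbar_t = (1 - theta) x_t + theta xbar_{t-1}
xbar = np.empty_like(x)
xbar[0] = x[0]                  # start-up convention (an assumption)
for t in range(1, len(x)):
    xbar[t] = (1 - theta) * x[t] + theta * xbar[t - 1]

# direct form: truncated discounted average of present and past values
t = len(x) - 1
weights = (1 - theta) * theta ** np.arange(t + 1)
direct = np.dot(weights, x[t::-1])       # x_t, x_{t-1}, ..., x_0

print(abs(xbar[-1] - direct) < 1e-6)     # → True: the forms agree
```

The recursive form is what makes the EWMA practical: each update needs only the newest observation and the previous average, not the whole history.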

Consideration of when this might be ‘the best thing’ to do leads to the assumption that the prediction errors e_t = x_t − x̄_{t-1} are white noise, completely unpredictable from (independent of) past information. On substituting for x̄_{t-1} this becomes:
4.3.4

x_t = x̄_{t-1} + e_t
(1 − θB)x_t = (1 − θ)x_{t-1} + (1 − θB)e_t
(1 − B)x_t = (1 − θB)e_t.

Thus the first difference ∇x_t of x_t follows a MA(1) model.

This is known as the Integrated Moving Average, or IMA(1,1), model for the series x_t. The series of weekly business transactions is well represented by this model: recall that its first differences (after a log transformation) followed a MA(1) model (see Figures 6–8).

Conversely, if an IMA(1,1) series {x_t}, given by (1 − B)x_t = (1 − θB)e_t where {e_t} is white noise, is predicted using the EWMA, then the prediction errors x_t − x̄_{t-1} are precisely the white noise e_t. To see this, note that

e_t = (1 − B)(1 − θB)^{-1} x_t = (1 − B)(1 + θB + θ²B² + ⋯)x_t.

Collecting the coefficients of {B^j ; j ≥ 0}, the right-hand side is equal to x_t − x̄_{t-1}.
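This equivalence can be illustrated numerically: simulate an IMA(1,1) series, form the EWMA one-step predictions, and check that the prediction errors reproduce the innovations once start-up transients have decayed. The start-up conventions and parameter values below are assumptions made for the sketch.

```python
import numpy as np

rng = np.random.default_rng(3)    # arbitrary seed
theta, n = 0.66, 1000             # assumed parameter and series length
e = rng.normal(size=n)            # the white-noise innovations

# generate an IMA(1,1) series: (1 - B)x_t = (1 - theta*B)e_t
x = np.empty(n)
x[0] = e[0]                       # start-up convention (an assumption)
for t in range(1, n):
    x[t] = x[t - 1] + e[t] - theta * e[t - 1]

# EWMA one-step predictions, started at the first observation
xbar = np.empty(n)
xbar[0] = x[0]
for t in range(1, n):
    xbar[t] = (1 - theta) * x[t] + theta * xbar[t - 1]

# prediction errors x_t - xbar_{t-1} should match the innovations e_t
err = x[1:] - xbar[:-1]
print(np.allclose(err[50:], e[1:][50:], atol=1e-6))  # → True after transients
```

The residual discrepancy at time t is of order θ^t, coming from the arbitrary start-up of the recursion, which is why the first few terms are skipped in the comparison.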

Figure 25 shows a plot of that series, with the EWMA prediction of each term added to the figure. The smoothing parameter was estimated to be θ = 0.66, a typical value. For the purposes of illustration, another plot is shown using θ = 0.9; the EWMA is then much smoother, but a less accurate predictor.

Figure 25: EWMA predictions of the weekly transactions series, using smoothing parameters 0.66 and 0.9.