2.1.1 Introduction. Many time series are evidently not IID - they may have long runs of values of the same sign, or show other patterns of association between successive values. Yet they often show a statistical similarity of appearance throughout their sample length. This may be the case after trends, seasonality and strong cyclical features have been removed by regression.
Differencing a time series, i.e. looking at the change from one sample point to another, may also lead to such an appearance, an example being inflation levels - monthly price increases - which are generally of more concern than the absolute price which continues its trend inexorably.
In this section we shall make simple assumptions about the structure of a single, or univariate, time series, so we shall now use the notation $X_t$ rather than the $Y_t$ which was more appropriate for the regression context.
The assumption that the statistical behaviour of a series is not changing as time progresses is called stationarity. Its simplest description or measure uses the correlations between values in the series and leads to the following:
2.1.2 Definition. A time series $\{X_t\}$ is second order (weakly) stationary if

(i) $\mathrm{E}(X_t) = \mu$ and $\mathrm{Var}(X_t) = \sigma^2$ are the same for all $t$;

(ii) for each $k \geq 1$, $\mathrm{Cov}(X_t, X_{t+k}) = \gamma_k$ are the same for all $t$.

Note that $\gamma_0 = \mathrm{Var}(X_t) = \sigma^2$. If we also consider negative lags, $\gamma_{-k} = \gamma_k$.

The set of values $\{\gamma_k\}$ is called the autocovariance function (of the lag $k$) for the series $\{X_t\}$.
Note: strict stationarity means that the joint pdf of any set of values of the series is the same as that of the values all shifted in time by the same lag.
Also, a series is said to be Gaussian if this joint pdf is Normal. The assumptions of second order stationarity and Gaussianity together imply strict stationarity.
2.1.3 Definition. The autocorrelation function (acf) of a stationary time series is
$$\rho_k = \frac{\gamma_k}{\gamma_0} = \frac{\mathrm{Cov}(X_t, X_{t+k})}{\mathrm{Var}(X_t)}.$$
Again it is at times useful to think of $\rho_k$ as being defined also for negative $k$, such that $\rho_{-k} = \rho_k$. Obviously $\rho_0 = 1$ always.
2.1.4 Examples.
(a) If $\{X_t\}$ is white noise then by definition $\rho_k = 0$ for $k \neq 0$.

(b) Take $\{e_t\}$ to be white noise with variance $\sigma_e^2$ and consider the constructed series $X_t = e_t + e_{t-1}$. Then
$$\gamma_0 = \mathrm{Var}(X_t) = 2\sigma_e^2, \qquad \gamma_1 = \mathrm{Cov}(X_t, X_{t+1}) = \sigma_e^2, \qquad \rho_1 = \tfrac{1}{2}.$$
It is similarly found that $\gamma_k = 0$ for $k > 1$, so $\rho_k = 0$ for $k > 1$.
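As a numerical illustration of Example (b), the sketch below (assuming NumPy is available, and taking the constructed series to be $X_t = e_t + e_{t-1}$ as above) simulates a long realisation and checks that the sample lag-1 autocorrelation is close to $1/2$ while the lag-2 value is near zero.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000
e = rng.standard_normal(n + 1)      # white noise e_t
x = e[1:] + e[:-1]                  # constructed series X_t = e_t + e_{t-1}

d = x - x.mean()
# sample autocovariances c_k with divisor n
c = [np.sum(d[: n - k] * d[k:]) / n for k in range(3)]
r1, r2 = c[1] / c[0], c[2] / c[0]
print(r1, r2)   # r1 should be close to 0.5, r2 close to 0
```

The sampling variability of $r_k$ here is of order $1/\sqrt{n}$, so with $n = 100{,}000$ the agreement with the theoretical values is close.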
Given a time series data set $x_1, x_2, \ldots, x_n$, the following estimates of the above quantities are used. Note however that they differ from the usual sample estimates used in statistics because they are not based on independent observations.
2.1.5 Time series sample estimates.
sample mean, an estimate of $\mu$: $\bar{x} = \frac{1}{n}\sum_{t=1}^{n} x_t$

sample variance, an estimate of $\sigma^2$: $c_0 = \frac{1}{n}\sum_{t=1}^{n} (x_t - \bar{x})^2$

sample autocovariance, an estimate of $\gamma_k$: $c_k = \frac{1}{n}\sum_{t=1}^{n-k} (x_t - \bar{x})(x_{t+k} - \bar{x})$ for $0 \leq k \leq n-1$, and $0$ if $k \geq n$.

sample autocorrelation, an estimate of $\rho_k$: $r_k = c_k / c_0$
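These estimates are straightforward to compute directly. The sketch below (assuming NumPy; the function name `sample_acf` is our own) implements the definitions of 2.1.5 with the divisor $n$ throughout.

```python
import numpy as np

def sample_acf(x, max_lag):
    """Return (c, r): sample autocovariances c_k and autocorrelations
    r_k = c_k / c_0 for k = 0, ..., max_lag, using the divisor n."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    d = x - x.mean()
    c = np.array([np.sum(d[: n - k] * d[k:]) / n for k in range(max_lag + 1)])
    return c, c / c[0]

# small worked example: c_0 is the sample variance, r_0 is always 1
c, r = sample_acf([1.0, 2.0, 3.0, 4.0], 2)
```

For the series $1, 2, 3, 4$ this gives $c_0 = 1.25$ and $r_1 = 0.25$, which can be checked by hand from the definitions.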
Remark. The divisor of $n$ is used for defining $c_0$ and $c_k$. Some authors use the divisor $n - k$ when defining $c_k$.
Warning. These sample quantities can be automatically generated for any data set so the assumption of stationarity requires some check - usually a visual inspection of the series - before they are used.
In particular if the data contain trends, seasonality or cycles which appear deterministic and have not been removed by regression, then they will affect, and tend to dominate, the pattern of the sample acf, obscuring other statistical features. However, series may appear trend-like in a short sample yet stationary in the long term. The appearance depends on sample length. Figure 4 shows the sample autocorrelations of the residuals from the harmonic model fitted to the CO2 series. The figure also shows the series, the partial autocorrelations, and the sample spectral density (known as the periodogram), which we shall define later in this course. Figure 5 shows similar plots for the random series.
Consider $n$ successive time series values as a vector of random variables
$$X = (X_1, X_2, \ldots, X_n)'$$
where $'$ indicates transpose. For a stationary series, the autocorrelation function determines their correlation matrix as
$$P_n = \begin{pmatrix} 1 & \rho_1 & \rho_2 & \cdots & \rho_{n-1} \\ \rho_1 & 1 & \rho_1 & \cdots & \rho_{n-2} \\ \rho_2 & \rho_1 & 1 & \cdots & \rho_{n-3} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ \rho_{n-1} & \rho_{n-2} & \rho_{n-3} & \cdots & 1 \end{pmatrix} \tag{2.2.1}$$
and the covariance matrix is defined by $\Gamma_n = \sigma^2 P_n$.
The matrices $P_n$ and $\Gamma_n$ have special structure: the elements are the same down any diagonal. Such matrices are called Toeplitz matrices.
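To make the Toeplitz structure of 2.2.1 concrete, a minimal sketch (assuming NumPy; `corr_matrix` is our own name) builds the correlation matrix from a vector of autocorrelations, so that element $(s, t)$ is $\rho_{|s-t|}$.

```python
import numpy as np

def corr_matrix(rho):
    """Build the Toeplitz correlation matrix from
    rho = [rho_0, rho_1, ..., rho_{n-1}] with rho_0 = 1:
    element (s, t) is rho[|s - t|], the same down any diagonal."""
    rho = np.asarray(rho, dtype=float)
    n = len(rho)
    lags = np.abs(np.arange(n)[:, None] - np.arange(n)[None, :])
    return rho[lags]

P = corr_matrix([1.0, 0.5, 0.2])
# P has 1s on the diagonal, 0.5 on the first off-diagonals,
# and 0.2 in the two corners
```

Multiplying the result by $\sigma^2$ gives the corresponding covariance matrix.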
Covariance matrices are useful for calculating the variance of a linear combination of variables such as
$$Y = a'X = \sum_{t=1}^{n} a_t X_t$$
where $a = (a_1, a_2, \ldots, a_n)'$, as
$$\mathrm{Var}(Y) = a' \Gamma_n a = \sum_{s=1}^{n}\sum_{t=1}^{n} a_s a_t \gamma_{|s-t|}. \tag{2.2.2}$$
A simple example is to evaluate $\mathrm{Var}(\bar{x})$ by taking $a = (1/n, 1/n, \ldots, 1/n)'$. The result is just $\sigma^2 / n^2$ multiplied by the sum of the elements of $P_n$:
$$\mathrm{Var}(\bar{x}) = \frac{\sigma^2}{n^2} \sum_{s=1}^{n} \sum_{t=1}^{n} \rho_{|s-t|}. \tag{2.2.3}$$
We can deduce, by collecting the elements of $P_n$ by diagonal,
$$\mathrm{Var}(\bar{x}) = \frac{\sigma^2}{n} \left\{ 1 + 2 \sum_{k=1}^{n-1} \left(1 - \frac{k}{n}\right) \rho_k \right\}.$$
For large $n$ this may be approximated by
$$\mathrm{Var}(\bar{x}) \approx \frac{\sigma^2}{n} \sum_{k=-\infty}^{\infty} \rho_k. \tag{2.2.4}$$
A similar formula holds for the sample variance $c_0$ if the series is Gaussian, such that
$$\mathrm{Var}(c_0) \approx \frac{2\sigma^4}{n} \sum_{k=-\infty}^{\infty} \rho_k^2. \tag{2.2.5}$$
Remark. For highly autocorrelated series precise estimation of the mean and variance is very difficult because of the high magnitude of $\sum_k \rho_k$ and $\sum_k \rho_k^2$ appearing in the variances.
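As a numerical check of the finite-$n$ variance of the sample mean in 2.2.3 against the large-$n$ approximation 2.2.4, the sketch below (assuming NumPy) uses a hypothetical geometric acf $\rho_k = \phi^{|k|}$, for which the infinite sum in 2.2.4 has the closed form $(1+\phi)/(1-\phi)$.

```python
import numpy as np

phi, sigma2, n = 0.6, 1.0, 500        # hypothetical acf rho_k = phi**|k|
k = np.arange(1, n)

# finite-n variance of the sample mean: the double sum of 2.2.3
# collected by diagonal of P_n
exact = (sigma2 / n) * (1 + 2 * np.sum((1 - k / n) * phi**k))

# large-n approximation 2.2.4: (sigma2/n) times the sum of rho_k
# over all integer lags
approx = (sigma2 / n) * (1 + phi) / (1 - phi)

print(exact, approx)   # the two agree closely for n = 500
```

Note that both values are roughly four times $\sigma^2/n$ here, illustrating the Remark above: positive autocorrelation substantially inflates the variance of the sample mean.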
The autocorrelation measures the direct relationship between two values of a time series at different lags. In time series analysis an alternative measure is the strength of association conditional upon the values in between.
2.3.1 Definition. The partial autocorrelation (pacf) at lag $k$ for a stationary time series is
$$\phi_{k,k} = \mathrm{Corr}(X_t, X_{t+k} \mid X_{t+1}, \ldots, X_{t+k-1}).$$
This may be calculated from the covariance matrix of $(X_t, X_{t+1}, \ldots, X_{t+k})$ and so it depends only on $\rho_1, \ldots, \rho_k$. The same quantities are in fact used when the Gaussian assumption is not appropriate, and we shall see later their value and interpretation in the context of model selection.
2.3.2 The sample partial autocorrelations.
Sample values of the pacf are calculated simply by using the sample values $r_k$ in place of the acf $\rho_k$ in the calculations.
A similar warning to that given regarding sample autocorrelations also applies here, that any strong trends and other deterministic components not removed from the series will tend to dominate the appearance of the sample pacf.
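One common way to carry out the calculation of 2.3.2 is the Durbin-Levinson recursion, which obtains $\phi_{k,k}$ from the sample autocorrelations $r_1, \ldots, r_k$. The sketch below (assuming NumPy; the function name is our own, and the recursion is a standard method rather than one the notes specify) implements it and illustrates the behaviour on a simulated first-order autoregression, where the pacf should cut off after lag 1.

```python
import numpy as np

def sample_pacf(x, max_lag):
    """Sample partial autocorrelations phi_{k,k} for k = 1..max_lag,
    computed from the sample acf r_k via the Durbin-Levinson recursion."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    d = x - x.mean()
    r = np.array([np.sum(d[: n - k] * d[k:]) for k in range(max_lag + 1)])
    r = r / r[0]                          # sample autocorrelations r_k
    phi = np.zeros((max_lag + 1, max_lag + 1))
    pacf = np.zeros(max_lag)
    for k in range(1, max_lag + 1):
        prev = phi[k - 1, 1:k]            # coefficients from order k-1
        num = r[k] - prev @ r[1:k][::-1]
        den = 1.0 - prev @ r[1:k]
        phi[k, k] = num / den
        phi[k, 1:k] = prev - phi[k, k] * prev[::-1]
        pacf[k - 1] = phi[k, k]
    return pacf

# illustration on a simulated AR(1) series x_t = 0.7 x_{t-1} + e_t
rng = np.random.default_rng(1)
n = 20_000
e = rng.standard_normal(n)
x = np.empty(n)
x[0] = e[0]
for t in range(1, n):
    x[t] = 0.7 * x[t - 1] + e[t]
p = sample_pacf(x, 3)   # p[0] near 0.7; p[1], p[2] near 0
```

The lag-1 sample pacf equals $r_1$ by construction, and for this series the higher-lag values are of order $1/\sqrt{n}$, consistent with the cut-off property that will make the pacf useful for model selection.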