Home page for accesible maths 8.1.2 Variance of least squares estimator

Style control - access keys in brackets

Font (2 3) - + Letter spacing (4 5) - + Word spacing (6 7) - + Line spacing (8 9) - +

8.2 Linear combinations of regression coefficients

Recall model 6.2 for birth weight in Example 6.4.5. This model includes separate intercepts for males (β1) and females (β2). We might be interested in the difference between male and female birth weights, β1-β2, estimated by β^1-β^2. In particular, we might be interested in testing whether or not there is a difference between β1 and β2,

H0:β1-β2=0

vs.

H1:β1-β20.

What is an appropriate test statistic for this test? What sampling distribution should we use to obtain the critical region, p-value or confidence interval? Since β^1-β^2 is a linear combination of the regression coefficients, we can find its distribution, and hence a test statistic for this test.

In general for the linear combination

aβ^=a1β^1++apβ^p,

then

𝔼[aβ^] =a𝔼[β^]
=aβ.

and

Var(aβ^) =aVar(β^)a
=aσ2(XX)-1a
=σ2a(XX)-1a.

Further, because β^ follows a multivariate normal distribution, so aβ^ follows a normal distribution, {mdframed}

aβ^N(aβ,σ2a(XX)-1a).

In practice, the unknown residual variance σ2 is replaced with the estimate σ^2.

TheoremExample 8.2.1 Birth weights cont.

Recall that model 6.2 for birthweight is

𝔼[Yi]=β1xi,1+β2xi,2+β3xi,3

where xi,1 and xi,2 are indicators for male and female respectively, and xi,3 is gestational age. Using the data in Table 7.1, we have

X=[104010381040103801400140],
(XX)-1=[19.719.8-0.51219.820.1-0.517-0.512-0.5170.0133],
σ^2=31370.

What are the expectation and variance of β^1-β^2?

First write β^1-β^2=a(β^1,β^2,β^3) for some a,

β^1-β^2=1.β^1+(-1)β^2+(0)β^3=(1,-1,0)β^.

So a=(1,-1,0). Consequently,

𝔼[β^1-β^2] =(1,-1,0)β
=β1-β2

and

Var(β^1-β^2) =31370×(1,-1,0)[19.719.8-0.51219.820.1-0.517-0.512-0.5170.0133][1-10]
=31370×0.169
=5301.