MATH235 MATH235 Week 2 - Workshop problems MATH235 Week 2 - Moodle Quiz-assessed problems

MATH235 Week 2 - Assessed problems (coursework)

Submission is due on Tuesday in Week 3.

CW2.1

Table 0.8 gives the consumer price index (CPI) for the UK, US, France and Germany from 2001 to 2010. CPI is a measure of the change in time of the price paid by the customer for a basket of fixed goods and services; here it has been standardised to be 100 in 2005. The data were provided by the World Bank International Monetary Fund and were obtained from http://datamarket.com/.

Year	France	Germany	United Kingdom	United States
2001	92.5	94.5	94.2	90.7
2002	94.3	95.8	95.4	92.1
2003	96.2	96.8	96.7	94.2
2004	98.3	98.5	98.0	96.7
2005	100.0	100.0	100.0	100.0
2006	101.7	101.6	102.3	103.2
2007	103.2	103.9	104.7	106.2
2008	106.1	106.6	108.5	110.2
2009	106.2	107.0	110.8	109.9
2010	107.8	108.2	114.5	111.7

Table 0.5: Consumer price index (CPI) from 2001 to 2010 for four western countries. CPI has been standardised to 100 in 2005.

Assume that the data for the US are an IID sample from a Normal $(\mu_{1},\sigma^{2}_{1})$ distribution and that the data for Germany are an IID sample from a Normal $(\mu_{2},\sigma^{2}_{2})$ distribution. We wish to test whether or not the mean CPI for the US is the same as the mean CPI for Germany.

(a)

Explain why a paired approach is appropriate for this test.

[marks: 1]
(b)

Calculate the differences $d_{i}=x_{i}-y_{i}$ .

[marks: 1]
(c)

Using your answer to part (b) or otherwise, test at the 10% level whether or not there is evidence that the mean CPI for the US is the same as the mean CPI for Germany. You should state your $p$ -value.

[marks: 3]

CW2.2

Table 0.6 shows UK interest rates for January split into two decades, 1960–1969 and 1970–1979. Is there evidence that the mean interest rate in the 1960’s is lower than the mean interest rate in the 1970’s?

1960–1969	1970–1979
5.0	8.0
5.0	7.0
6.0	5.0
4.0	8.8
4.0	12.8
7.0	11.0
6.0	10.5
6.5	12.2
8.0	6.5
7.0	12.5

Table 0.6: UK interest rates for January split into two decades (1960–1969 and 1970–1979). Interest rates are in percentages.

(a)

Calculate the pooled sample variance for the data.

[marks: 2]
(b)

Carry out an appropriate $t$ -test at the 5% level. What do you conclude about the difference between the mean interest rates in the two decades?

[marks: 3]

CW2.3

Challenge
The geometric distribution gives probabilities for the number of Bernoulli trials required before the first success. The distribution has a single parameter $p\in[0,1]$ which can be interpreted as the probability of success for each trial. In the game of ludo a ‘six’ must be thrown on a six-sided dice before a player can start. A player recorded the number of throws that they made before obtaining a six in each of ten games. The same dice was used throughout all games, and the results are as follows:

1,0,6,7,0,1,3,11,3,1.

Assuming that these data come from a Geometric $(p)$ distribution, is there evidence that $p$ is greater than $\frac{1}{6}$ , i.e. that the dice is loaded?

(a)

Given that the expectation of a $\mathrm{Geometric}(p)$ random variable is $p^{-1}$ , explain why $\hat{p}=\bar{X}^{-1}$ is a sensible estimator for $p$ . Obtain an estimate of $p$ using the above sample.

[marks: 2]
(b)

By writing appropriate code in R, carry out a non-parametric bootstrap routine to obtain a 90% confidence interval for $p$ . Hint you might want to adapt the code given in workshop question WS2.1.

[marks: 2]
(c)

Is there evidence at the 5% level that $p$ is greater than 1/6?

[marks: 1]