MATH235

MATH235 Week 2 - Assessed problems (coursework)

Submission is due on Tuesday in Week 3.

CW2.1 

Table 0.8 gives the consumer price index (CPI) for the UK, US, France and Germany from 2001 to 2010. CPI is a measure of the change in time of the price paid by the customer for a basket of fixed goods and services; here it has been standardised to be 100 in 2005. The data were provided by the World Bank International Monetary Fund and were obtained from http://datamarket.com/.

Year France Germany United Kingdom United States
2001 92.5 94.5 94.2 90.7
2002 94.3 95.8 95.4 92.1
2003 96.2 96.8 96.7 94.2
2004 98.3 98.5 98.0 96.7
2005 100.0 100.0 100.0 100.0
2006 101.7 101.6 102.3 103.2
2007 103.2 103.9 104.7 106.2
2008 106.1 106.6 108.5 110.2
2009 106.2 107.0 110.8 109.9
2010 107.8 108.2 114.5 111.7
Table 0.5: Consumer price index (CPI) from 2001 to 2010 for four western countries. CPI has been standardised to 100 in 2005.

Assume that the data for the US are an IID sample from a Normal(μ1,σ12) distribution and that the data for Germany are an IID sample from a Normal(μ2,σ22) distribution. We wish to test whether or not the mean CPI for the US is the same as the mean CPI for Germany.

  1. (a)

    Explain why a paired approach is appropriate for this test.

    [marks: 1]

  2. (b)

    Calculate the differences di=xi-yi.

    [marks: 1]

  3. (c)

    Using your answer to part (b) or otherwise, test at the 10% level whether or not there is evidence that the mean CPI for the US is the same as the mean CPI for Germany. You should state your p-value.

    [marks: 3]

CW2.2 

Table 0.6 shows UK interest rates for January split into two decades, 1960–1969 and 1970–1979. Is there evidence that the mean interest rate in the 1960’s is lower than the mean interest rate in the 1970’s?

1960–1969 1970–1979
5.0 8.0
5.0 7.0
6.0 5.0
4.0 8.8
4.0 12.8
7.0 11.0
6.0 10.5
6.5 12.2
8.0 6.5
7.0 12.5
Table 0.6: UK interest rates for January split into two decades (1960–1969 and 1970–1979). Interest rates are in percentages.
  1. (a)

    Calculate the pooled sample variance for the data.

    [marks: 2]

  2. (b)

    Carry out an appropriate t-test at the 5% level. What do you conclude about the difference between the mean interest rates in the two decades?

    [marks: 3]

CW2.3 

Challenge
The geometric distribution gives probabilities for the number of Bernoulli trials required before the first success. The distribution has a single parameter p[0,1] which can be interpreted as the probability of success for each trial. In the game of ludo a ‘six’ must be thrown on a six-sided dice before a player can start. A player recorded the number of throws that they made before obtaining a six in each of ten games. The same dice was used throughout all games, and the results are as follows:

1,0,6,7,0,1,3,11,3,1.

Assuming that these data come from a Geometric(p) distribution, is there evidence that p is greater than 16, i.e. that the dice is loaded?

  1. (a)

    Given that the expectation of a Geometric(p) random variable is p-1, explain why p^=X¯-1 is a sensible estimator for p. Obtain an estimate of p using the above sample.

    [marks: 2]

  2. (b)

    By writing appropriate code in R, carry out a non-parametric bootstrap routine to obtain a 90% confidence interval for p. Hint you might want to adapt the code given in workshop question WS2.1.

    [marks: 2]

  3. (c)

    Is there evidence at the 5% level that p is greater than 1/6?

    [marks: 1]