Review 2 2004.6.9
Chapter 12
Least square estimate and fitted equation
Testing hypothesis (F statistic and t statistic) and confidence interval
prediction
residual plots
Example:
Given are 5 observations for two variables x and y.
/ 2 / 3 / 5 / 1 / 8/ 25 / 25 / 20 / 30 / 16
Suppose the model is
,
(a)Find the least square estimate and the fitted regression equation
(b)Provide an ANOVA table and use F statistic to test at .
(c) Use t statistic to test at .
(d)Find the 95% confidence interval for .and use the confidence interval to test .
(e)Find the ANOVA table for and use F statistic to test the hypothesis at .
(f)Determine and the sample correlation coefficient.
(g)Find the 90% confidence interval for .
(h)Develop a plot of the residuals.
[solutions:]
(a)Since
thus,
Then, the least square estimate is
The fitted regression equation is
.
(b)
Since
The ANOVA table is
Source / df / SS / MS / FRegression / 1 / SSR=108.467 / /
Residual (Error) / n-2=3 / SSE=6.333 /
Total (corrected) / n-1=4 / SST=114.8
Since , we reject .
(c)
.
Since
,
we do not reject .
(d)
The 95% confidence interval for is
.
Since , we do not reject.
(e)
Since , the ANOVA table for is
Source / df / SS / MS / FRegression / 2 / 2799.667 / /
Residual (Error) / n-2=3 / SSE=6.333 /
Total / 5 / 2806
Since , we reject
(f)
,
,
(g)
Since , the 90% confidence interval for is
(h)
The residuals are
-1.577922 / 0.2987013 / -0.9480519 / 1.545455 / 0.6818182Chapter 11
Interval estimate, hypothesis tests, and p-value of the population proportion difference .
test for proportions of a multinomial population and for the independence (contingency table).
Example:
The following are the number of wrong answers for the number of the students.
Number of wrong answers / 0 / 1 / 2 / 3Number of the students / 21 / 31 / 12 / 0
Suppose X is the random variable representing the number of wrong answers. Please test X is distributed as Binomial(3,0.25) with .
(Note: the distribution function for Binomial(3,0.25) is
.
[solutions:]
As is true, the distribution for the number of wrong answers is
Since the sample size , the expected numbers under are
.
Therefore,
Since
,
we do not reject .
Chapter 10
Interval estimate, hypothesis tests, and p-value of the population mean difference .
Two methods, independent samples and matched samples, can be used for the population mean difference .
ANOVA for testing the equality of k population means.
Example:
The following data are from 4 different populations.
Population 1 / 8.2 / 8.7 / 9.4 / 9.2Population 2 / 7.7 / 8.4 / 8.6 / 8.1 / 8.0
Population 3 / 6.9 / 5.8 / 7.2 / 6.8 / 7.4 / 6.1
Population 4 / 6.8 / 7.3 / 6.3 / 6.9 / 7.1
Let , , and be the mean number of products of the 4 production lines.
(a) Provide the ANOVA table.
(b) Please test the hypothesis with .
[solution:]
(a)
,
Thus,
.
The ANOVA table is
Source / SS / df / MS / FBetween / SSB=15.462 / / /
Within / SSW=3.888 / /
Total / 19.35 /
(b)
Since
,
we reject .
1