251y0011 10/11/00 ECO251 QBA1 Name ____KEY______

FIRST HOUR EXAM SECTION MWF 10 11 TR 11 12:30

OCTOBER 7, 2000

Part I. Multiple Choice (10 points)

1.(D7-1) The major contribution of inferential statistics is that it

a. Allows us to take population information and make statements about samples.

b. Gives us a description of data contained in a sample.

c. Gives us a description of data contained in a population.

*d. Allows us to take sample information and make statements about the population.

e. None of the above.

2.(S-3) Debit balances owed in a retail store are an example of

a. Ordinal data.

b. Nominal data.

c. Interval data.

*d. Ratio data.

e. None of the above.

3. A used automobile dealer lists cars in the following classes. A - 100,000 miles or more on the odometer, B - less than 100,000 miles on the odometer, C - Diesel. Are these three categories

a. Mutually exclusive?

*b. Collectively exhaustive?

c. Both mutually exclusive and collectively exhaustive?

d. Neither mutually exclusive or collectively exhaustive?

e. Can't tell with the information given.

4. (D7-9)If a distribution is skewed to the right, we can say that it is likely that

*a. Mean > median > mode

b. Median > mean > mode

c. Mode > median > mean

d. Mode > mean > median

e. Mode = mean = median (Most people got this backwards - make a diagram!)

5. A graph that connects points, each of which represents the frequency is called a

a. Histogram

b. Ogive

*c. Frequency Polygon

d. Pie chart

e. None of the above


251y0011 10/11/00

Part II. Compute an appropriate answer, showing your work (except in a)) (15 Points maximum - if you do more than 15 points, only your right answers will be counted.):

a) Fill in the following table (3)

Class / / /
50-59.99 / _ / .12 / __
60-69.99 / 4 / __ / __
70-79.99 / _ / __ / 12
80-89.99 / 6 / __ / __
90-99.99 / 7 / _ / __
Total / 25 / __

Solution:

Class / / /
50-59.99 / 3 / .12 / 3
60-69.99 / 4 / .16 / 7
70-79.99 / 5 / .20 / 12
80-89.99 / 6 / .24 / 18
90-99.99 / 7 / .28 / 25
Total / 25 / 1.00

Note that .

b) Assume that we have sold 1000 life insurance policies in amounts between $5200 and $9800. If this data is to be presented in eight classes, what intervals would you use? Explain your reasoning using the appropriate formula and make a table showing the class intervals you would actually use. (3)

Solution: so use 600. This is only a suggestion. Any number somewhat above 575 will work.

Class / From / To
A / 5200 / 5799.99
B / 5800 / 6399.99
C / 6400 / 6999.99
D / 7000 / 7599.99
E / 7600 / 8199.99
F / 8200 / 8799.99
G / 8800 / 9399.99
H / 9400 / 9999.99

c) (S-30)If a population of 1000 items with an unknown distribution has a mean of 12 and a standard deviation of 1.2, what is the approximate minimum number of items that must be (i) between 6 and 18? (ii) What is the maximum that can be above 18? (3)

Solution: (i) If we use the formula , we find that and According to the Chebyshef inequality, the minimum fraction of the data that must be between is . of 1000 is 960. (ii) The answer is the opposite to the answer to (i). There are about 1000 - 960 = 40 items left over. All of these could be above 18.


251y0011 10/11/00

d) Do c) again assuming that the distribution is unimodal and symmetric.(2)

Solution: Since the Empirical Rule says that almost all points must be between , we would expect almost all of the 1000 points to be between 6 and 18 since these points are , and we would be quite surprised if even one point is above 18.

e) For the numbers 11.1, 13.2, 15.1 and 12.7, compute the i) Root-mean-square ii) Harmonic mean, iii) Geometric mean (2 each)

Solution: Note that . This is not used in any of the following calculations and there is

no reason why you should have computed it!

(i) The Root-Mean-Square.

. So .

(ii) The Harmonic Mean.

. So .

(iii) The Geometric Mean.

.

Or

. So . I got the last result by putting 2.56086 into the calculator and pressing 'inverse' and then 'ln x.'

Or

. So . I got the last result by putting 1.11217 into the calculator and pressing 'inverse' and then 'log x.'

Notice that the original numbers and all the means are between 11.1 and 15.1. In spite of everything that I said, there are many of you who think that: (i) You can find a sum of squares by summing numbers and squaring the sum; (ii) You can find the sum of by adding up the numbers and taking the reciprocal; (iii) You can find an nth by dividing by n. I can only recommend a remedial math class (unless, of course, you want to try listening in class and checking out the homework very carefully.)


251y0011 10/11/00

Part III. Do the following problems (25 Points)

1. In a period of 7 days you make the following numbers of sales(in millions):

Day : 1 2 3 4 5 6 7

3

Sales: 9.2 10.2 9.2 11.2 19.5 12.2 13.2

Compute the following (assuming that the numbers are a sample):

a) Mean Sales (1)

b) The Median (1)

c) The Standard Deviation (3)

d) The 2nd Quintile (2)

Solution: Compute the Following: Index

Note that x is in order 1 9.2 84.64 -2.9 8.41

2 9.2 84.64 –2.9 8.41

3 10.2 104.04 –1.9 3.61

4 11.2 125.44 or -0.9 0.81

5 12.2 148.84 0.1 0.01

6 13.2 174.24 1.1 1.21

7 19.5 380.25 7.4 54.76

84.7 1102.09 0.0 77.22

Isn't it wonderful how predictable so many of you are! I strongly recommended that you compute the variance by the computational formula in both this and the next problem. Many of you ignored me. Two thirds of those who used the definitional formula got the problem wrong because they had not checked out the method enough so that they knew what the formula meant. is not as some of you seem to have fooled yourselves into believing. Nor is equal to . If you had tried these in any of the homework problems, you would have found that these tricks didn’t work.
Note that, to be reasonable, the mean, median and 2nd quintile must fall between 9.2 and 19.5.

,, ,.

a)

b) Just put the numbers in order and pick the middle number, 11.2.

Or formally:

so

c) or

d) The 2nd quintile has 40% below it.

so

I warned you about quintiles - they are fifths, not fourths. This is an excellent warning! You can't answer a question that you haven't read carefully!


251y0011 10/11/00

3

2. A bank finds that the amounts overdue on its credit cards are the following. . (Assume that the numbers are a sample.) Are there reasons why so many of you (i) totally ignored the classes, (ii) decided that the frequency column was both and , (iii) computed the column by taking each value of and squaring it after I had specifically warned you not to?

3

amount (thousands) frequency

0-$1.99999 80

$2.000-3.99999 40

$4.000-5.99999 30

$6.000-7.99999 30

$8.000-9.99999 20

$10.000 and up 0

a. Calculate the Cumulative Frequency (1)

b. Calculate The Mean (1)

c. Calculate the Median (2)

d. Calculate the Mode (1)

e. Calculate the Variance (3)

f. Calculate the Standard Deviation (2)

g. Calculate the Interquartile Range (3)

h. Calculate a Statistic showing Skewness and Interpret it (3)

i. Make an histogram of the Data (Neatness Counts!)(2)

3

Solution: is the midpoint of the class. Our convention is to use the midpoint of 0 to 2, not 1.99999.

$0-$1.99999 80 80 1.0 80 80 80 -2.7 -216 583.2 -1574.64

$2.000-3.99999 40 120 3.0 120 360 1080 -0.7 -28 19.6 -13.72

$4.000-5.99999 30 150 5.0 150 750 3750 1.3 39 50.7 65.91

$6.000-7.99999 30 180 7.0 210 1470 10290 3.3 99 326.7 1078.11

$8.000-9.99999 20 200 9.0 180 1620 14580 5.3 106 561.8 2977.54

200 740 4280 29780 0 1542.0 2533.20

and and Note that, to be reasonable, the mean, median and quartiles must fall between 0 and 10. And no, I did not get the 1.0 in the column by rounding 0.999995, or, for that matter, by rounding anything else - Think!

a. Calculate the Cumulative Frequency (1): (See above) The cumulative frequency is the whole column.

b. Calculate the Mean (1):

c. Calculate the Median (2): . This is above 80 and below 120, so the interval is 2-3.99999. so

d. Calculate the Mode (1) The mode is the midpoint of the largest group. Since 80 is the largest frequency, the modal group is 0 to 1.99999 and the mode is 1.000.

e. Calculate the Variance (3): or

f. Calculate the Standard Deviation (2):


251x0011 10/11/00

g. Calculate the Interquartile Range (3): First Quartile: . This is above and below , so the group is 0 to 1.99999. gives us .

Third Quartile: . This is above 150 and below 180, so the group is 6.000 to 7.99999. . .

h. Calculate a Statistic showing Skewness and interpret it (3): .

or

or

or Pearson's Measure of Skewness

Because of the positive sign, the measures imply skewness to the right.

i. Make an histogram of the Data (Neatness Counts!)(2) A histogram is a bar graph of the frequency.

The first bar is between 0 and 2 on the x axis (or has a midpoint at 1) and has a height of 80.

3