251distrex3 1/14/08 (Open this document in 'Page Layout' view!)
Finding Probabilities for the Normal Distribution when the Distribution is not Standardized
We already know from the previous section (251disrtex2) that any probability for a Normally distributed variable (the standard notation to say that a given distribution is Normal with a certain mean and standard deviation is ) can be found using the Standardized Normal distribution by using the transformation .
This means that if and we want , we replace values of with . We thus say:
251distrex3 1/14/08 (Open this document in 'Page Layout' view!)
Since there is no particular reason to repeat the problems from class, here are some that I had available that seem to cover all the bases.
Assume that . Do the following:
a.
b.
c. (Cumulative)
d.
e.
f.
g.
h. Find probabilities for the following intervals: Below -3.4,,,, and above 7.4.
Make diagrams.
251distrex3 1/14/08 (Open this document in 'Page Layout' view!)
Solution:Material in italics below is a description of the diagrams you were asked to make or a general explanation, and the written description will not be part of your solution.General comment - I can't give you much credit for an answer with a negative probability or one above 1 because there is no such thing!!! .
a. norm
For make a Normal curve centered at 2 and shade the area from .50 to 4; for , make a Normal curve centered at zero and shade the area from -0.17 to0.22. Since the area in either diagram is on both sides of the mean, you add.
Note that on all these graphs, the x axis should be labeled and a vertical line should be added at zero.
b. norm
In this problem many students made diagrams showing, correctly, a mean of 2, but then showing 0.41 above it. Also, I got many probabilities like , which don't make any sense because -0.20 is below zero and this says that it is above zero. For make a Normal curve centered at 2 and shade the area from -0.41 to 0.41; for make a Normal curve centered at zero and shade the area from –0.27 to -0.18. Since the area in either diagram is on one side of the mean, you subtract. Like all diagrams in these examples, the diagram below is for z, since most students prefer to make z diagrams rather than x diagrams. A z diagram is always centered at zero.An x diagram is always centered at the population mean (2 in this case).Students who did this problem either (a) assumed that if -0.41 became -0.27, then +0.41 would become +0.27 or (b) didn’t get control of their calculators and got +0.41 – 2 for their first calculation. Doing 0 – 0.41 – 2 might help.
c. . This is the Cumulative distribution. ( is a notation for ‘equal by definition’)
We already know from the previous section (251disrtex2) that since the zero is the halfway point under the curve, and . norm
For make a Normal curve centered at 2 and shade the area below 0.24; for make a Normal curve centered at zero and shade the area below -0.20. Since the area in either diagram is on one side of the mean, you subtract.
Moral: For a cumulative distribution when is below the mean, change to , and subtract (from the standardized normal table) from 0.5.
.
d. normThis is the Cumulative distribution. ( is a notation for ‘equal by definition’)
For make a Normal curve centered at 2 and shade the entire area below 3.00; for , make a Normal curve centered at zero and shade the area below 0.11. Since the area in either diagram is on both sides of the mean, you add.
Moral: For a cumulative distribution when is above the mean, change to , and add (from the standardized normal table) to 0.5.
e.
There are two surprises in this problem. First 3.67 is not on the usual standardized table and, second, one value of is zero. For make a Normal curve centered at 2 and shade the area above 2.0; for , make a Normal curve centered at zero and shade the area above 0. Since the area in either diagram starts at the mean, you imply look up.This would also work for something like . To find 3.67, if you have my table,look at the bottom of the table. It is reproduced below. Since 3.67 is between 3.62 and 3.89, we say . If you don’t have my table and you know that probabilities between zero and any number above 4 are .5000 and that the conventional table will tell you that , you can usually make a pretty good guess. norm
For values above 3.09, see below
If is between is
3.08 and 3.10 .4990
3.11 and 3.13 .4991
3.14 and 3.17 .4992
3.18 and 3.21 .4993
3.22 and 3.26 .4994
3.27 and 3.32 .4995
3.33 and 3.38 .4996
3.39 and 3.48 .4997
3.49 and 3.61 .4998
3.62 and 3.89 .4999
3.90 and up .5000
f. norm
Students who did this problem often failed to change zero to -0.22. For make a Normal curve centered at 2 and shade the area from 0 to 1.00; for , make a Normal curve centered at zero and shade the area from -0.22 to 0.11. Since the area in either diagram is on both sides of the mean, you subtract.
g. norm
h. Find probabilities for the following intervals: Below -3.4, , , , and above 7.4. norm
This type of problem is a preface to a procedure commonly used in Economics 252 called the Kolmogorov – Smirnov test or the Chi-squared test, or, in the case with the Normal distribution where and are known rather than and and are used in their place, the Lilliefors test. They are based on the fact that the sums of the differences squared between the proportion of points on intervals in a hypothesized distribution and the proportion of the points on the same interval in a sample from the actual distribution have a well known distribution. To start, we divide the hypothesized distribution into intervals and figure out their probabilities. You can, of course, do each of these separately, but a mass production technique based on the cumulative distribution is somewhat faster, though I am waiting for one of my students to invent one that is more efficient. The way this is done is that you take your values of above in your first column, convert them to in the second column, compute as in c.) and d.) above in the third column, and difference the third column into the 4th column by subtracting each number in the column except the first one from the number above it. For make a Normal curve centered at 2 and shade the areas below-3.4, from -3.4 to -0.07, -0.07 to 2.0, 2.0 to 4.7, 4.7 to 7.4 and above 7.4; for make a Normal curve centered at zero and shade the areas below -0.6, between -0.6 and -0.3 etc. In either case, you will notice that the areas that you marked off are symmetrical about the mean, which means that the probabilities above the mean are unnecessary.
Row Probability
1 -3.4-0.6.5 - .2257 = .2743 .2743 = .2743
2 -0.7-0.3.5 - .1179 = .3821 .3821 - .2743 = .1078
3 2.0 0.0.5 = .5000 .5000 - .3821 = .1179
4 4.7 0.3.5 + .1179 = .6179 .6179 - .5000 = .1179
5 7.4 0.6.5 + .2257 = .7757 .7757 - .6179 = .1078
6 1 = 1.00001.0000 - .7757 = .2743
So , ,,, and .