ST361: Ch 5.5 + Ch 5.6 Sampling Distribution

Topics:

I. What is a Sampling Distribution?

II. Sampling Distribution of a Sample Mean

(a) X ~ Normal Distribution

(b) X ~ Non-normal Distribution

III. Central Limit Theorem

IV. Sampling Distribution of the Sample Proportion p

------

I. Sampling Distribution

·  Population vs. Sample:

·  A Parameter is ______

·  A Statistic is ______

·  The observed value of a statistic depends on the particular sample; hence it ______ from sample to sample. Such variability is called ______

·  The probability distribution of a statistic is called ______


Ex1. A neighborhood has 5 houses A, B, C, D and E. They respectively have 3, 2, 5, 3, and 4 bedrooms. We randomly draw 3 houses at a time and calculate the sample statistics median and mean. What is the sampling distribution of the sample median? What is the sampling distribution of the sample mean?

▪  Population =

▪  Variable of interest =

▪  Sample =

Houses drawn in the sample / # of bedrooms / Sample median / Sample mean
ABC / 3,2,5 / 3 / 10/3 = 3.3
ABD / ______ / ______ / ______
ABE / 3,2,4 / 3 / 9/3 = 3
ACD / 3,5,3 / 3 / 11/3 = 3.7
ACE / 3,5,4 / ______ / ______
ADE / 3,3,4 / ______ / 10/3 = 3.3
BCD / ______ / ______ / ______
BCE / 2,5,4 / 4 / 11/3 = 3.7
BDE / 2,3,4 / 3 / 9/3 = 3
CDE / 5,3,4 / ______ / ______
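
The table in Ex1 can be checked by brute force. The sketch below assumes that "randomly draw 3 houses" means a simple random sample without replacement, so that each of the 10 possible samples is equally likely; the variable names are illustrative only.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations
from statistics import median

# Bedroom counts from Ex1.
bedrooms = {"A": 3, "B": 2, "C": 5, "D": 3, "E": 4}

# All C(5,3) = 10 samples of 3 houses (assumed equally likely).
samples = list(combinations(sorted(bedrooms), 3))

median_counts = Counter()
mean_counts = Counter()
for houses in samples:
    values = [bedrooms[h] for h in houses]
    median_counts[median(values)] += 1
    mean_counts[Fraction(sum(values), 3)] += 1

# Sampling distribution: each possible value of the statistic with its probability.
median_dist = {v: Fraction(c, len(samples)) for v, c in median_counts.items()}
mean_dist = {v: Fraction(c, len(samples)) for v, c in mean_counts.items()}

for name, dist in (("median", median_dist), ("mean", mean_dist)):
    print(f"Sampling distribution of the sample {name}:")
    for value in sorted(dist):
        print(f"  {float(value):.2f}  with probability {dist[value]}")
```

Exact fractions are used for the probabilities so that they sum to exactly 1.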


II. Sampling Distribution of a Sample Mean

Let $\bar{X}$ be the sample mean of a random sample $X_1, \dots, X_n$ from a population with mean $\mu$ and SD $\sigma$. (That is, $\bar{X} = (X_1 + \dots + X_n)/n$.) We want to know the sampling distribution of $\bar{X}$.

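❖  If X ~ Normal (mean = $\mu$, SD = $\sigma$), then $\bar{X}$, the mean of a random sample of n observations,

·  follows a ______, with mean $\mu_{\bar{X}}$ and standard deviation $\sigma_{\bar{X}}$.

·  $\mu_{\bar{X}}$ = ______, and $\sigma_{\bar{X}}$ = ______

·  $\sigma_{\bar{X}}$ is also called the standard error (SE) of $\bar{X}$, or the standard error of the mean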

Ex 2. Thousands of boxes contain nuts. The weights are normally distributed with mean $\mu$ = 1 lb and SD $\sigma$ = 0.01 lb. We inspect 4 boxes and get their weights $X_1, X_2, X_3, X_4$. The sample mean is $\bar{X} = (X_1 + X_2 + X_3 + X_4)/4$.

(a) What is the sampling distribution of $\bar{X}$? What are the mean and SE of $\bar{X}$?

(b) What is the probability that $\bar{X}$ lies between 0.99 and 1.01 lb?
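
One way to check the arithmetic in Ex 2 numerically is sketched below. It relies on the result stated above for a normal population (the sample mean is normal with SE equal to the population SD divided by the square root of n) and uses `NormalDist` from Python's standard library; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 1.0, 0.01, 4      # values stated in Ex 2

se = sigma / sqrt(n)             # standard error of the sample mean
xbar_dist = NormalDist(mu=mu, sigma=se)

# P(0.99 < sample mean < 1.01) as a difference of two CDF values.
prob = xbar_dist.cdf(1.01) - xbar_dist.cdf(0.99)
print(f"SE = {se:.4f} lb, P(0.99 < sample mean < 1.01) = {prob:.4f}")
```

The interval is exactly 2 SEs on either side of the mean, so the result should agree with the standard normal table value for P(-2 < Z < 2).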

❖  If X ~ any non-normal distribution with mean = $\mu$ and SD = $\sigma$, then the sampling distribution of $\bar{X}$ based on samples of size n is

(a) If n is small (i.e., ______), then

·  Distribution:

·  Mean and SE:

(b) If n is large (i.e., ______), then

·  Distribution:

·  Mean and SE:

❖  These results follow from the Central Limit Theorem (CLT)

III. Central Limit Theorem

Assume X follows an arbitrary distribution with mean $\mu$ and SD $\sigma$.

When the sample size is sufficiently large (i.e., n ≥ 30), the sampling distribution of $\bar{X}$ is approximately normal with mean $\mu_{\bar{X}} = \mu$ and SE $\sigma_{\bar{X}} = \sigma/\sqrt{n}$.

·  Usually the ______ a distribution is, the ______ the sample size needs to be to ensure normality of $\bar{X}$
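
The CLT can be illustrated with a small simulation. The sketch below is only an illustration under assumptions that are not part of these notes: the population is taken to be Exponential with rate 1 (a skewed distribution with mean 1 and SD 1), and the sample sizes 5 and 40, the seed, and the number of repetitions are arbitrary choices.

```python
import random
from math import sqrt
from statistics import mean, pstdev, stdev

random.seed(361)  # arbitrary seed so the run is repeatable

def sample_means(n, reps=5000):
    """Means of `reps` random samples of size n from Exponential(rate=1)."""
    return [sum(random.expovariate(1.0) for _ in range(n)) / n for _ in range(reps)]

def skewness(xs):
    """Sample skewness: average cubed z-score (about 0 for a symmetric shape)."""
    m, s = mean(xs), pstdev(xs)
    return sum(((x - m) / s) ** 3 for x in xs) / len(xs)

mu, sigma = 1.0, 1.0  # mean and SD of the assumed Exponential(1) population
results = {}
for n in (5, 40):
    means = sample_means(n)
    results[n] = {"mean": mean(means), "sd": stdev(means), "skew": skewness(means)}
    print(f"n={n:2d}: mean of sample means = {results[n]['mean']:.3f}, "
          f"SD of sample means = {results[n]['sd']:.3f} "
          f"(sigma/sqrt(n) = {sigma / sqrt(n):.3f}), "
          f"skewness = {results[n]['skew']:.2f}")
```

In a run like this, the SD of the simulated sample means should be close to sigma/sqrt(n) for both sample sizes, while the skewness should be noticeably closer to 0 for the larger n.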


Ex3. Let X be the number of major defects for each new automobile tested. Suppose the number of such defects for a certain model has mean $\mu$ = 3.2 and SD $\sigma$ = 2.4. A sample of 100 new cars is collected.

(a) What is the sampling distribution of $\bar{X}$ based on samples of size 100? What is its center, and what is the SE of $\bar{X}$?

(b) What is the probability that the sample average number of major defects exceeds 4?
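
A numerical check for Ex3 might look like the sketch below. It assumes that n = 100 is large enough for the CLT's normal approximation to apply, so the answer is an approximation rather than an exact probability; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 3.2, 2.4, 100     # values stated in Ex3

se = sigma / sqrt(n)             # standard error of the sample mean
xbar_dist = NormalDist(mu=mu, sigma=se)   # CLT approximation (assumed to apply)

# Upper-tail probability P(sample mean > 4).
prob = 1 - xbar_dist.cdf(4)
z = (4 - mu) / se
print(f"SE = {se:.2f}, z = {z:.2f}, P(sample mean > 4) = {prob:.5f}")
```

The cutoff of 4 is more than 3 SEs above the mean, so the probability should come out very small.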

❖  Comments:

·  If $\bar{X}$ is the sample mean of a random sample from a population with mean $\mu$ and SD $\sigma$, then regardless of the sample size n and the distribution of X, $\mu_{\bar{X}} = \mu$ and $\sigma_{\bar{X}} = \sigma/\sqrt{n}$

·  The variation of sample means is ______ ($\sigma_{\bar{X}} \le \sigma$) than the variation of the original data

·  As the sample size n increases, $\sigma_{\bar{X}}$ (the SE of $\bar{X}$) ______, and the shape of the sampling distribution becomes ______. This implies higher probability around its mean $\mu$.

Ex4. The heights of college-age students (denoted by X) are known to have mean $\mu$ = 115 and SD $\sigma$ = 30.

(a) What is the sampling distribution of $\bar{X}$, the average height of 36 college-age students? What are the mean and SE of the sampling distribution of $\bar{X}$?

(b) What is the sampling distribution of $\bar{X}$ based on samples of 9 college-age students? What are the mean and SE of the sampling distribution of $\bar{X}$?

(c) Assume that we were told that the heights of college-age students are normally distributed. What is the sampling distribution of $\bar{X}$ based on samples of 9 college-age students? What are the mean and SE?
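
The SE part of Ex4 is plain arithmetic and can be sketched as below. The code only computes the SE (population SD divided by the square root of n) for the two sample sizes; whether a normal shape is justified for the small sample depends on the population, which the code does not address.

```python
from math import sqrt

mu, sigma = 115, 30              # values stated in Ex4

# SE of the sample mean for each sample size in the exercise.
standard_errors = {n: sigma / sqrt(n) for n in (9, 36)}

for n, se in standard_errors.items():
    print(f"n = {n:2d}: mean of sample mean = {mu}, SE = {se:.1f}")
```

Quadrupling the sample size from 9 to 36 halves the SE, since the SE shrinks with the square root of n.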


IV. Sampling Distribution of a Sample Proportion p

Ex. Consider a basket containing 100 balls with 2 colors: Red and White. The proportion of Red balls is denoted by $\pi$ (and is not known). Assume 20 balls were randomly picked from the basket with replacement, and 14 balls out of the 20 balls were red.

(1)  In the sample, what is the proportion of red balls?

(2)  We refer to such a quantity as ______ and denote it by ______.

(Note that ______) Our question of interest: what is the distribution of the sample proportion p?

Thoughts: we can think of a random variable X such that X = 1 if “red” and X = 0 if “not red”.

Then p can be viewed as ______.

That is, p is ______.

Thus by ______, p ~ ______ if n is large. (However, different criteria for “large n” are needed here.)
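
The 0/1 coding described above can be tried out on the basket example. In the sketch below only the counts (14 red out of 20 draws) come from the example; the order of the draws in the list is hypothetical.

```python
# Code each draw as 1 if "red" and 0 if "not red".
# Only the totals (14 red, 6 not red) come from the example; the order is made up.
draws = [1] * 14 + [0] * 6

n = len(draws)
proportion_red = draws.count(1) / n   # number of reds divided by n
average_of_x = sum(draws) / n         # ordinary average of the 0/1 values

print(f"proportion of red = {proportion_red}, average of the 0/1 values = {average_of_x}")
```

The two quantities are computed in different ways but agree, which is the link the "Thoughts" above are pointing at.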

Sampling Distribution of p

(a) If n is large (i.e., ______), then the sample proportion p has

·  A ____________ (by ______)

·  Mean (denoted by $\mu_p$) = $\pi$ (the population proportion), and SE (denoted by $\sigma_p$) = $\sqrt{\pi(1-\pi)/n}$

(b) If n is small (i.e., ______), then the sample proportion p has

·  ______

·  Mean (denoted by $\mu_p$) = ______, and SE (denoted by $\sigma_p$) = ______


Ex5. In the population, the proportion of defectives is $\pi$ = 12%.

(a)  What is the sampling distribution of p based on 100 observations? What is the mean? What is the standard error?

(b)  What is the probability that p < 0.05?
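
A numerical check for Ex5 is sketched below. It assumes that 100 observations count as "large n", so that the sample proportion is approximately normal with mean equal to the population proportion and SE equal to sqrt(proportion × (1 − proportion) / n); no continuity correction is applied. The name `pi_` for the population proportion is an illustrative choice.

```python
from math import sqrt
from statistics import NormalDist

pi_, n = 0.12, 100               # population proportion and sample size from Ex5

se = sqrt(pi_ * (1 - pi_) / n)   # standard error of the sample proportion
p_dist = NormalDist(mu=pi_, sigma=se)   # normal approximation (assumed to apply)

# Lower-tail probability P(sample proportion < 0.05).
prob = p_dist.cdf(0.05)
z = (0.05 - pi_) / se
print(f"SE = {se:.4f}, z = {z:.2f}, P(sample proportion < 0.05) = {prob:.4f}")
```

Because this rests on a normal approximation, the result is approximate; an exact answer would come from the binomial distribution of the count of defectives.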
