LT 8.3: Estimating a Population Mean
How long can you expect an AA battery to last? What proportion of college undergraduates have engaged in binge drinking? Is caffeine dependence real? Rather than collect every battery or ask all college undergraduates, we use a sample to answer our questions. We will then use our sample data to make some conclusions about the population.
- Confidence Intervals
Example #1: The admissions director at Big Bucks University wants to market his school using the IQ score of current students. He chooses a SRS of 50 students from the school’s 5000 freshmen and administers an IQ test. The mean of the sample is 112. What can the director say about the mean score of the population of all 5000 freshmen? Is the mean IQ of all freshmen 112? Somewhere close to 112? How do we know?
How close to 112 islikely to be? How would the sample mean vary (suppose the standard deviation is 15 for the population) if we took many samples of 50 freshmen from this same population?
One-Sample z Interval for a Population Mean
Draw an SRS of size n from a population having unknown mean μ and known standard deviation σ. As long as the Normal and Independent conditions are met, a level C confidence interval for μ is:
The critical value z* is found from the standard Normal distribution.
This method isn’t very useful in practice, however in most real-world settings. If we don’t know the population mean μ, then we don’t know the population standard deviation σ either. But we can use the one-sample z interval for a population mean to estimate the sample size needed to achieve a specified margin of error. The process mimics what we did for a population proportion.
The Reasoning behind Statistical Estimation in a Nutshell
- To estimate the unknown population meanμ, use the meanof our random sample.
- Although is an unbiased estimate ofμ, it will rarely be exactly equal toμ, so our estimate has some error.
- In repeated samples, the values offollow approximately a normal distribution with meanμand standard deviation of 2.1 ().
- The 95% part of the 68%-95%-99.7% rule for Normal distributions says that in about 95% of all samples, the mean IQ score for the sample will be within +/- 2 standard deviations or 4.2 points of the population mean.
- Wheneveris within 4.2 points ofμ,μis also within 4.2 points of. This happens in about 95% of all possible samples. So the unknownμlies between - 4.2 and + 4.2 in about 95% of all samples.
Sampling Distribution for the Mean IQ score of an SRS of Size 50.
In 95% of all samples, the mean lies within +/- 4.2 of the unknown population mean.
To say that +/- 4.2 is a 95% confidence interval for the population mean μis to say that, in repeated samples, 95% of these intervals capture μ.
Conclusion: Our sample of 50 freshmen gave =112. The resulting 95% confidence interval is 112 +/- 4.2 which can be written as (107.8, 116.2). We are 95% confident that the unknown mean IQ score for all university freshmen is between 107.8 and 116.2.
The interval we give is called the Confidence Interval and the 95% is the Confidence Level. The plus or minus 4.2 is the Margin of Error (ME).
The calculation of our interval is based on three assumptions:
Finding Nemo and Finding z*--Constructing a Confidence Interval
To construct an 80% confidence interval, we must catch the central 80% of the Normal sampling distribution of . In catching the central 80%, we leave out 20% or 10% in each tail. So z* is the point with 0.1 area to the right and 0.9 area to the left under the standard Normal curve. Looking at Table A the closest entry with 0.9 to the left is z*=1.28. 80%of the area under the normal curve lies between -1.28 and 1.28 standard deviations from the mean.
- Confidence Interval for a Population MeanwhereσisKnown)
Example #2: Video Terminals
A manufacturer of high resolution video terminals must control the tension on the mesh of fine wires that lies behind the surface of the viewing screen. Some variation is inherent in the production process. The standard deviation of the tension readings is = 43mV. Here are the readings from an SRS of 20:
269.5297.0269.6283.3304.8280.4233.5257.4317.5327.4
264.7307.7310.0343.3328.1342.6338.8340.1374.6336.1
Construct and interpret a 90% confidence interval for the mean tensionμof all the screens produced on this day using thePANICprocedure.
SOCS, BINS, BITS, DOFS & now PANIC (PHANTOM Comes Next)
Step 1--P: Population & Parameter. Identify the population of interest and the parameter you want to draw conclusions about.
Step 2--A: Assumptions & Conditions--Choose the appropriate inference procedure. Verify conditions for using it.
- SRS:
- Normality:
- Independence:
Normal: Since the sample size is small (n = 20), we must check whether it’s reasonable to believe that the population distribution is Normal. So we examine the sample data. The figure below shows (a) a dotplot, (b) a boxplot, and (c) a Normal probability plot of the tension readings in the sample. Neither the dotplot nor the boxplot shows strong skewness or any outliers. The Normal probability plot looks roughly linear. These graphs give us no reason to doubt the Normality of the population.
In Chapter 2, we noted that a data set with an approximately Normal shape will have a Normal probability plot that’s roughly linear. Now we’re trying to get information about the shape of the population distribution from a Normal probability plot of the sample data. This is much harder, because even samples drawn from perfectly Normal populations don’t always look Normal.
Step 3--N: Name & Formula. What is the name and formula for the confidence interval method you are using?
Step 4—I: (Confidence) Interval = Estimate +/- Margin of Error (ME)
Step 5—C: Conclusion in Context. Interpret your results in the context of the problem. Remember: Conclusion, connection and context.
Just a reminder…Larger samples give smaller intervals because the standard deviation is being divided by a larger value.
Example #3: How Confidence Intervals Behave
Suppose the manufacturer of the screens in the example wants 99% confidence rather than 90% confidence. The critical value for 99% confidence is z* = ______. The 99% confidence interval for based on an SRS of 20 with mean = 306.3 is:
Demanding 99% confidence instead of 90% confidence has increased the margin of error from ______to ______.
Example #4: Sample Size for a Desired Margin of Error
Researchers would like to estimate the mean cholesterol level ofμof a particular variety of monkey that is often used in laboratory experiments. They would like their estimate to be within 1 mg/dl of blood of the true value ofat a 95% confidence level. A previous study involving this variety of monkey suggests that the standard deviation of cholesterol level is about 5 mg/dl. Obtaining monkeys is time-consuming and expensive, so the researchers want to know the minimum number of monkeys they will need to generate a satisfactory estimate.
To determine the sample size n that will yield a confidence interval for a population mean with a specified margin of error (ME), set the expression for the margin of error to be less than or equal to m and solve for n.
Sample Size for a Desired Margin of Error When Estimating μ
To determine the sample size n that will yield a level C confidence interval for a population mean with a specified margin of error ME:
- Get a reasonable value for the population standard deviation σ from an earlier or pilot study.
- Find the critical value z* from a standard Normal curve for confidence level C.
- Set the expression for the margin of error to be less than or equal to ME and solve for n:
For 95% confidence, the table gives z* = 1.96. We know that = 5. Set the margin of error to be at most 1 and solve for n.
Always round up to the next whole number when finding n. Notice that the size of the sample determines the margin of error. The size of the population does not influence the sample size as long as the population is much larger than the sample)
C. When σ Is Unknown: The t Distributions
When the sampling distribution of is close to Normal, we can find probabilities involving by standardizing:
Recall that the sampling distribution of has mean μand standard deviation . What are the shape, center, and spread of the sampling distribution of the new statistic z? z has the standard Normal distribution N(0, 1). Therefore, we can use Table A or a calculator to find the related probability involving z. That’s how we have gotten the critical values for our confidence intervals so far.
When we don’t know σ, we estimate it using the sample standard deviation sx. What happens now when we standardize?
This new statistic does not have a standard Normal distribution.
When doing inference about a population mean μ, what happens when we use the sample standard deviation sx to estimate the population standard deviation σ? We’ll start with a Normal population having mean μ = 100 and standard deviation σ = 5.
The figure below shows the results of taking 500 SRSs of size n = 4 and standardizing the value of the sample mean . The values of z follow a standard Normal distribution, as expected. The standardized values using the sample standard deviation sx in place of the population standard deviation σ, show much greater spread. In fact, in a few samples, the statistic:
took values below −6 or above 6. This statistic has a distribution that is new to us, called a t distribution. It has a different shape than the standard Normal curve: still symmetric with a single peak at 0, but with much more area in the tails.
The statistic t has the same interpretation as any standardized statistic: it says how far is from its mean μ in standard deviation units. There is a different t distribution for each sample size. We specify a particular t distribution by giving its degrees of freedom (df). When we perform inference about a population mean μ using a t distribution, the appropriate degrees of freedom are found by subtracting 1 from the sample size n, making df = n − 1. We will write the t distribution with n − 1 degrees of freedom as tn−1 for short.
Thet Distributions; Degrees of Freedom
Draw an SRS of size n from a large population that has a Normal distribution with mean μ and standard deviation σ. The statistic:
has the t distribution with degrees of freedomdf = n − 1. This statistic will have approximately a tn−1 distribution as long as the sampling distribution of is close to Normal.
Think of degrees of freedom as a way of keeping score. A data set contains a number of observations, say, n. They constitute n individual pieces of information. These pieces of information can be used either to estimate parameters or variability. In general, each item being estimated costs one degree of freedom. The remaining degrees of freedom are used to estimate variability. All we have to do is count properly.
For a single sample: There are n observations. There's one parameter (the mean) that needs to be estimated. That leaves n-1 degrees of freedom for estimating variability. The degree of freedom is the number of values in a calculation that we can vary.
The figure to the right compares the density curves of the standard Normal distribution and the tdistributions with 2 and 9 degrees of freedom. The figure illustrates these facts about the tdistributions:
- The density curves of the tdistributions are similar in shape to the standard Normal curve. They are symmetric about 0, single-peaked, and bell-shaped.
- The spread of the t distributions is a bit greater than that of the standard Normal distribution. The t distributions have more probability in the tails and less in the center than does the standard Normal. This is true because substituting the estimate sx for the fixed parameter σ introduces more variation into the statistic.
- As the degrees of freedom increase, the t density curve approaches the standard Normal curve ever more closely. This happens because sx estimates σmore accurately as the sample size increases. So using sx in place of σ causes little extra variation when the sample is large.
Table B in the back of the book gives critical values t* for the tdistributions. Each row in the table contains critical values for the tdistribution whose degrees of freedom appear at the left of the row. For convenience, several of the more common confidence levels C (in percents) are given at the bottom of the table. By looking down any column, you can check that the t critical values approach the Normal critical values z* as the degrees of freedom increase.
Example # 5: Finding t*--Using Table B
Suppose you want to construct a 95% confidence interval for the mean μ of a Normal population based on an SRS of size n = 12. What critical value t* should you use?
In Table B, we consult the row corresponding to df = n − 1 = 11. We move across that row to the entry that is directly above 95% confidence level on the bottom of the chart. The desired critical value is t* = 2.201.
Notice that the corresponding standard Normal critical value for 95% confidence is z* = 1.96. We have to go out farther than 1.96 standard deviations to capture the central 95% of the t distribution with 11 degrees of freedom.
D. Constructing a Confidence Interval for μ
When the conditions for inference are satisfied, the sampling distribution of has roughly a Normal distribution with mean μ and standard deviation/. Because we don’t know σ, we estimate it by the sample standard deviation sx.
We then estimate the standard deviation of the sampling distribution by sx/. This value is called the standard error of the sample mean, or just the standard error of the mean.
DEFINITION: Standard Error of the Sample Mean
The standard error of the sample meanissx/where sxis the sample standard deviation. It describes how far will be from μ, on average, in repeated SRSs of size n.
To construct a confidence interval for μ, replace the standard deviation/ of by its standard error sx/in the formula for the one-sample z interval for a population mean. Use critical values from the tdistribution with n − 1 degrees of freedom in place of the z critical values. That is,
Statistic +/- (Critical Value) • (Standard Deviation of Statistic) = +/- t* (sx/
This one-sample t interval for a population mean is similar in both reasoning and computational detail to the one-sample z interval for a population proportion.
The One-Sample t Interval for a Population Mean
Choose an SRS of size n from a population having unknown mean μ. A level C confidence interval for μ is:
+/- t* (sx/
where t* is the critical value for the tn−1 distribution. Use this interval only when (1) the population distribution is Normal or the sample size is large (n ≥ 30), and (2) the population is at least 10 times as large as the sample.
As before, we have to verify three important conditions before we estimate a population mean. When we do inference in practice, verifying the conditions is often a bit more complicated.
Conditions for Inference about a Population Mean
- Random: The data come from a random sample of size n from the population of interest or a randomized experiment. This condition is very important.
- Normal: The population has a Normal distribution or the sample size is large (n ≥ 30).
- Independent: The method for calculating a confidence interval assumes that individual observations are independent. To keep the calculations reasonably accurate when we sample without replacement from a finite population, we should check the 10% condition: verify that the sample size is no more than 1/10 of the population size.
When we use Table B to determine the correct value of t* for a given confidence interval, all we need to know are the confidence level C and the degrees of freedom (df). Unfortunately, Table B does not include every possible sample size. When the actual df does not appear in the table, use the greatest df available that is less than your desired df. This guarantees a wider confidence interval than we need to justify a given confidence level. Better yet, use technology to find an accurate value of t*for any df.
Example #6: Auto Pollution--A one-sample t interval for μ
Environmentalists, government officials, and vehicle manufacturers are all interested in studying the auto exhaust emissions produced by motor vehicles.
The major pollutants in auto exhaust from gasoline engines are hydrocarbons, carbon monoxide, and nitrogen oxides (NOX). Researchers collected data on the NOX levels (in grams/mile) for a random sample of 40 light-duty engines of the same type. The mean NOX reading was 1.2675 and the standard deviation was 0.3332.
- Construct and interpret a 95% confidence interval for the mean amount of NOX emitted by light-duty engines of this type.
STATE: We want to estimate the true mean amount μ of NOX emitted by all light-duty engines of this type at a 95% confidence level.
PLAN: We should construct a one-sample t interval for μ if the conditions are met:
- Random: The data come from a “random sample” of 40 engines from the population of all light-duty engines of this type.
- Normal: We don’t know whether the population distribution of NOX emissions is Normal. Because the sample size, n = 40, is large (at least 30), we should be safe using t procedures.
- Independent: We are sampling without replacement, so we need to check the 10% condition: we must assume that there are at least 10(40) = 400 light-duty engines of this type.
DO: The formula for the one-sample tinterval is: +/- t* (sx/
The calculator command invT(.025,39) gives t = –2.023. Using the critical value t* = ± 2.023 for the 95% confidence interval gives:
1.2675 +/- 2.2023(0.332)/ = 1.2675 +/- 0.0166 = (1.1609, 1.3741)
CONCLUDE: We are 95% confident that the interval from 1.1599 to 1.3751 grams/mile contains the true mean level of nitrogen oxides emitted by this type of light-duty engine.
- The environmental Protection Agency (EPA) sets a limit of 1.0 gram/mile for NOX emissions. Are you convinced that this type of engine has a mean NOX level of 1.0 or less? Use your interval from (a) to support your answer.
The confidence interval from (a) tells us that any value from 1.1609 to 1.3741 g/mi is a plausible value of the mean NOX level μ for this type of engine. Since the entire interval exceeds 1.0, it appears that this type of engine violates EPA limits.
E. Using t Procedures Wisely
The stated confidence level of a one-sample t interval for μ is exactly correct when the population distribution is exactly Normal. No population of real data is exactly Normal. The usefulness of the t procedures in practice therefore depends on how strongly they are affected by lack of Normality.