Revised May1, 2004
Professor Ahmadi’s Lecture NotesPage 1
Chapter 1
Glossary of Terms:
Statistics
Data
Data Set
Elements
Variable
Observations
Sample and Population
Descriptive Statistics
Statistical Inference
Qualitative and Quantitative Data
Scales of Measurement:
Nominal Scale
Ordinal Scale
Interval Scale
Ratio Scale
Chapter 2
Summarizing Quantitative Data
Daily earnings of a sample of twelve individuals are shown below:
100, 126, 138, 142, 148, 150, 168, 182, 191, 193, 195, 199
Summarize the above data by constructing:
a.a frequency distribution
b.a cumulative frequency distribution
c.a relative frequency distribution
d.a cumulative relative frequency distribution
e.a histogram
f.an ogive
cumulativerelativecumulative
Classfrequencyfrequencyfrequencyrelative frequency
100 - 119
120 - 139
140 - 159
160 - 179
180 - 199
DOT PLOT
In a recent campaign, many airlines reduced their summer fares in order to gain a larger share of the market. The following data represent the prices of round-trip tickets from Atlanta to Boston for a sample of nine airlines:
120 / 140 / 140160 / 160 / 160
160 / 180 / 180
Construct a dot plot for the above data.
STEM-AND-LEAF DISPLAY
The test scores of 14 individuals on their first statistics examination are shown below:
95 87 52 43 77 84 78
75 63 92 81 83 91 88
a.Construct a stem-and-leaf display for these data.
b.What does the above stem-and-leaf show?
CROSSTABULATION
The following is a crosstabulation of starting salaries (in $1,000's) of a sample of business school graduates by their gender.
Starting SalaryGender / Less than 30 / 30 up to 35 / 35 and more / Total
Female / 12 / 84 / 24 / 120
Male / 20 / 48 / 12 / 80
Total / 32 / 132 / 36 / 200
a.What general comments can be made about the distribution of starting salaries and the gender of the individuals in the sample?
b.Compute row percentages and comment on the relationship between starting salaries and gender.
SCATTER DIAGRAM
The average grades of 8 students in professor Ahmadi’s statistics class and the number of absences they had during the semester are shown below:
Number of Absences / AverageGradeStudent / (x) / (y)
1 / 1 / 94
2 / 2 / 78
3 / 2 / 70
4 / 1 / 88
5 / 3 / 68
6 / 4 / 40
7 / 8 / 30
8 / 3 / 60
Develop a scatter diagram for the relationship between the number of absences (x) and their average grade (y).
Chapter 3 Formulas
Ungrouped DataSAMPLEPOPULATION
Mean
where n = sample sizewhere N = size of population
Interquartile Range
IQR = Q3 - Q1(Same as for sample)
where:Q3 = third quartile (i.e., 75th percentile)
Q1 = first quartile (i.e., 25th percentile)
Variance
or:or:
Standard Deviation
Coefficient of Variation (C.V.)
Covariance
Pearson Product Moment Correlation Coefficient
SAMPLEPOPULATION
where where
= Sample correlation coefficient = Population correlation coefficient
= Sample covariance = Population covariance
SX = Sample standard deviation of X Population standard deviation of X
= Sample standard deviation of Y Population standard deviation of Y
Weighted Meanwhere
Xi = data value i
wi = weight for data value i
Grouped DataMean
where
fi = frequency of class i
Mi = midpoint of class i
Variance
)
or
Chapter 3
Measures of Location & Dispersion (Ungrouped Data)
Hourly earnings (in dollars) of a sample of eight employees of Ahmadi, Inc. is shown below:
Individual / Earning (X)1 / 12
2 / 15
3 / 15
4 / 17
5 / 18
6 / 19
7 / 22
8 / 26
I.Measures of location
a.Compute the mean and explain and show its properties.
b.Determine the median and explain its properties.
c.Determine the 70th percentile.
d.Determine the 25th percentile.
eFind the mode.
II.Compute the followingmeasures of dispersionfor the above data:
a.Range
b.Interquartile range
c.Variance & the Standard deviation
- Coefficient of variation
- A sample of Chatt, Inc. employees had a mean of $21 and a standard deviation of $5. Which company shows a more dispersed data distribution?
f.Use “Descriptive Statistics” in Excel and determine all the statistical measures.
Chapter 3
Five-Number Summary
The weights of 12 individuals who enrolled in a fitness program are shown below:
IndividualWeight (Pounds)
1 100
2 105
3 110
4 130
5 135
6 138
7 142
8 145
9 150
10 170
11 240
12 300
a. Provide a five-number summary for the data.
b.Show the box plot for the weight data.
Chapter 3
Covariance & Coefficient of Correlation
The average grades of a sample of 8 students in professor Ahmadi’s statistics class and the number of absences they had during the semester are shown below.
Number of Absences / Average GradeStudent / () / ()
1 / 1 / 94
2 / 2 / 78
3 / 2 / 70
4 / 1 / 88
5 / 3 / 68
6 / 4 / 40
7 / 8 / 30
8 / 3 / 60
TOTAL / 24 / 528
a.Compute the sample covariance and interpret its meaning.
b.Compute the sample coefficient of correlation and interpret its meaning.
Chapter 3
Weighted Mean
The Michael Ahmadi Oil Company has purchased barrels of oil from several suppliers. The purchase price per barrel and the number of barrels purchased are shown below.
SupplierPrice Per Barrel ($)Number of Barrels
A174,000
B193,000
C189,000
D1620,000
Compute the weighted average price per barrel.
Chapter 3
Measures of Location & Dispersion (Grouped Data)
The yearly income distribution for a sample of 30 Ahmadi, Inc. employees is shown below.
Yearly IncomeFrequency
(In $10,000)fi
4 - 62
7 - 96
10 - 127
13 - 1510
16 - 185
Totalsn = 30
a.Compute the mean yearly income.
- Compute the variance and the standard deviation of the sample.
- A sample of Chatt, Inc. employees had a mean income of $132,000 with a standard deviation of $36,000. Which company shows a more dispersed income distribution?
Chapter 4 Formulas
Counting Rule for Multiple-step Experiments:
Total number of outcomes =
The number of Combinations of N objects taken n at a time:
Sum of the probability of Event A and its Complement:P(A) + P(Ac) = 1.0
Addition Law (the probability of the union of two events):
P(A B) = P(A) + P(B) - P(A B)
Multiplication Law (the probability of the intersection of two events):
P(A B) = P(A) P(B|A)orP(A B) = P(B) P(A|B)
Two Events A and B are Independent if:
P(A|B) = P(A)`orP(B|A) = P(B)
Multiplication Law for Independent Events:P(A B) = P(A)P(B)
Conditional Probability:
P(A|B) = orP(B|A) =
Bayes' Theorem in General:
P(Ai|B) =
Summary of Bayes' Theorem Calculations:
PriorConditionalJointPosterior
ProbabilitiesProbabilitiesProbabilitiesProbabilities
EventP(Ai)P(B|Ai)P(Ai B)P(Ai|B)
Chapter 4
Basic Probability Concepts
1.Assume you have applied to two different universities (let's refer to them as universities A and B) for your graduate work. In the past, 25% of students (with similar credentials as yours) who applied to university A were accepted; while university B had accepted 35% of the applicants (Assume events are independent of each other).
a.What is the probability that you will be accepted in both universities?
b.What is the probability that you will be accepted to at least one graduate program?
c.What is the probability that one and only one of the universities will accept you?
d.What is the probability that neither university will accept you?
2.In the two upcoming basketball games, the probability that UTC will defeat Marshall is 0.63, and the probability that UTC will defeat Furman is 0.55. The probability that UTC will defeat both opponents is 0.3465.
a.What is the probability that UTC defeats Furman given that they defeat Marshall?
b.Are the outcomes of the games independent? Explain and substantiate your answer.
c.What is the probability that UTC wins at least one of the games?
d.What is the probability of UTC winning both games?
Chapter 4
Conditional Probability
A research study investigating the relationship between smoking and heart disease in a sample of 500 individuals provided the following data:
Smoker / Nonsmoker / TotalRecord of Heart Disease / 50 / 40 / 90
No Record of Heart Disease / 100 / 310 / 410
Total / 150 / 350 / 500
a.Show the joint probability table.
b.What is the probability that an individual is a smoker and has a record of heart disease?
c.Compute and interpret the marginal probabilities.
d.Given that an individual is a smoker, what is the probability that this individual has heart disease?
e.Given that an individual is a nonsmoker, what is the probability that this individual has heart disease?
f.Does the research show that heart disease and smoking are independent events? Use probabilities to justify your answer.
g.What conclusion would you draw about the relationship between smoking and heart disease?
Chapter 4
BAYES' THEOREM
When Ahmadi, Inc. sets up their drill press machine, 70% of the time it is set up correctly. It is known that if the machine is set up correctly it produces 90% acceptable parts. On the other hand, when the machine is set up incorrectly, it produces 20% acceptable parts. One item from the production is selected and is observed to be acceptable.
a.What is the probability that the machine is set up correctly? That is, we are interested in computing:
P(Correct set up Acceptable part).
Let the following symbols represent the various events:
E1 = Correct set up
E2 = Incorrect set up
G = Good part (i.e., Acceptable part)
With the above notations we want to determine P(E1 G).
b.Compute all the posterior probabilities.
Professor Ahmadi’s Lecture NotesPage 1