Revised May 1, 2004

Professor Ahmadi’s Lecture Notes

Chapter 1

Glossary of Terms:

Statistics

Data

Data Set

Elements

Variable

Observations

Sample and Population

Descriptive Statistics

Statistical Inference

Qualitative and Quantitative Data

Scales of Measurement:

Nominal Scale

Ordinal Scale

Interval Scale

Ratio Scale

Chapter 2

Summarizing Quantitative Data

Daily earnings of a sample of twelve individuals are shown below:

100, 126, 138, 142, 148, 150, 168, 182, 191, 193, 195, 199

Summarize the above data by constructing:

a. a frequency distribution

b. a cumulative frequency distribution

c. a relative frequency distribution

d. a cumulative relative frequency distribution

e. a histogram

f. an ogive

Class / Frequency / Cumulative Frequency / Relative Frequency / Cumulative Relative Frequency
100 - 119
120 - 139
140 - 159
160 - 179
180 - 199
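
The table columns can be filled in mechanically once the class limits are fixed. The following is a minimal Python sketch, assuming the five classes listed in the table are inclusive at both ends; the function name `frequency_table` is illustrative and not part of the notes.

```python
# Daily earnings from the exercise above.
earnings = [100, 126, 138, 142, 148, 150, 168, 182, 191, 193, 195, 199]

# Class limits as listed in the worksheet table (assumed inclusive on both ends).
classes = [(100, 119), (120, 139), (140, 159), (160, 179), (180, 199)]


def frequency_table(data, class_limits):
    """Return one row per class: (label, f, cumulative f, relative f, cumulative relative f)."""
    n = len(data)
    rows = []
    cumulative = 0
    for lower, upper in class_limits:
        freq = sum(lower <= x <= upper for x in data)
        cumulative += freq
        rows.append((f"{lower} - {upper}", freq, cumulative, freq / n, cumulative / n))
    return rows


table = frequency_table(earnings, classes)
for label, f, cf, rf, crf in table:
    print(f"{label:<10} {f:>3} {cf:>3} {rf:>6.3f} {crf:>6.3f}")
```

The frequency column gives the bar heights for the histogram, and the cumulative column plotted against the upper class limits gives the ogive.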

DOT PLOT

In a recent campaign, many airlines reduced their summer fares in order to gain a larger share of the market. The following data represent the prices of round-trip tickets from Atlanta to Boston for a sample of nine airlines:

120 / 140 / 140
160 / 160 / 160
160 / 180 / 180

Construct a dot plot for the above data.

STEM-AND-LEAF DISPLAY

The test scores of 14 individuals on their first statistics examination are shown below:

95 87 52 43 77 84 78

75 63 92 81 83 91 88

a. Construct a stem-and-leaf display for these data.

b. What does the above stem-and-leaf display show?
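
As a rough check on a hand-drawn display, the sketch below builds a stem-and-leaf display in Python, assuming the tens digit is the stem and the units digit is the leaf. The helper name `stem_and_leaf` is hypothetical.

```python
from collections import defaultdict

# Test scores from the exercise above.
scores = [95, 87, 52, 43, 77, 84, 78, 75, 63, 92, 81, 83, 91, 88]


def stem_and_leaf(values):
    """Map each stem (tens digit) to its sorted list of leaves (units digit)."""
    display = defaultdict(list)
    for v in sorted(values):
        display[v // 10].append(v % 10)
    return dict(display)


display = stem_and_leaf(scores)
for stem in range(min(display), max(display) + 1):
    leaves = " ".join(str(leaf) for leaf in display.get(stem, []))
    print(f"{stem} | {leaves}")
```

Printing every stem between the smallest and largest keeps empty stems visible, which matters when judging the shape of the distribution.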

CROSSTABULATION

The following is a crosstabulation of starting salaries (in $1,000's) of a sample of business school graduates by their gender.

Starting Salary
Gender / Less than 30 / 30 up to 35 / 35 and more / Total
Female / 12 / 84 / 24 / 120
Male / 20 / 48 / 12 / 80
Total / 32 / 132 / 36 / 200

a. What general comments can be made about the distribution of starting salaries and the gender of the individuals in the sample?

b. Compute row percentages and comment on the relationship between starting salaries and gender.
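
Row percentages divide each cell by its row total, so the two genders can be compared despite the different group sizes. A minimal Python sketch using the counts from the crosstabulation follows; the dictionary layout and the name `row_percentages` are assumptions made for illustration.

```python
salary_bands = ["Less than 30", "30 up to 35", "35 and more"]

# Cell counts copied from the crosstabulation (row totals are 120 and 80).
counts = {
    "Female": [12, 84, 24],
    "Male": [20, 48, 12],
}


def row_percentages(rows):
    """Convert each row of counts to percentages of that row's total."""
    return {label: [100 * c / sum(row) for c in row] for label, row in rows.items()}


row_pct = row_percentages(counts)
for gender, pcts in row_pct.items():
    cells = ", ".join(f"{band}: {p:.1f}%" for band, p in zip(salary_bands, pcts))
    print(f"{gender}: {cells}")
```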

SCATTER DIAGRAM

The average grades of 8 students in Professor Ahmadi’s statistics class and the number of absences they had during the semester are shown below:

Number of Absences / Average Grade
Student / (x) / (y)
1 / 1 / 94
2 / 2 / 78
3 / 2 / 70
4 / 1 / 88
5 / 3 / 68
6 / 4 / 40
7 / 8 / 30
8 / 3 / 60

Develop a scatter diagram for the relationship between the number of absences (x) and their average grade (y).

Chapter 3 Formulas

Ungrouped Data

SAMPLE / POPULATION

Mean

Sample: $\bar{X} = \dfrac{\sum X_i}{n}$, where n = sample size

Population: $\mu = \dfrac{\sum X_i}{N}$, where N = size of population

Interquartile Range

IQR = Q3 - Q1 (same for sample and population)

where: Q3 = third quartile (i.e., 75th percentile)

Q1 = first quartile (i.e., 25th percentile)

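Variance

Sample: $S^2 = \dfrac{\sum (X_i - \bar{X})^2}{n - 1}$ or: $S^2 = \dfrac{\sum X_i^2 - n\bar{X}^2}{n - 1}$

Population: $\sigma^2 = \dfrac{\sum (X_i - \mu)^2}{N}$ or: $\sigma^2 = \dfrac{\sum X_i^2 - N\mu^2}{N}$

Standard Deviation

Sample: $S = \sqrt{S^2}$

Population: $\sigma = \sqrt{\sigma^2}$

Coefficient of Variation (C.V.)

Sample: $C.V. = \dfrac{S}{\bar{X}} \times 100$

Population: $C.V. = \dfrac{\sigma}{\mu} \times 100$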

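Covariance

Sample: $S_{XY} = \dfrac{\sum (X_i - \bar{X})(Y_i - \bar{Y})}{n - 1}$

Population: $\sigma_{XY} = \dfrac{\sum (X_i - \mu_X)(Y_i - \mu_Y)}{N}$

Pearson Product Moment Correlation Coefficient

Sample: $r_{XY} = \dfrac{S_{XY}}{S_X S_Y}$

Population: $\rho_{XY} = \dfrac{\sigma_{XY}}{\sigma_X \sigma_Y}$

where

$r_{XY}$ = Sample correlation coefficient; $\rho_{XY}$ = Population correlation coefficient

$S_{XY}$ = Sample covariance; $\sigma_{XY}$ = Population covariance

$S_X$ = Sample standard deviation of X; $\sigma_X$ = Population standard deviation of X

$S_Y$ = Sample standard deviation of Y; $\sigma_Y$ = Population standard deviation of Y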

Weighted Mean

$\bar{X} = \dfrac{\sum w_i X_i}{\sum w_i}$

where

$X_i$ = data value i

$w_i$ = weight for data value i

Grouped Data

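Mean

Sample: $\bar{X} = \dfrac{\sum f_i M_i}{n}$

where

$f_i$ = frequency of class i

$M_i$ = midpoint of class i

Variance

Sample: $S^2 = \dfrac{\sum f_i (M_i - \bar{X})^2}{n - 1}$

or

$S^2 = \dfrac{\sum f_i M_i^2 - n\bar{X}^2}{n - 1}$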

Chapter 3

Measures of Location & Dispersion (Ungrouped Data)

Hourly earnings (in dollars) of a sample of eight employees of Ahmadi, Inc. are shown below:

Individual / Earning (X)
1 / 12
2 / 15
3 / 15
4 / 17
5 / 18
6 / 19
7 / 22
8 / 26

I. Measures of location

a. Compute the mean, then explain and show its properties.

b. Determine the median and explain its properties.

c. Determine the 70th percentile.

d. Determine the 25th percentile.

e. Find the mode.

II. Compute the following measures of dispersion for the above data:

a. Range

b. Interquartile range

c. Variance and the standard deviation

d. Coefficient of variation

e. A sample of Chatt, Inc. employees had a mean of $21 and a standard deviation of $5. Which company shows a more dispersed data distribution?

f. Use “Descriptive Statistics” in Excel and determine all the statistical measures.
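
The measures in parts I and II can be cross-checked with a short Python sketch. The `percentile` helper is hypothetical and assumes one common textbook rule: compute i = (p/100)n, round up when i is not a whole number, and average positions i and i + 1 when it is. Other software (including Excel) may interpolate differently and give slightly different percentiles.

```python
import statistics

earnings = [12, 15, 15, 17, 18, 19, 22, 26]  # hourly earnings, in dollars


def percentile(data, p):
    """p-th percentile (0 < p < 100) using the assumed index rule i = (p/100) * n."""
    ordered = sorted(data)
    n = len(ordered)
    if (p * n) % 100 == 0:
        # i is a whole number: average the values in positions i and i + 1.
        i = p * n // 100
        return (ordered[i - 1] + ordered[i]) / 2
    # Otherwise round i up (integer ceiling) and take the value in that position.
    i = -(-p * n // 100)
    return ordered[i - 1]


mean = statistics.mean(earnings)
median = statistics.median(earnings)
mode = statistics.mode(earnings)
data_range = max(earnings) - min(earnings)
iqr = percentile(earnings, 75) - percentile(earnings, 25)
variance = statistics.variance(earnings)  # sample variance, divisor n - 1
std_dev = statistics.stdev(earnings)
cv = 100 * std_dev / mean

# Chatt, Inc. figures quoted in the exercise: mean $21, standard deviation $5.
cv_chatt = 100 * 5 / 21

print(mean, median, mode, percentile(earnings, 70), percentile(earnings, 25))
print(data_range, iqr, round(variance, 4), round(std_dev, 4))
print(round(cv, 2), round(cv_chatt, 2))
```

Comparing the two coefficients of variation, rather than the raw standard deviations, is what answers the dispersion question, because the two samples have different means.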

Chapter 3

Five-Number Summary

The weights of 12 individuals who enrolled in a fitness program are shown below:

Individual / Weight (Pounds)
1 / 100
2 / 105
3 / 110
4 / 130
5 / 135
6 / 138
7 / 142
8 / 145
9 / 150
10 / 170
11 / 240
12 / 300

a. Provide a five-number summary for the data.

b. Show the box plot for the weight data.
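
A minimal Python sketch of the five-number summary follows. It assumes the index rule i = (p/100)n for quartiles (round up if i is fractional, average positions i and i + 1 if i is whole) and the common box-plot convention of limits at 1.5 × IQR beyond the quartiles. Both conventions are assumptions, since the notes do not state which rules to use.

```python
weights = [100, 105, 110, 130, 135, 138, 142, 145, 150, 170, 240, 300]


def percentile(data, p):
    """p-th percentile (0 < p < 100) under the assumed index rule i = (p/100) * n."""
    ordered = sorted(data)
    n = len(ordered)
    if (p * n) % 100 == 0:
        i = p * n // 100  # whole-number index: average positions i and i + 1
        return (ordered[i - 1] + ordered[i]) / 2
    return ordered[-(-p * n // 100) - 1]  # fractional index: round up


# (smallest value, Q1, median, Q3, largest value)
five_number = (
    min(weights),
    percentile(weights, 25),
    percentile(weights, 50),
    percentile(weights, 75),
    max(weights),
)

# Assumed box-plot convention: limits at 1.5 * IQR beyond the quartiles.
iqr = five_number[3] - five_number[1]
lower_limit = five_number[1] - 1.5 * iqr
upper_limit = five_number[3] + 1.5 * iqr
outliers = [w for w in weights if w < lower_limit or w > upper_limit]

print(five_number)
print(lower_limit, upper_limit, outliers)
```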

Chapter 3

Covariance & Coefficient of Correlation

The average grades of a sample of 8 students in Professor Ahmadi’s statistics class and the number of absences they had during the semester are shown below.

Number of Absences / Average Grade
Student / (x) / (y)
1 / 1 / 94
2 / 2 / 78
3 / 2 / 70
4 / 1 / 88
5 / 3 / 68
6 / 4 / 40
7 / 8 / 30
8 / 3 / 60
TOTAL / 24 / 528

a. Compute the sample covariance and interpret its meaning.

b. Compute the sample coefficient of correlation and interpret its meaning.
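
Both quantities follow directly from the deviations about the two sample means. The sketch below is one way to compute them in plain Python, using the divisor n - 1 for a sample; the function names are illustrative.

```python
import math

absences = [1, 2, 2, 1, 3, 4, 8, 3]          # x
grades = [94, 78, 70, 88, 68, 40, 30, 60]    # y


def sample_covariance(x, y):
    """Sum of cross-products of deviations, divided by n - 1."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    return sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / (n - 1)


def sample_correlation(x, y):
    """Sample covariance divided by the product of the sample standard deviations."""
    s_x = math.sqrt(sample_covariance(x, x))  # covariance of x with itself is its variance
    s_y = math.sqrt(sample_covariance(y, y))
    return sample_covariance(x, y) / (s_x * s_y)


s_xy = sample_covariance(absences, grades)
r_xy = sample_correlation(absences, grades)
print(s_xy, round(r_xy, 4))
```

A negative covariance indicates that grades tend to fall as absences rise; the correlation coefficient rescales that to the range -1 to +1 so its strength can be judged.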

Chapter 3

Weighted Mean

The Michael Ahmadi Oil Company has purchased barrels of oil from several suppliers. The purchase price per barrel and the number of barrels purchased are shown below.

Supplier / Price Per Barrel ($) / Number of Barrels
A / 17 / 4,000
B / 19 / 3,000
C / 18 / 9,000
D / 16 / 20,000

Compute the weighted average price per barrel.
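
A short Python sketch of the weighted mean, with the number of barrels as the weight. It assumes the table rows read as prices of 17, 19, 18, and 16 dollars per barrel for 4,000, 3,000, 9,000, and 20,000 barrels respectively; the function name `weighted_mean` is hypothetical.

```python
# Supplier: (price per barrel in dollars, number of barrels) -- assumed reading of the table.
purchases = {
    "A": (17, 4_000),
    "B": (19, 3_000),
    "C": (18, 9_000),
    "D": (16, 20_000),
}


def weighted_mean(pairs):
    """Sum of weight * value divided by the sum of the weights."""
    total_weight = sum(w for _, w in pairs)
    return sum(x * w for x, w in pairs) / total_weight


avg_price = weighted_mean(purchases.values())
simple_avg = sum(price for price, _ in purchases.values()) / len(purchases)
print(round(avg_price, 4), simple_avg)
```

The weighted result sits below the simple average of the four prices because the largest purchase was made at the lowest price.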

Chapter 3

Measures of Location & Dispersion (Grouped Data)

The yearly income distribution for a sample of 30 Ahmadi, Inc. employees is shown below.

Yearly Income (In $10,000) / Frequency (fi)
4 - 6 / 2
7 - 9 / 6
10 - 12 / 7
13 - 15 / 10
16 - 18 / 5
Totals / n = 30

a. Compute the mean yearly income.

b. Compute the variance and the standard deviation of the sample.

c. A sample of Chatt, Inc. employees had a mean income of $132,000 with a standard deviation of $36,000. Which company shows a more dispersed income distribution?
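
For grouped data, each class is represented by its midpoint. The sketch below assumes the classes are 4-6, 7-9, 10-12, 13-15, and 16-18 (in units of $10,000) with frequencies 2, 6, 7, 10, and 5, which sum to the stated n = 30. The function name `grouped_stats` is illustrative.

```python
import math

# (lower limit, upper limit, frequency); income is in units of $10,000.
classes = [(4, 6, 2), (7, 9, 6), (10, 12, 7), (13, 15, 10), (16, 18, 5)]


def grouped_stats(rows):
    """Return (mean, sample variance, sample standard deviation) from class midpoints."""
    n = sum(f for _, _, f in rows)
    midpoints = [(lo + hi) / 2 for lo, hi, _ in rows]
    freqs = [f for _, _, f in rows]
    mean = sum(f * m for f, m in zip(freqs, midpoints)) / n
    variance = sum(f * (m - mean) ** 2 for f, m in zip(freqs, midpoints)) / (n - 1)
    return mean, variance, math.sqrt(variance)


mean, variance, std_dev = grouped_stats(classes)
cv_ahmadi = 100 * std_dev / mean

# Chatt, Inc. figures quoted in the exercise: mean $132,000, standard deviation $36,000.
cv_chatt = 100 * 36_000 / 132_000

print(mean, round(variance, 4), round(std_dev, 4))
print(round(cv_ahmadi, 2), round(cv_chatt, 2))
```

Because the two companies have different mean incomes, the comparison of dispersion rests on the coefficients of variation rather than on the standard deviations alone.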

Chapter 4 Formulas

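Counting Rule for Multiple-step Experiments:

Total number of outcomes = $(n_1)(n_2)\cdots(n_k)$, where $n_i$ is the number of possible outcomes at step i

The number of Combinations of N objects taken n at a time:

$C_n^N = \binom{N}{n} = \dfrac{N!}{n!\,(N - n)!}$

Sum of the probability of Event A and its Complement: $P(A) + P(A^c) = 1.0$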

Addition Law (the probability of the union of two events):

$P(A \cup B) = P(A) + P(B) - P(A \cap B)$

Multiplication Law (the probability of the intersection of two events):

$P(A \cap B) = P(A)\,P(B|A)$ or $P(A \cap B) = P(B)\,P(A|B)$

Two Events A and B are Independent if:

$P(A|B) = P(A)$ or $P(B|A) = P(B)$

Multiplication Law for Independent Events: $P(A \cap B) = P(A)\,P(B)$

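Conditional Probability:

$P(A|B) = \dfrac{P(A \cap B)}{P(B)}$ or $P(B|A) = \dfrac{P(A \cap B)}{P(A)}$

Bayes' Theorem in General:

$P(A_i|B) = \dfrac{P(A_i)\,P(B|A_i)}{P(A_1)\,P(B|A_1) + P(A_2)\,P(B|A_2) + \cdots + P(A_n)\,P(B|A_n)}$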

Summary of Bayes' Theorem Calculations:

Event / Prior Probabilities $P(A_i)$ / Conditional Probabilities $P(B|A_i)$ / Joint Probabilities $P(A_i \cap B)$ / Posterior Probabilities $P(A_i|B)$

Chapter 4

Basic Probability Concepts

1. Assume you have applied to two different universities (let's refer to them as universities A and B) for your graduate work. In the past, 25% of students with credentials similar to yours who applied to university A were accepted, while university B accepted 35% of such applicants. Assume the two events are independent of each other.

a. What is the probability that you will be accepted by both universities?

b. What is the probability that you will be accepted to at least one graduate program?

c. What is the probability that one and only one of the universities will accept you?

d. What is the probability that neither university will accept you?

2. In the two upcoming basketball games, the probability that UTC will defeat Marshall is 0.63, and the probability that UTC will defeat Furman is 0.55. The probability that UTC will defeat both opponents is 0.3465.

a. What is the probability that UTC defeats Furman given that they defeat Marshall?

b. Are the outcomes of the games independent? Explain and substantiate your answer.

c. What is the probability that UTC wins at least one of the games?

d. What is the probability of UTC winning both games?
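
Both problems apply the addition and multiplication laws directly. The following Python sketch works through them with the probabilities given in the problem statements; the variable names are illustrative, and `math.isclose` is used for the independence check only to allow for floating-point rounding.

```python
import math

# Problem 1: acceptance at universities A and B, assumed independent as stated.
p_a, p_b = 0.25, 0.35
both = p_a * p_b                                  # multiplication law, independent events
at_least_one = p_a + p_b - both                   # addition law
exactly_one = p_a * (1 - p_b) + (1 - p_a) * p_b   # A only, or B only
neither = (1 - p_a) * (1 - p_b)                   # complement of "at least one"

# Problem 2: UTC versus Marshall (M) and Furman (F).
p_m, p_f, p_m_and_f = 0.63, 0.55, 0.3465
p_f_given_m = p_m_and_f / p_m                     # conditional probability
independent = math.isclose(p_f_given_m, p_f)      # independent if P(F|M) = P(F)
wins_at_least_one = p_m + p_f - p_m_and_f

print(both, at_least_one, exactly_one, neither)
print(p_f_given_m, independent, wins_at_least_one)
```

In Problem 1 the answers to parts b and d must sum to 1, which is a quick consistency check on the arithmetic.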

Chapter 4

Conditional Probability

A research study investigating the relationship between smoking and heart disease in a sample of 500 individuals provided the following data:

Smoker / Nonsmoker / Total
Record of Heart Disease / 50 / 40 / 90
No Record of Heart Disease / 100 / 310 / 410
Total / 150 / 350 / 500

a. Show the joint probability table.

b. What is the probability that an individual is a smoker and has a record of heart disease?

c. Compute and interpret the marginal probabilities.

d. Given that an individual is a smoker, what is the probability that this individual has heart disease?

e. Given that an individual is a nonsmoker, what is the probability that this individual has heart disease?

f. Does the research show that heart disease and smoking are independent events? Use probabilities to justify your answer.

g. What conclusion would you draw about the relationship between smoking and heart disease?
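
The joint, marginal, and conditional probabilities all come from dividing the table counts appropriately. A minimal Python sketch, with the counts keyed by (disease status, smoking status), follows; the key labels and variable names are assumptions made for illustration.

```python
# Counts from the table above, keyed by (heart disease status, smoking status).
counts = {
    ("Heart disease", "Smoker"): 50,
    ("Heart disease", "Nonsmoker"): 40,
    ("No heart disease", "Smoker"): 100,
    ("No heart disease", "Nonsmoker"): 310,
}

total = sum(counts.values())
joint = {cell: n / total for cell, n in counts.items()}  # joint probability table

# Marginal probabilities: sum the joint probabilities over the other variable.
p_smoker = sum(p for (_, smoking), p in joint.items() if smoking == "Smoker")
p_nonsmoker = sum(p for (_, smoking), p in joint.items() if smoking == "Nonsmoker")
p_disease = sum(p for (disease, _), p in joint.items() if disease == "Heart disease")

# Conditional probabilities: joint probability divided by the marginal of the condition.
p_disease_given_smoker = joint[("Heart disease", "Smoker")] / p_smoker
p_disease_given_nonsmoker = joint[("Heart disease", "Nonsmoker")] / p_nonsmoker

print(joint)
print(p_smoker, p_nonsmoker, p_disease)
print(round(p_disease_given_smoker, 4), round(p_disease_given_nonsmoker, 4))
```

If the two events were independent, both conditional probabilities would equal the marginal probability of heart disease; comparing the three numbers is one way to justify the answer to part f.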

Chapter 4

BAYES' THEOREM

When Ahmadi, Inc. sets up their drill press machine, 70% of the time it is set up correctly. It is known that if the machine is set up correctly it produces 90% acceptable parts. On the other hand, when the machine is set up incorrectly, it produces 20% acceptable parts. One item from the production is selected and is observed to be acceptable.

a. What is the probability that the machine is set up correctly? That is, we are interested in computing:

P(Correct set up | Acceptable part).

Let the following symbols represent the various events:

E1 = Correct set up

E2 = Incorrect set up

G = Good part (i.e., Acceptable part)

With the above notation we want to determine P(E1 | G).

b. Compute all the posterior probabilities.
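
The tabular Bayes calculation (prior × conditional = joint, then joint ÷ total = posterior) can be sketched in Python as below. The probabilities come from the problem statement, with P(E2) = 0.30 taken as the complement of P(E1); the function name `bayes_table` is hypothetical.

```python
# Priors and conditional probabilities of a good part (G) from the problem statement.
priors = {"E1": 0.70, "E2": 0.30}        # P(E2) is the complement of P(E1)
likelihoods = {"E1": 0.90, "E2": 0.20}   # P(G | E1), P(G | E2)


def bayes_table(prior, likelihood):
    """Return (joint probabilities, P(B), posterior probabilities) for each event."""
    joint = {e: prior[e] * likelihood[e] for e in prior}
    p_b = sum(joint.values())
    posterior = {e: joint[e] / p_b for e in joint}
    return joint, p_b, posterior


joint, p_good, posterior = bayes_table(priors, likelihoods)
for event in priors:
    print(event, priors[event], likelihoods[event],
          round(joint[event], 4), round(posterior[event], 4))
print("P(G) =", round(p_good, 4))
```

Observing an acceptable part raises the probability of a correct setup above its prior value of 0.70, and the posterior probabilities sum to 1 as a check.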
