2.6 Draw Scatter Plots and Best Fitting Lines

Goal  Fit lines to data in scatter plots.

Your Notes

VOCABULARY

Scatter plot

A graph of a set of data pairs (x, y)

Positive correlation

The relationship between paired data when y tendto increase as x increases

Negative correlation

The relationship between paired data when y tends to decrease as x increases

Correlation coefficient

A number, denoted by r, from 1 to 1 that measures how well a line fits a set of data pairs (x, y)

Best-fitting line

The line that lies as close as possible to all the data points

Example 1

Estimate correlation coefficients

For each scatter plot, describe the correlation shown and tell whether the correlation coefficient is closest to 1, 0.5, 0, 0.5, or 1.

a.

b.

Solution

a.The scatter plot shows a _strong negative_ correlation. So, the best estimate

given is r = _1_.

b.The scatter plot shows a _weak positive_ correlation. So, r is between _0_ and _1_ but not too close to either one. The best estimate given is r = _0.5_.

Your Notes

APPROXIMATING A BEST-FITTING LINE

Step 1 Draw a _scatter plot_ of the data.

Step 2 Sketch the _line_ that appears to follow most closely the trend given by the data points. There should be about as many points _above_ the line as _below_ it.

Step 3 Choose _two points_ on the line, and estimate the coordinates of each point.

Step 4 Write an _equation_ of the line that passes through the two points from Step 3.

Example 2

Approximating a best-fitting line

The table below gives the number of people y who atended each of the first seven football games x of the season. Approximate the best-fitting line for the data.

x / 1 / 2 / 3 / 4 / 5 / 6 / 7
y / 722 / 763 / 772 / 826 / 815 / 857 / 897

1.Draw a _scatter plot_.

2.Sketch the best-fit line.

3.Choose two points on the line. For the scatter plot shown, you might

choose(1, _722_ ) and (2, _750_ ).

4.Write an equation of the line. The line that passes through the two points

has a slope of:

m == _28_

Use the point-slope form to write the equation.

y  y1 = m(x  x1) / Point-slope form
y _722_ = _28(x 1)_ / Substitute for m, x1and y1
y = _28x + 694 / Simplify.

An approximation of the best-fitting line is y = _28x + 694_.

Your Notes

Example 3

Use a line of fit to make predictions

Use the equation of the line of best fit from Example 2 to predict the number of people that will attend the tenth football game.

Because you are predicting the tenth game, substitute _10_ for x in the equation from Example 2.

y = _28x + 694_ = _28(10) + 694_ = _974_

You can predict that _974_ people will attend the tenth football game.

Checkpoint Complete the following exercises.

For each scatter plot (a) tell whether the data has positive correlation, negative correlation, or no correlation, and (b) tell whether the correlation coefficient is closest to 1, 0.5, 0, 0.5, or 1.

1.

  1. positive correlation
  2. 1

2.

  1. no correlation
  2. 0

3.The table gives the average class score y on each chapter test for the first six chapters x of the textbook.

x / 1 / 2 / 3 / 4 / 5 / 6
y / 84 / 83 / 86 / 88 / 87 / 90

a. Approximate the best-fitting line for the data.

b. Use your equation from part (a) to predict the test score for the 9th test that the class will take.

a.y= 1.3x + 82.1

b. about 94

Homework

______

______