00:01
We are given the sample of data points x, y, listed at the top of this whiteboard, and we want to use this data to answer the questions a through f as follows.
00:10
In part a on the left, we want to draw a scatter plot for these data points.
00:14
I've already included this scatter plot, and the data points x, y, are demarcated, with the black crosses or x's.
00:22
Next, in part b to the right, we want to compute the relevant sums with this data, as well as the pearson correlation coefficient r.
00:28
I've already included the value of the sums as they are simply found by following these equations exactly.
00:36
So some x is some of the x values, some of the y values, and so on.
00:41
Next, you complete compute r using the following formula, which takes as input the sum to just computed, as well as the sample size n.
00:48
Pluggyn gives r equals negative .973.
00:52
Next, for part c, let's find the line of best fit.
00:55
First, we have to find the x and y means, x bar, y bar, found by dividing by n, the sums for x and y.
01:03
Next, we find the slope and intercept for the best fit line.
01:08
B, the slope is given by the formula here...