00:01
We want to know why the regression line is associated with the two points, a, b, c, d, are the same as the line that passes through both.
00:09
Well, because if we only have two points, we know that between two points exists a line.
00:14
So if we have points a, b, and c, b, we know that in between these two points is a line, and that would be a regression line.
00:22
Some other conceptual questions we can answer about this is, why is the regression line, or what is the smallest possible sum of squares? well, we've already discussed the sum of squares is equal to the sum of the actual value minus the expected value square.
00:41
But if we see that, for example, we have y equals x, and then our points are going to be 1 -1 and 2 -2 and 3 -3, for example.
00:55
Then we see that there is no error.
00:57
These points are directly on the line.
00:59
So because of that we have a sum of square is equal to zero.
01:02
There is no error.
01:04
Some other conceptual questions we can seek to answer.
01:08
If we have the different points lying on a straight line, what can we say about the regression line? what goes back to our previous point? if all these lie on a straight line, this regression line is absolutely perfect.
01:18
It doesn't have any error, and the regression line for these three data points would be the actual line that connects all three.
01:25
It would touch all three points and connect them all.
01:28
If, however, we moved one of our points to be 3 -4, and then let's get rid of this one just to simplify things...