Problem 3. Suppose we have a set of observations (x1, y1), ..., (xn, yn) coming from the simple linear model:
yi = ̠0 + ̠1xi + ̡i, 1 ≤ i ≤ n.
Least square method finds the minimizer of
Q(̠0, ̠1) = ̓(yi - ̠0 - ̠1xi)".
Define
Sxx = ̓(xi - x̄)", Syy = ̓(yi - ȳ)", Sxy = ̓(xi - x̄)(yi - ȳ),
which are the sample variances of x and y, and the sample covariance between x and y, respectively.
(a) Show that the least square estimates can be written as ̢1 = Sxy/Sxx, and ̢0 = ȳ - ̢1x̄. Note that the expression of ̢1 appears different from what you see in class. Explain why the two expressions are in fact equivalent.
(b) Show that yi - ȳ = ̢1(xi - x̄) + (yi - ̲i).
(c) Show that Syy = ̢̓1"(xi - x̄)" + ̓(yi - ̲i)".
(d) The sample correlation between y and x is ̢ = Sxy/∑SxxSyy. Show that the coefficient of determination R" is equal to the square of the sample correlation ̢".