Question 6: In logistic regression, we assume Y = (Y1, . . . , Yn) T is a collection of n binary observations. For each Yi, we observe Xi = (Xi1, Xi2, Xi3) T as predictors. We assume log(pi / (1 - pi)) = Xi^T θ, where θ = (θ1, . . . , θ3) T is the vector of regression coefficients.
a) Formulate the overall log-likelihood of the dataset l(Y).
b) Derive the first derivative ∂l(Y)/∂θ2.
c) Derive the second derivative ∂^2 l(Y)/∂θ2∂θ3.