2. Decision Trees (5 pts)
Consider the training dataset given below. In this dataset, X1, X2, and X3 are the input attributes and Y is the class variable. "+ve" and "-ve" are the two outcome classes, similar to "Yes" and "no".
Example# X1 X2 X3 Y
E1 0 0 0 +ve
E2 0 0 1 -ve
E3 0 1 0 -ve
E4 0 1 1 +ve
E5 1 0 0 -ve
Which attribute has the highest information gain, i.e., reduction in Gini Index? Justify your answer. Use Gini Index ($1 - p_1^2 - p_2^2$) as measurement criteria