Question

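(a) Sketch the tree corresponding to the CART partition given below. The word in each box indicates the class label for that region.

[Figure: CART partition of the (X1, X2) feature space. X1 axis marked at 3, 4, 6; X2 axis marked at 0, 1. Region labels as extracted: Cat, Dog, Sheep, Cat, Rabbit.]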

(b) Create a diagram similar to that given in part (a) using the CART tree below. Indicate the class label in each region of the partitioned feature space.

[Figure: CART tree. Split conditions in order of appearance: X2 < 1, X2 < 2, X1 < 1, X1 < 2, X2 < 0. Leaf labels as extracted: Banana, Orange, Grapes, Apple, Apple, Pear.]

(c) The predictive performance of a single tree can be substantially improved by aggregating many decision trees.
i. Briefly explain the random forest method for classification.
ii. Should pruned or un-pruned trees be used in random forests? Explain.
iii. Briefly explain the out-of-bag error rate for random forests.
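
For readers who want to see the out-of-bag idea from part (c) in action, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not a full random forest: each "tree" is a depth-1 stump rather than a deep un-pruned tree, the random feature choice is made once per stump rather than at every split, and the toy data, the labels "A"/"B", and the names fit_stump and oob_error are all hypothetical.

```python
import random
from collections import Counter

def majority(labels):
    """Most common label in a non-empty list."""
    return Counter(labels).most_common(1)[0][0]

def fit_stump(X, y, feature):
    """Fit a depth-1 tree on one feature; returns (feature, threshold, left, right)."""
    values = sorted({row[feature] for row in X})
    # Fallback when no split is possible: predict the majority class everywhere.
    best = (len(y) + 1, float("inf"), majority(y), majority(y))
    for lo, hi in zip(values, values[1:]):
        t = (lo + hi) / 2  # candidate threshold: midpoint between neighbours
        left = [yi for row, yi in zip(X, y) if row[feature] < t]
        right = [yi for row, yi in zip(X, y) if row[feature] >= t]
        l_lab, r_lab = majority(left), majority(right)
        errors = sum(yi != l_lab for yi in left) + sum(yi != r_lab for yi in right)
        if errors < best[0]:
            best = (errors, t, l_lab, r_lab)
    return (feature,) + best[1:]

def predict_stump(stump, row):
    feature, t, l_lab, r_lab = stump
    return l_lab if row[feature] < t else r_lab

def oob_error(X, y, n_trees=200, seed=0):
    """Misclassification rate using, for each observation, only trees that never saw it."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    votes = [Counter() for _ in range(n)]
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # bootstrap sample, with replacement
        in_bag = set(idx)
        feature = rng.randrange(p)                  # random feature for this stump
        stump = fit_stump([X[i] for i in idx], [y[i] for i in idx], feature)
        for i in range(n):
            if i not in in_bag:                     # observation i is out-of-bag
                votes[i][predict_stump(stump, X[i])] += 1
    scored = [i for i in range(n) if votes[i]]
    wrong = sum(votes[i].most_common(1)[0][0] != y[i] for i in scored)
    return wrong / len(scored)

# Hypothetical toy data: the classes separate on X1; X2 is pure noise.
rng = random.Random(1)
X = [[rng.uniform(0, 1), rng.uniform(0, 1)] for _ in range(20)] + \
    [[rng.uniform(2, 3), rng.uniform(0, 1)] for _ in range(20)]
y = ["A"] * 20 + ["B"] * 20
err = oob_error(X, y)
print(f"OOB error estimate: {err:.3f}")
```

Each bootstrap sample leaves out roughly a third of the observations, so every observation gets a majority vote from the trees that were not trained on it, and no separate test set is needed.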

(d) Briefly explain how K-fold cross-validation can be used to approximate a testing error rate. Describe one advantage of cross-validation over the validation set approach.
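
The K-fold procedure asked about in part (d) can be sketched as follows. This is a minimal illustration in plain Python, assuming a 1-nearest-neighbour classifier as a stand-in for whatever model is being assessed; the toy data and the names k_fold_indices, predict_1nn, and cv_error are hypothetical.

```python
import random

def k_fold_indices(n, k, seed=0):
    """Shuffle 0..n-1 and deal the indices into k roughly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def predict_1nn(train_X, train_y, x):
    """Label of the training point closest to x (squared Euclidean distance)."""
    dists = [sum((a - b) ** 2 for a, b in zip(row, x)) for row in train_X]
    return train_y[dists.index(min(dists))]

def cv_error(X, y, k=5, seed=0):
    """Average held-out misclassification rate over the k folds."""
    folds = k_fold_indices(len(X), k, seed)
    fold_errors = []
    for held_out in folds:
        held = set(held_out)
        train = [i for i in range(len(X)) if i not in held]  # the other k-1 folds
        train_X = [X[i] for i in train]
        train_y = [y[i] for i in train]
        wrong = sum(predict_1nn(train_X, train_y, X[i]) != y[i] for i in held_out)
        fold_errors.append(wrong / len(held_out))
    return sum(fold_errors) / k, folds

# Hypothetical toy data: two well-separated clusters.
rng = random.Random(2)
X = [[rng.uniform(0, 1), rng.uniform(0, 1)] for _ in range(20)] + \
    [[rng.uniform(5, 6), rng.uniform(5, 6)] for _ in range(20)]
y = ["A"] * 20 + ["B"] * 20
err, folds = cv_error(X, y, k=5)
print(f"5-fold CV error estimate: {err:.3f}")
```

Every observation is held out exactly once and used for training in the other K-1 rounds, whereas a single validation split trains on only part of the data and gives an estimate that depends on which observations happened to land in the validation set.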

Added by Sharon L.