Text: 3- (2 points) [Data Stream Mining] Suppose the window of a data stream is as shown below (Most recent bit is on the right side).
101101100010111011001011 4 - At least one One of Two of Two of size 4 of size 8 size 2 size 1
First, using the DGIM method, estimate the number of 1s in the last k positions, for the following values of k:
k = 5 k = 16
Second, for each value of k, what is the actual number of 1s (i.e., the exact number of 1s)?
4- (3 points) [Frequent Itemset Mining] A transaction database contains ten transactions as shown below:
TID Items bought 1 {s,u,s,a,n} 2 {s,a,r,a} 3 {s,a,m} 4 {r,i,c,h,a,r,d} 5 {e,r,i,c} 6 {n,i,c,k} 7 {p,a,t,r,i,c,k} 8 {e,m,i,l,y} 9 {c,h,a,r,1,e,s} 10 {1,i,d,a}
And minimum_support = 30%. Answer the following questions:
(a) Using A-priori, find all frequent itemsets that contain item i. Show the steps. (b) Calculate the confidence value of the following rules 1) {r,c}-{i} 2) {a,r}-{i} 3) {a}-{i} 4) {c}-{i} 5) {c}-{i,r} 6) {r}-{i,c} (c) Calculate the interest value for all the rules in (b).