Q1) Suppose that a hospital recorded the age and body fat percentage of 18 randomly selected adults, with the following results (show your steps):
age:   23    23    27    27    39    41    47    49    50
%fat:  9.5   26.5  7.8   17.8  31.4  25.9  27.4  27.2  31.2
age:   52    54    54    56    57    58    58    60    61
%fat:  34.6  42.5  28.8  33.4  30.2  34.1  32.9  41.2  35.7
(a) Calculate the mean, median, and standard deviation of age and %fat.
(b) Draw the boxplots for age and %fat.
(c) Draw a scatter plot and a q-q plot based on these two variables.
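One way to check hand calculations for part (a) is a short script using only the Python standard library. This is a minimal sketch, not a required method: the helper name summarize is hypothetical, and because the question does not say whether the population or the sample standard deviation is wanted, both are reported.

```python
import statistics

# Data copied from the table in Q1.
age = [23, 23, 27, 27, 39, 41, 47, 49, 50,
       52, 54, 54, 56, 57, 58, 58, 60, 61]
fat = [9.5, 26.5, 7.8, 17.8, 31.4, 25.9, 27.4, 27.2, 31.2,
       34.6, 42.5, 28.8, 33.4, 30.2, 34.1, 32.9, 41.2, 35.7]


def summarize(values):
    """Return mean, median, and both standard deviations of a list of numbers."""
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        # Assumption: the question does not specify which definition to use.
        "pop_std": statistics.pstdev(values),    # divides by n
        "sample_std": statistics.stdev(values),  # divides by n - 1
    }


age_summary = summarize(age)
fat_summary = summarize(fat)

for name, summary in (("age", age_summary), ("%fat", fat_summary)):
    print(name, {key: round(value, 2) for key, value in summary.items()})
```

With 18 values, the median is the average of the 9th and 10th sorted values, which is a useful step to show by hand. The plots in parts (b) and (c) would need a plotting library and are not covered by this sketch.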
Q2) Briefly outline how to compute the dissimilarity between objects described by the following:
(a) Nominal attributes
(b) Asymmetric binary attributes
(c) Numeric attributes
(d) Term-frequency vectors
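The four cases in Q2 can be illustrated with small functions. This is a sketch under common textbook definitions, which are an assumption here since the question does not fix the formulas: the mismatch ratio for nominal attributes, a Jaccard-style measure that ignores 0/0 matches for asymmetric binary attributes, the Minkowski distance for numeric attributes (in practice often applied after normalization), and cosine similarity for term-frequency vectors. All function names are hypothetical.

```python
import math


def nominal_dissimilarity(x, y):
    """d = (p - m) / p, with p attributes in total and m matching attributes."""
    p = len(x)
    m = sum(1 for a, b in zip(x, y) if a == b)
    return (p - m) / p


def asymmetric_binary_dissimilarity(x, y):
    """d = (r + s) / (q + r + s); negative (0/0) matches are ignored."""
    q = sum(1 for a, b in zip(x, y) if a == 1 and b == 1)
    r = sum(1 for a, b in zip(x, y) if a == 1 and b == 0)
    s = sum(1 for a, b in zip(x, y) if a == 0 and b == 1)
    total = q + r + s
    # Assumption: two all-zero objects are treated as identical.
    return (r + s) / total if total else 0.0


def minkowski_distance(x, y, h=2):
    """h = 1 gives the Manhattan distance, h = 2 the Euclidean distance."""
    return sum(abs(a - b) ** h for a, b in zip(x, y)) ** (1 / h)


def cosine_similarity(x, y):
    """Cosine of the angle between two term-frequency vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    if norm_x == 0 or norm_y == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_x * norm_y)


print(nominal_dissimilarity(["red", "A", "x"], ["red", "B", "y"]))
print(asymmetric_binary_dissimilarity([1, 0, 1, 0, 0], [1, 1, 0, 0, 0]))
print(minkowski_distance([0, 0], [3, 4]))
print(cosine_similarity([5, 0, 3, 0, 2], [3, 0, 2, 0, 1]))
```

Cosine is a similarity, so a dissimilarity for part (d) is usually taken as 1 minus this value.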
Q3) Given two objects represented by the tuples (22, 1, 42, 10) and (20, 0, 36, 8) (show your steps):
(a) Compute the Euclidean distance between the two objects.
(b) Compute the Manhattan distance between the two objects.
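The two distances in Q3 can be verified with a few lines of Python. This is a minimal sketch with hypothetical variable names. It mirrors the hand calculation: take the per-attribute differences, then either sum their squares and take the square root (Euclidean) or sum their absolute values (Manhattan).

```python
import math

# The two objects from Q3.
x = (22, 1, 42, 10)
y = (20, 0, 36, 8)

# Step 1: per-attribute differences.
diffs = [a - b for a, b in zip(x, y)]  # [2, 1, 6, 2]

# Step 2a: Euclidean distance = sqrt(2^2 + 1^2 + 6^2 + 2^2) = sqrt(45).
euclidean = math.sqrt(sum(d * d for d in diffs))

# Step 2b: Manhattan distance = |2| + |1| + |6| + |2| = 11.
manhattan = sum(abs(d) for d in diffs)

print(f"Euclidean: {euclidean:.4f}")  # Euclidean: 6.7082
print(f"Manhattan: {manhattan}")      # Manhattan: 11
```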