Which of the following statements is/are true in the case of k-means clustering? 1. For using k-means clustering on the data, it requires the number of clusters to be specified. 2. The value of k can take any value in the range of 1 to n (number of data points). 3. The k-means algorithm does clustering based on the distance between the data points and the cluster centroids. All the statements are true (2) & (3) Only (1) (1) & (3)
Added by Cristina A.
Step 1
For using k-means clustering on the data, it requires the number of clusters to be specified. - This is true. The 'k' in k-means stands for the number of clusters which the user needs to specify. Show more…
Show all steps
Close
Your feedback will help us improve your experience
Lien Le and 70 other Intro Stats / AP Statistics educators are ready to help you.
Ask a new question
Labs
Want to see this concept in action?
Explore this concept interactively to see how it behaves as you change inputs.
Key Concepts
Recommended Videos
a) Clustering is a collection of data objects. b) Clustering analysis is also known as segmentation. c) There are no predetermined classes in clustering. d) Clustering is an unsupervised classification technique. Given k, the k-means algorithm is implemented in the following steps: a) Select k initial observations as seeds. b) Assign each of the other observations to the cluster with the nearest centroid. c) Recalculate cluster centroids (averages) based on the assignments from step b. d) Iterate between steps b and c, stop when there are no more new assignments. e) Select the desired number of clusters k. f) Calculate average cluster values (cluster centroids) over each variable. For the initial iteration, this will simply be the initial seed observations.
Md.Daniyal A.
Consider the following statements: 1) If the standard deviation is zero, the mean must be zero. 2) The standard deviation can be negative. 3) If the mean is large positive, the standard deviation will also be large positive. Which of the following is a correct about the above statements, select one. All are False. All are True. Only 2 and 3 are True. Only 1 is True.
Madhur L.
Which of the following is true of the value of k in the k-Nearest Neighbors algorithm? A) The value of k is always taken as a constant and is equal to 1. If the value of k is large, it drastically increases variability. If the value of k is very large, it introduces biases into the classification decisions. If the value of k is large, the classification of a record is very sensitive to the classification of the single record to which it is closest.
Hubert A.
Recommended Textbooks
Elementary Statistics a Step by Step Approach
The Practice of Statistics for AP
Introductory Statistics
Transcript
18,000,000+
Students on Numerade
Trusted by students at 8,000+ universities
Watch the video solution with this free unlock.
EMAIL
PASSWORD