CDSP 7 Quiz

1. What is the primary goal of k-means clustering?





2. In k-means clustering, what is a centroid?





3. What is the key challenge in k-means clustering?





4. What is a common technique used to determine the optimal number of clusters in k-means?





5. What is the elbow point in k-means clustering?





6. What is the global cost function in k-means clustering used for?





7. What is a disadvantage of k-means clustering?





8. What is the purpose of silhouette analysis?





9. What is hierarchical agglomerative clustering (HAC)?





10. When should hierarchical clustering be used over k-means?





11. What is the role of a dendrogram in hierarchical clustering?





12. What is the purpose of DBSCAN (Density-Based Spatial Clustering of Applications with Noise)?





13. What is a key advantage of DBSCAN over k-means?





14. What does the epsilon (ϵ) parameter represent in DBSCAN?





15. What is the main difference between DBSCAN and k-means clustering?





16. What does the silhouette coefficient close to 1 indicate?





17. What is the within-cluster sum of squares (WCSS) used for in clustering?





18. What is a primary metric used to evaluate density-based clustering models like DBSCAN?





19. What is the primary advantage of hierarchical clustering over k-means clustering?





20. What does the between-cluster sum of squares (BCSS) measure?