1. What is the primary goal of k-means clustering?
2. In k-means clustering, what is a centroid?
3. What is the key challenge in k-means clustering?
4. What is a common technique used to determine the optimal number of clusters in k-means?
5. What is the elbow point in k-means clustering?
6. What is the global cost function in k-means clustering used for?
7. What is a disadvantage of k-means clustering?
8. What is the purpose of silhouette analysis?
9. What is hierarchical agglomerative clustering (HAC)?
10. When should hierarchical clustering be used over k-means?
11. What is the role of a dendrogram in hierarchical clustering?
12. What is the purpose of DBSCAN (Density-Based Spatial Clustering of Applications with Noise)?
13. What is a key advantage of DBSCAN over k-means?
14. What does the epsilon (ϵ) parameter represent in DBSCAN?
15. What is the main difference between DBSCAN and k-means clustering?
16. What does the silhouette coefficient close to 1 indicate?
17. What is the within-cluster sum of squares (WCSS) used for in clustering?
18. What is a primary metric used to evaluate density-based clustering models like DBSCAN?
19. What is the primary advantage of hierarchical clustering over k-means clustering?
20. What does the between-cluster sum of squares (BCSS) measure?