CDSP 3 Quiz

1. What is the main purpose of Exploratory Data Analysis (EDA)?





2. Which method in pandas can you use to get information about the attributes of a dataset?





3. What is a target feature in a supervised dataset?





4. What type of bias occurs when a sample doesn’t represent the entire population?





5. Which technique is used to create new, non-observed data points in a dataset?





6. What is a common cause of imbalanced datasets?





7. What is the purpose of a confusion matrix in evaluating a model?





8. What does the Pearson correlation coefficient measure?





9. What kind of distribution is indicated by a bell-shaped, symmetrical curve?





10. What does a positive value of skewness indicate about a distribution?





11. What is one benefit of using box plots in data analysis?





12. What does feature scaling aim to achieve in data preprocessing?





13. Which of the following is a type of transformation function?





14. What is the goal of dimensionality reduction?





15. What is one purpose of feature splitting?





16. What is a common issue with datasets that have too many features relative to the number of data points?





17. Which of the following is an example of data encoding?





18. What is the advantage of using normalization for feature scaling?





19. What technique is most useful for handling datasets with a large number of missing values?





20. What is the primary goal of hyperparameter tuning in machine learning models?