What is the main objective of "k-means++" initialization in the K-means clustering algorithm?
A. To reduce the number of clusters
B. To add more clusters
C. To visualize data
D. To choose initial cluster centroids in a way that improves the convergence of the algorithm
Answer: Option D
A. Chi-squared test
B. T-test
C. ANOVA
D. Regression analysis
In the context of data ethics, what does "bias mitigation" refer to?
A. Increasing the sample size
B. Improving model accuracy
C. Removing outliers from a dataset
D. Reducing biases in data collection
What does the term "overfitting" mean in machine learning?
A. The model fits the training data too closely and performs poorly on new data
B. The model generalizes well to new data
C. The model is too simple and underperforms on the training data
D. The model is perfectly accurate on all data
In the CRISP-DM data mining process model, what does "DM" stand for?
A. Data Modeling
B. Data Mining
C. Data Manipulation
D. None of the above

Join The Discussion