Which Python library is commonly used for natural language processing tasks such as text tokenization and language understanding?
A. Matplotlib
B. Scikit-learn
C. spaCy
D. Seaborn
Answer: Option C
A. Matplotlib
B. Scikit-learn
C. spaCy
D. Seaborn
Answer: Option C
A. Chi-squared test
B. T-test
C. ANOVA
D. Regression analysis
In the context of data ethics, what does "bias mitigation" refer to?
A. Increasing the sample size
B. Improving model accuracy
C. Removing outliers from a dataset
D. Reducing biases in data collection
What does the term "overfitting" mean in machine learning?
A. The model fits the training data too closely and performs poorly on new data
B. The model generalizes well to new data
C. The model is too simple and underperforms on the training data
D. The model is perfectly accurate on all data
In the CRISP-DM data mining process model, what does "DM" stand for?
A. Data Modeling
B. Data Mining
C. Data Manipulation
D. None of the above
Join The Discussion