Which metric, commonly used to evaluate classification models, is the ratio of correctly predicted positive instances to all actual positive instances?
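
To make the ratio in this question concrete, here is a minimal Python sketch. The function name `ratio_of_found_positives`, the `positive` parameter, and the label lists are hypothetical and chosen only for illustration; the sketch assumes binary labels where 1 marks the positive class.

```python
def ratio_of_found_positives(y_true, y_pred, positive=1):
    """Correctly predicted positives divided by all actual positives."""
    # Denominator: every instance whose true label is positive.
    actual_positives = sum(1 for t in y_true if t == positive)
    if actual_positives == 0:
        raise ValueError("y_true contains no positive instances")
    # Numerator: positives that the model also predicted as positive.
    correct_positives = sum(
        1 for t, p in zip(y_true, y_pred) if t == positive and p == positive
    )
    return correct_positives / actual_positives


# Hypothetical labels: 4 actual positives, 3 of them predicted as positive.
y_true = [1, 1, 1, 1, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0]
score = ratio_of_found_positives(y_true, y_pred)  # 3 / 4 = 0.75
```

Note that the false positive at the fifth position does not change the result, because the denominator counts only actual positives.
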
Which method splits the dataset into three parts (training, validation, and test sets) so that the model's performance is assessed on unseen data?
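
The three-part split described in this question might look like the following minimal sketch. The helper name `three_way_split`, the 15% fractions, and the fixed seed are assumptions made for illustration, not values taken from the question.

```python
import random


def three_way_split(items, val_fraction=0.15, test_fraction=0.15, seed=0):
    """Shuffle items, then cut them into training, validation, and test lists."""
    shuffled = list(items)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)

    n_test = int(len(shuffled) * test_fraction)
    n_val = int(len(shuffled) * val_fraction)

    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]  # everything left over is used for training
    return train, val, test


# Hypothetical dataset of 100 example indices.
train, val, test = three_way_split(range(100))
```

The three lists do not overlap, so the test set stays unseen while the model is fitted on the training set and tuned on the validation set.
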