Intermediate
Artificial Intelligence & Machine Learning
Q68 / 100
What is stratified sampling and why is it used when splitting a dataset?
The correct answer is B) Splitting data so that each subset preserves the same proportion of class labels as the original dataset, preventing skewed train/test distributions
Explanation
Stratified sampling keeps the class ratio consistent across the train, validation, and test sets. This matters most for imbalanced datasets, where a purely random split could leave one subset with too few minority-class examples (or none at all) to train on or evaluate fairly.
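As a rough illustration of the idea, here is a minimal sketch of a stratified train/test split written with only the Python standard library. The function name `stratified_split`, its parameters, and the 90/10 toy dataset are assumptions made for this example; they are not part of the original question.

```python
import random
from collections import Counter, defaultdict


def stratified_split(labels, test_fraction=0.2, seed=0):
    """Return (train_idx, test_idx), splitting each class separately.

    Hypothetical helper for illustration: every class contributes
    roughly `test_fraction` of its own examples to the test set.
    """
    rng = random.Random(seed)

    # Group example indices by their class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    train_idx, test_idx = [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        # Per-class test size, rounded to the nearest integer.
        n_test = round(len(indices) * test_fraction)
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return sorted(train_idx), sorted(test_idx)


# Assumed toy dataset: 90 majority-class and 10 minority-class examples.
labels = [0] * 90 + [1] * 10
train_idx, test_idx = stratified_split(labels, test_fraction=0.2, seed=42)

train_counts = Counter(labels[i] for i in train_idx)
test_counts = Counter(labels[i] for i in test_idx)
print("train:", dict(train_counts))
print("test:", dict(test_counts))
```

With this toy data, both subsets keep the original 10% minority share (8 of 80 in train, 2 of 20 in test). In practice, a library routine is usually preferable to a hand-written helper; for example, scikit-learn's `train_test_split` accepts a `stratify` argument for this purpose.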