Intermediate Artificial Intelligence & Machine Learning
Q68 / 100

What is stratified sampling and why is it used when splitting a dataset?

Correct! Well done.

Incorrect.

The correct answer is B) Splitting data so that each subset preserves the same proportion of class labels as the original dataset, preventing skewed train/test distributions

B

Correct Answer

Splitting data so that each subset preserves the same proportion of class labels as the original dataset, preventing skewed train/test distributions

Explanation

Stratified sampling keeps the class ratio consistent across train, validation, and test sets, which is especially important for imbalanced datasets — otherwise a random split could leave a set with too few minority-class examples to evaluate fairly.

Progress
68/100