Advanced Big Data & Data Engineering
Q92 / 100

What is "predicate pushdown" and why is it especially effective with columnar file formats?

Correct! Well done.

Incorrect.

The correct answer is A) Filter conditions are pushed down to the storage layer so it can skip reading row groups or files that cannot match, using stored min/max statistics

A

Correct Answer

Filter conditions are pushed down to the storage layer so it can skip reading row groups or files that cannot match, using stored min/max statistics

Explanation

Columnar formats like Parquet store per-column statistics (min/max) per row group, so the reader can skip entire row groups that cannot satisfy a filter, drastically reducing I/O.

Progress
92/100