Advanced Big Data & Data Engineering
Q96 / 100

What is "salting" as a technique to address data skew in a distributed join?

Correct! Well done.

Incorrect.

The correct answer is A) Appending a random or computed suffix to skewed join keys on both sides to spread a hot key's rows across multiple partitions

A

Correct Answer

Appending a random or computed suffix to skewed join keys on both sides to spread a hot key's rows across multiple partitions

Explanation

Salting splits a single overloaded key into several "salted" sub-keys distributed across partitions, then the results are combined, balancing the workload across tasks.

Progress
96/100