Advanced
Big Data & Data Engineering
Q96 / 100
What is "salting" as a technique to address data skew in a distributed join?
Correct! Well done.
Incorrect.
The correct answer is A) Appending a random or computed suffix to skewed join keys on both sides to spread a hot key's rows across multiple partitions
A
Correct Answer
Appending a random or computed suffix to skewed join keys on both sides to spread a hot key's rows across multiple partitions
Explanation
Salting splits a single overloaded key into several "salted" sub-keys distributed across partitions, then the results are combined, balancing the workload across tasks.
Progress
96/100