Transitive data skew

In distributed computing problems, transitive data skew is an issue of data synchronization.

It arises with the uneven distribution of otherwise evenly distributed data across a number of devices while the data is in transition. If sorted data is being distributed across multiple devices and the column on which that data is sorted is the "key" used to identify the target device, the resulting transitive data skew may be self-correcting.


This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.