Description
We should port the client-side KuduPartitioner implementation from KUDU-1713 (https://gerrit.cloudera.org/#/c/5775/) to the Java client.
This would allow Spark and other Java integrations to repartition and pre-sort data by target partition before writing it to Kudu.
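To illustrate the intended use, here is a minimal sketch of the repartition-and-pre-sort step. It does not use the Kudu client: the `partitioner` argument is a hypothetical stand-in for whatever row-to-partition mapping the ported KuduPartitioner would expose, and the class name, method name, and modulo-based toy partitioner are assumptions made only for this example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.ToIntFunction;

public class RepartitionSketch {
    /**
     * Groups keys by the partition index returned by the partitioner,
     * then sorts the keys inside each partition.
     * The partitioner is a hypothetical stand-in, not the Kudu API.
     */
    public static <K extends Comparable<K>> Map<Integer, List<K>> repartitionAndSort(
            List<K> keys, ToIntFunction<K> partitioner) {
        // TreeMap keeps partitions ordered by index for deterministic output.
        Map<Integer, List<K>> byPartition = new TreeMap<>();
        for (K key : keys) {
            int partition = partitioner.applyAsInt(key);
            byPartition.computeIfAbsent(partition, i -> new ArrayList<>()).add(key);
        }
        // Pre-sort each partition so a writer could send rows in key order.
        for (List<K> bucket : byPartition.values()) {
            Collections.sort(bucket);
        }
        return byPartition;
    }

    public static void main(String[] args) {
        // Toy partitioner (assumption): three partitions chosen by key modulo 3.
        ToIntFunction<Integer> toyPartitioner = k -> Math.floorMod(k, 3);
        Map<Integer, List<Integer>> result =
                repartitionAndSort(Arrays.asList(7, 2, 9, 4, 3, 8), toyPartitioner);
        System.out.println(result); // prints {0=[3, 9], 1=[4, 7], 2=[2, 8]}
    }
}
```

In a Spark integration the same idea would presumably map onto a custom partitioner plus a sort within partitions, so that each task writes rows destined for a single tablet rather than many tasks writing to the same tablet server.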
Issue Links
- relates to: KUDU-2672 Spark write to kudu, too many machines write to one tserver. (Resolved)