[KUDU-1806] Creating a list of scan tokens should retrieve tablets in larger batches - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Critical
Resolution: Fixed
Affects Version/s: 1.2.0
Fix Version/s: 1.2.0
Component/s: client
Labels:
None

Target Version/s:

1.2.0

Description

In a test on a 200-node cluster with 40 concurrent query streams, we found that the Impala planner was sometimes taking minutes to fetch the list of scan tokens. The tables in the query had several thousand tablets, so with the default batch size of 10 tablets per GetTableLocations RPC, the planning required hundreds of round trips, each of which had some chance of getting bumped from the queue due to backpressure, etc.

A local hack to change the batching to 1000 tablets per RPC reduced the planning times down to sub-second.

Attachments

Issue Links

relates to

KUDU-1811 C++ client: use larger batches when fetching scan tokens

Resolved

Activity

People

Assignee:: Todd Lipcon

Reporter:: Todd Lipcon

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 13/Dec/16 04:08

Updated:: 19/Dec/16 05:06

Resolved:: 19/Dec/16 05:06