Details
Improvement
Status: Resolved
Major
Resolution: Fixed
Public beta
None
None
Description
Currently, the KuduScanNode uses blocking APIs to read data from Kudu. As a result, for many queries we use about the same number of CPU cycles as a Parquet scan but spend a lot more wall-clock time waiting on reads, which makes us look unimpressive.
We should switch to the simplest multi-threaded model that gets our performance in range of Parquet.
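As a rough illustration of what a "simplest multi-threaded model" could look like, the sketch below has several scanner threads each issue blocking reads and hand the results to a single consumer through a shared queue, so the blocking waits overlap instead of adding up. This is not the actual KuduScanNode code and does not use the Kudu client API: `RowBatch`, `BlockingRead`, and `ScanAll` are hypothetical names, the blocking read is simulated with a sleep, and the queue is left unbounded for brevity.

```cpp
#include <cassert>
#include <chrono>
#include <condition_variable>
#include <mutex>
#include <queue>
#include <thread>
#include <vector>

// Hypothetical stand-in for the rows returned by one blocking read.
struct RowBatch {
  int range_id;
  int num_rows;
};

// Hypothetical stand-in for a blocking Kudu read. The sleep simulates time
// spent waiting on the network or disk while using no CPU.
RowBatch BlockingRead(int range_id) {
  std::this_thread::sleep_for(std::chrono::milliseconds(5));
  return RowBatch{range_id, 100};
}

// Scans `num_ranges` scan ranges using `num_threads` scanner threads and
// returns the total number of rows seen. Assumes num_threads >= 1.
int ScanAll(int num_ranges, int num_threads) {
  std::mutex mu;
  std::condition_variable cv;
  std::queue<RowBatch> batches;  // unbounded here; a real node would cap it
  int next_range = 0;            // next unclaimed scan range
  int active = num_threads;      // scanner threads still running

  auto scanner = [&]() {
    while (true) {
      int range;
      {
        std::lock_guard<std::mutex> l(mu);
        if (next_range >= num_ranges) break;
        range = next_range++;
      }
      // The blocking call happens without the lock held, so other scanner
      // threads can wait on their own reads at the same time.
      RowBatch batch = BlockingRead(range);
      {
        std::lock_guard<std::mutex> l(mu);
        batches.push(batch);
      }
      cv.notify_one();
    }
    {
      std::lock_guard<std::mutex> l(mu);
      --active;
    }
    cv.notify_one();
  };

  std::vector<std::thread> threads;
  for (int i = 0; i < num_threads; ++i) threads.emplace_back(scanner);

  // Consumer side: drain batches as they arrive until every scanner is done.
  int total_rows = 0;
  std::unique_lock<std::mutex> l(mu);
  while (true) {
    cv.wait(l, [&] { return !batches.empty() || active == 0; });
    while (!batches.empty()) {
      total_rows += batches.front().num_rows;
      batches.pop();
    }
    if (active == 0) break;
  }
  l.unlock();
  for (auto& t : threads) t.join();
  return total_rows;
}
```

Calling `ScanAll(8, 4)` and `ScanAll(8, 1)` should do the same amount of work per range, but with four threads the simulated waits overlap, which is the wall-clock effect this issue is after. The total CPU cost stays roughly the same.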