Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
Description
VectorizedOrcInputFormat is introduced to read orc data in batches.
When returning each row of data, instead of actually retrieving each field, we use BaseRow's abstraction to return a Columnar Row-like view.
This will greatly improve the downstream filtered scenarios, so that there is no need to access redundant fields on the filtered data.
Attachments
Issue Links
- links to