[ARROW-11877] [C++] Add initial microbenchmarks for Dataset internals - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 4.0.0
Component/s: C++
Labels:

External issue URL:
https://github.com/apache/arrow/issues/27719

Description

A quick investigation of ~~ARROW-11781~~ showed much of the overhead lies in evaluating partition expressions against the filter. While much of this is just kernel evaluation, we should have benchmarks of key Datasets internals like SimplifyWithGuarantee.

Attachments

Issue Links

relates to

ARROW-11781 [Python] Reading small amount of files from a partitioned dataset is unexpectedly slow

Resolved

links to

GitHub Pull Request #9638

Activity

People

Assignee:: David Li

Reporter:: David Li

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 05/Mar/21 18:46

Updated:: 11/Jan/23 08:22

Resolved:: 10/Mar/21 15:53

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

50m