[CALCITE-2040] Create adapter for Apache Arrow - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.37.0
Component/s: None
Labels:
- pull-request-available

Description

Create an adapter for Apache Arrow. This would allow people to execute SQL statements, via JDBC or ODBC, on data stored in Arrow in-memory format.

Since Arrow is an in-memory format, it is not as straightforward as reading, say, CSV files using the file adapter: an Arrow data set does not have a URL. (Unless we use Arrow's Feather format, or use an in-memory file system such as Alluxio.) So we would need to devise a way of addressing Arrow data sets.

Also, since Arrow is an extremely efficient format for processing data, it would also be good to have Arrow as a calling convention. That is, implementations of relational operators such as Filter, Project, Aggregate in addition to just TableScan.

Lastly, when we have an Arrow convention, if we build adapters for file formats (for instance the bioinformatics formats SAM, VCF, FASTQ discussed in CALCITE-2025) it would make a lot of sense to translate those formats directly into Arrow (applying simple projects and filters first if applicable). Those adapters would belong as a "contrib" module in the Arrow project better than in Calcite.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

arrow_data.py
13/Apr/21 16:38
2 kB
Michael Mior

Issue Links

depends upon

ARROW-16608 [Gandiva][Java] Unsatisfied Link Error on M1 Mac when using mavencentral artifacts

Resolved

is blocked by

ARROW-11135 [Java][Gandiva] Using Maven Central artifacts as dependencies produces runtime errors

Resolved

is depended upon by

CALCITE-6304 Improve the Arrow adapter

Open

relates to

CALCITE-2173 Sample implementation of ArrowAdapter

Closed

links to

GitHub Pull Request #2133

GitHub Pull Request #2810

GitHub Pull Request #3666

(2 links to)

Activity

People

Assignee:: hongyu guo

Reporter:: Julian Hyde

Votes:: 0 Vote for this issue

Watchers:: 17 Start watching this issue

Dates

Created:: 07/Nov/17 18:54

Updated:: 07/May/24 10:50

Resolved:: 11/Mar/24 02:47

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

1.5h