[FLINK-10989] OrcRowInputFormat uses two different file systems - ASF JIRA

Details

Type: Bug
Status: Open
Priority: Not a Priority
Resolution: Unresolved
Affects Version/s: 1.7.0
Fix Version/s: None
Component/s: Connectors / ORC
Labels:
- auto-deprioritized-major
- auto-deprioritized-minor

Description

The OrcRowInputFormat seems to use two different FileSystem implementations: Flink's FileSystem for listing the files and generating the InputSplits, and Hadoop's FileSystem for actually reading the input splits. This is problematic if one configures only Flink's S3 FileSystem but does not provide an S3 implementation for Hadoop's FileSystem, because the files can then be listed but the resulting splits cannot be read.

I think this behaviour is unintuitive and can lead to hard-to-debug problems for users.
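
The mismatch described above can be illustrated with a minimal, dependency-free sketch. It does not use the real Flink or Hadoop APIs: `Registry`, `createSplits`, `openSplit`, and the `s3://bucket/orc-data` path are hypothetical stand-ins, assumed only for illustration, that model two independent scheme-to-file-system lookups. The point is that split generation consults one lookup while reading consults the other, so the failure surfaces only at read time.

```java
import java.util.List;
import java.util.Set;

public class TwoFileSystemsSketch {

    /** Hypothetical stand-in for a file system lookup keyed by URI scheme. */
    static final class Registry {
        private final String name;
        private final Set<String> schemes;

        Registry(String name, Set<String> schemes) {
            this.name = name;
            this.schemes = schemes;
        }

        /** Throws if this registry has no file system for the URI's scheme. */
        void require(String uri) {
            int idx = uri.indexOf("://");
            // Paths without a scheme are treated as local files in this sketch.
            String scheme = idx < 0 ? "file" : uri.substring(0, idx);
            if (!schemes.contains(scheme)) {
                throw new IllegalStateException(
                        name + " has no file system for scheme '" + scheme + "'");
            }
        }
    }

    /** Models split generation, which goes through the Flink-side lookup. */
    static List<String> createSplits(Registry flinkFs, String dir) {
        flinkFs.require(dir);
        // Pretend the directory contains two ORC files.
        return List.of(dir + "/part-0.orc", dir + "/part-1.orc");
    }

    /** Models reading a split, which goes through the Hadoop-side lookup. */
    static String openSplit(Registry hadoopFs, String split) {
        hadoopFs.require(split);
        return "reading " + split;
    }

    /** Runs both phases and reports where, if anywhere, the job fails. */
    static String run(Registry flinkFs, Registry hadoopFs, String dir) {
        List<String> splits = createSplits(flinkFs, dir);
        try {
            for (String split : splits) {
                openSplit(hadoopFs, split);
            }
            return "ok";
        } catch (IllegalStateException e) {
            return "splits created: " + splits.size()
                    + ", read failed: " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        // Only the Flink side knows about s3; the Hadoop side does not.
        Registry flink = new Registry("Flink", Set.of("file", "s3"));
        Registry hadoop = new Registry("Hadoop", Set.of("file", "hdfs"));
        System.out.println(run(flink, hadoop, "s3://bucket/orc-data"));
    }
}
```

In this model, listing succeeds and two splits are produced, and the error appears only once the first split is opened, which is what makes the real problem hard to trace back to a missing Hadoop-side S3 configuration.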

People

Assignee: Unassigned
Reporter: Till Rohrmann
Votes: 1
Watchers: 3

Dates

Created: 22/Nov/18 16:49
Updated: 15/Oct/22 13:16