Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
1.2.0
-
None
-
None
Description
ORC split generation does not have clearly defined strategies for different scenarios (many small orc files, few small orc files, many large files etc.). Few strategies like storing the file footer in orc split, making entire file as a orc split already exists. This JIRA to make the split generation simpler, support different strategies for various use cases (BI, ETL, ACID etc.) and to lay the foundation for HIVE-7428.
Attachments
Attachments
Issue Links
- is related to
-
HIVE-11043 ORC split strategies should adapt based on number of files
- Closed
- relates to
-
HIVE-11546 Projected columns read size should be scaled to split size for ORC Splits
- Closed
- links to