Details
- Type: Task
- Status: Closed
- Priority: Major
- Resolution: Fixed
Description
Query-based major compaction can currently run when the execution engine is MR. This can cause data loss similar to HIVE-23703; the fix for that data loss under MR was reverted by HIVE-23763.
Query-based minor compaction, by contrast, runs only when the execution engine is Tez; otherwise it falls back to MR (non-query-based) compaction. We should extend the same engine check and fallback to major compaction as well.
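The intended behaviour could be sketched roughly as below. This is an illustration only, not the actual Hive compactor code: the class, enum, and method names are hypothetical, the configuration is modelled as a plain map, and the property keys `hive.compactor.crud.query.based` and `hive.execution.engine` as well as their defaults are assumptions of this sketch. The point is that the same engine check decides the compactor for both minor and major compaction.

```java
import java.util.Map;

// Hypothetical sketch; names do not correspond to real Hive classes.
class CompactorSelectionSketch {

    public enum CompactorKind { QUERY_BASED, MR }

    /**
     * Picks the compactor for a compaction request, minor or major alike.
     * Query-based compaction is chosen only when it is enabled AND the
     * execution engine is Tez; in every other case fall back to the
     * MR (non-query-based) compactor.
     */
    public static CompactorKind chooseCompactor(Map<String, String> conf) {
        // Assumed property keys and defaults, for illustration only.
        boolean queryBasedEnabled = Boolean.parseBoolean(
                conf.getOrDefault("hive.compactor.crud.query.based", "false"));
        boolean engineIsTez = "tez".equalsIgnoreCase(
                conf.getOrDefault("hive.execution.engine", "mr"));

        if (queryBasedEnabled && engineIsTez) {
            return CompactorKind.QUERY_BASED;
        }
        return CompactorKind.MR;
    }
}
```

With this shape, a major compaction requested while the engine is MR would take the MR compactor path instead of the query-based one.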
Issue Links
- relates to
  - HIVE-23703 Major QB compaction with multiple FileSinkOperators results in data loss and one original file (Closed)
  - HIVE-23763 Query based minor compaction produces wrong files when rows with different buckets Ids are processed by the same FileSinkOperator (Closed)
- links to