  1. Apache Hudi
  2. HUDI-6045

Support append mode by default for MOR table with INSERT operation

Details

    Description

      1. Insert into a COW table: if online sync clustering and online async clustering (plan generation || plan execution) are both configured, which one takes effect depends on the combination of flags:

         sync-clustering=true & generate async-clustering=true & execute async-clustering=true: sync clustering takes effect
         sync-clustering=true & generate async-clustering=false & execute async-clustering=true: async clustering takes effect

      2. Insert into a MOR table: the writer produces log files for some flag combinations and parquet files for others:

         async-compaction=true & generate async-clustering=false: log files are generated
         async-compaction=false & generate async-clustering=true: parquet files are generated

      This inconsistency is confusing for users.

      After the modification:

      1. Insert into a COW table: sync clustering always takes priority over online async clustering.
      2. Insert into a MOR table: parquet files are always generated.
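
The two rules proposed after the modification can be sketched as a small decision model. This is only an illustration under stated assumptions: the class `InsertModeRules`, its enums, and its method and parameter names are hypothetical and are not Hudi classes or configuration keys. The boolean flags simply mirror the informal names used in this description (sync-clustering, generate/execute async-clustering, async-compaction).

```java
/**
 * Hypothetical model of the post-change INSERT rules described in this issue.
 * None of these names exist in Hudi; they only mirror the flags named above.
 */
final class InsertModeRules {

    enum ClusteringMode { SYNC, ASYNC, NONE }

    enum FileKind { PARQUET, LOG }

    private InsertModeRules() {
    }

    /**
     * COW insert: sync clustering wins whenever it is enabled.
     * Async clustering counts as configured if either plan generation
     * or plan execution is enabled (assumption taken from the
     * "plan generate || plan execute" wording in the description).
     */
    static ClusteringMode effectiveClustering(boolean syncClustering,
                                              boolean generateAsyncClustering,
                                              boolean executeAsyncClustering) {
        if (syncClustering) {
            return ClusteringMode.SYNC;
        }
        if (generateAsyncClustering || executeAsyncClustering) {
            return ClusteringMode.ASYNC;
        }
        return ClusteringMode.NONE;
    }

    /**
     * MOR insert: the flags are accepted but deliberately ignored,
     * because the proposed behavior is to always write parquet files.
     */
    static FileKind morInsertFileKind(boolean asyncCompaction,
                                      boolean generateAsyncClustering) {
        return FileKind.PARQUET;
    }

    public static void main(String[] args) {
        // The two COW combinations listed in the description.
        System.out.println(effectiveClustering(true, true, true));
        System.out.println(effectiveClustering(true, false, true));
        // The two MOR combinations listed in the description.
        System.out.println(morInsertFileKind(true, false));
        System.out.println(morInsertFileKind(false, true));
    }
}
```

In this sketch both COW combinations from the description resolve to sync clustering, and both MOR combinations resolve to parquet output, which is the consistency the change is aiming for.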

            People

              Assignee: Unassigned
              Reporter: zhuanshenbsj1