Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-779

Make Tez grouping splits logic possible outside InputFormat

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.3.0
    • None
    • None

    Description

      Grouping currently fetches splits from the underlying file format.
      It'd be useful to allow grouping to accept a set of splits instead of always fetching them from the underlying format.
      One example of where this will be used : Bucketed Hive data - regular HiveInputFormat splits are generated, only splits belonging to the same bucket can be Grouped together.

      Attachments

        1. TEZ-779.2.patch
          88 kB
          Bikas Saha
        2. TEZ-779.1.patch
          85 kB
          Bikas Saha

        Activity

          People

            bikassaha Bikas Saha
            sseth Siddharth Seth
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: