Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-22216 Improving PySpark/Pandas interoperability
  3. SPARK-25328

Add an example for having two columns as the grouping key in group aggregate pandas UDF

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.4.0
    • 2.4.0
    • PySpark
    • None

    Description

      https://github.com/apache/spark/pull/20295 added an alternative interface for group aggregate pandas UDFs. It does not have an example that have more than one columns as the grouping key in functions.py

      Attachments

        Activity

          People

            gurwls223 Hyukjin Kwon
            smilegator Xiao Li
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: