Uploaded image for project: 'Kylin'
  1. Kylin
  2. KYLIN-2543

Still build dictionary for TopN group by column even using non-dict encoding

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • v2.0.0
    • v2.0.0
    • Job Engine
    • None

    Description

      Using latest 2.0 beta, create a cube with TopN, but specifying using non-dict encoding, the JSON is like:

      {
            "name": "TOP_CUSTOMER",
            "function": {
              "expression": "TOP_N",
              "parameter": {
                "type": "column",
                "value": "P_LINEORDER.V_REVENUE",
                "next_parameter": {
                  "type": "column",
                  "value": "P_LINEORDER.LO_CUSTKEY"
                }
              },
              "returntype": "topn(100,4)",
              "configuration": {
                "topn.encoding.P_LINEORDER.LO_CUSTKEY": "integer:4",
                "topn.encoding_version.P_LINEORDER.LO_CUSTKEY": "1"
              }
            }
      

      But, in the third step "Extract Fact Table Distinct Columns", there is still a reducer running for "P_LINEORDER.LO_CUSTKEY"; as this column is a UHC, it takes a long time and finally got failed.

      Expected behavior: no reducer for this column. 1.6.0 doesn't have this issue; this is a new issue in 2.0 during the upgrade to snowflake.

      Attachments

        Activity

          People

            shaofengshi Shao Feng Shi
            shaofengshi Shao Feng Shi
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: