Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-36212

Failing non-leader JM pod causes the deployment to be stuck in failing state

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • kubernetes-operator-1.9.0
    • None
    • Kubernetes Operator
    • None

    Description

      In case one of the JM replica pods cannot start but otherwise the job is healthy, the operator still cannot recognise the job in a running state.

      This prevents for instance periodic savepoints to be taken or savepoint upgrades to be performed correctly.

      Attachments

        Activity

          People

            Unassigned Unassigned
            gyfora Gyula Fora
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: