[IGNITE-16568] Kubernetes cluster might split apart on initialization - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 2.11.1
Fix Version/s: None
Component/s: networking
Labels:
- Kubernetes

External issue URL:
https://stackoverflow.com/questions/71118869/openshift-k8s-issue-with-project-pods-not-joining-same-grid-but-rather-create-m
Ignite Flags:

Docs Required, Release Notes Required

Description

The issue is mostly about Kubernetes/Openshift deployment but could also affect other scenarios relying on external services (AWS?).

Consider the following case: multiple nodes (PODs) were started simultaneously and all of them are trying to locate if there are other nodes available using

TcpDiscoveryKubernetesIpFinder. that just returns a set of registered IPs. Since there is no delay or retry attempt, all nodes could be returned with an empty IPs list and decide to be a coordinator, i.e. to start multiple independent grids.

Proposed changes: extend TcpDiscoveryKubernetesIpFinder with either a configurable delay or repetitions counter to check if there is a non-empty list of available IPs.

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Alexandr Shapkin

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 16/Feb/22 19:10

Updated:: 16/Feb/22 22:42