constellation/.github
Markus Rudy 85b44f7f57
ci: make waiting for nodes more robust (#2981)

After initializing the cluster, a lot of things happen in parallel and
are potentially getting in each other's way: nodes are joining,
daemonsets are proliferating, the network is being set up. During this
period, it's not unusual that the Kubernetes API server is unavailable
for a short time, e.g. due to etcd losing quorum or load balancing
changes.

This period of instability has the potential to affect all kubectl
commands negatively, leading to problems especially for tests, where
command failures often lead to test failures. On the other hand, we'd
expect everything to be quite stable after the initial dust settles.

Therefore, this commit changes how we wait after initializing a cluster.
Until we have a reasonable expectation of readiness, we ignore command
failures and wait for things to stabilize. The cluster is considered
stable once all configured nodes and all API servers report ready.
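
The waiting strategy described above can be illustrated with a minimal sketch. This is not the code from the commit: the function names `wait_until_stable` and `cluster_ready`, the timeout and interval defaults, and the injectable `sleep` and `clock` parameters are all assumptions made for illustration. The idea shown is only that errors from the readiness check are swallowed until the check succeeds or a deadline passes.

```python
import time
from typing import Callable


def cluster_ready(ready_nodes: int, expected_nodes: int,
                  ready_api_servers: int, expected_api_servers: int) -> bool:
    """Hypothetical readiness rule: all configured nodes and all API servers report ready."""
    return (ready_nodes >= expected_nodes
            and ready_api_servers >= expected_api_servers)


def wait_until_stable(check: Callable[[], bool],
                      timeout_s: float = 600.0,
                      interval_s: float = 5.0,
                      sleep: Callable[[float], None] = time.sleep,
                      clock: Callable[[], float] = time.monotonic) -> bool:
    """Poll check() until it returns True or the timeout expires.

    Exceptions raised by check() are treated as "not ready yet", since
    command failures are expected while the cluster is still settling.
    """
    deadline = clock() + timeout_s
    while True:
        try:
            if check():
                return True
        except Exception:
            # Assumed policy: ignore failures during the unstable period and retry.
            pass
        if clock() >= deadline:
            return False
        sleep(interval_s)
```

In practice, `check` would wrap whatever query reports node and API server readiness (for example, a kubectl invocation). Once `wait_until_stable` returns True, subsequent command failures can be treated as real errors again.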
2024-03-13 09:42:18 +01:00
actions ci: make waiting for nodes more robust (#2981) 2024-03-13 09:42:18 +01:00
ISSUE_TEMPLATE github: replace discord with GitHub discussions 2023-10-05 16:57:19 +02:00
workflows deps: update Go to v1.22.1 2024-03-06 14:50:01 +01:00
actionlint.yaml ci: use rbe for unit tests 2023-11-10 18:15:59 +01:00
failure_project_metadata.json ci: refactor e2e test failure notifications (#1625) 2023-04-12 16:06:26 +02:00
failure_project_template.md ci: add hint about cleaning up lingering resources on failure (#2300) 2023-09-04 10:09:35 +02:00
pull_request_template.md ci: reduce amount of regular tests (#2885) 2024-02-01 15:05:07 +01:00
release.yml cli: add iam upgrade apply (#2132) 2023-07-26 17:29:03 +02:00
teams_payload_template.json ci: refactor teams notification action (#2600) 2023-11-15 08:48:13 +01:00