cloudstack/plugins
Wei Zhou 69e8ebc03f
CKS: retry if unable to drain node or unable to upgrade k8s node (#8402)
* CKS: retry if unable to drain node or unable to upgrade k8s node

I ran the CKS upgrade 16 times; 11 of the 16 upgrades succeeded.

2 of the 16 upgrades failed because a node could not be drained:
```
error: unable to drain node "testcluster-of7974-node-18c8c33c2c3" due to error:[error when evicting pods/"cloud-controller-manager-5b8fc87665-5nwlh" -n "kube-system": Post "https://10.0.66.18:6443/api/v1/namespaces/kube-system/pods/cloud-controller-manager-5b8fc87665-5nwlh/eviction": unexpected EOF, error when evicting pods/"coredns-5d78c9869d-h5nkz" -n "kube-system": Post "https://10.0.66.18:6443/api/v1/namespaces/kube-system/pods/coredns-5d78c9869d-h5nkz/eviction": unexpected EOF], continuing command...
```

3 of the 16 upgrades failed because the etcd leader changed while the dashboard manifest was being applied:
```
Error from server: error when retrieving current configuration of:
Resource: "rbac.authorization.k8s.io/v1, Resource=roles", GroupVersionKind: "rbac.authorization.k8s.io/v1, Kind=Role"
Name: "kubernetes-dashboard", Namespace: "kubernetes-dashboard"
from server for: "/mnt/k8sdisk//dashboard.yaml": etcdserver: leader changed
```

* CKS: remove tests of creating/deleting HA clusters as they are covered by the upgrade test

* Update PR 8402 as suggested

* test: remove the CKS cluster if it fails to create or verify
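
The retry behaviour named in the commit title can be illustrated with a minimal sketch. This is not the code from PR 8402: the helper names (`retry`, `make_flaky_action`), the attempt count, and the delay are all hypothetical, chosen only to show how a transient failure such as `unexpected EOF` or `etcdserver: leader changed` might be absorbed by repeating the step.

```python
import time
from typing import Callable, Dict, Tuple


def retry(action: Callable[[], bool],
          attempts: int = 3,
          delay_seconds: float = 0.0,
          sleep: Callable[[float], None] = time.sleep) -> bool:
    """Run `action` until it returns True or `attempts` runs are used up.

    `attempts` and `delay_seconds` are illustrative defaults, not values
    taken from the commit.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        if action():
            return True
        # Wait before the next try, but not after the final failure.
        if attempt < attempts:
            sleep(delay_seconds)
    return False


def make_flaky_action(failures_before_success: int) -> Tuple[Callable[[], bool], Dict[str, int]]:
    """Hypothetical stand-in for a drain or upgrade step that fails a few times."""
    state = {"calls": 0}

    def action() -> bool:
        state["calls"] += 1
        return state["calls"] > failures_before_success

    return action, state


# A step that fails twice and then succeeds is rescued by three attempts.
drain, drain_state = make_flaky_action(failures_before_success=2)
drain_ok = retry(drain, attempts=3)

# A step that keeps failing is still reported as a failure.
upgrade, upgrade_state = make_flaky_action(failures_before_success=5)
upgrade_ok = retry(upgrade, attempts=3)
```

Bounding the number of attempts keeps a genuinely broken node from stalling the upgrade indefinitely, while still giving intermittent API-server or etcd errors a chance to clear.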
2024-02-06 11:14:10 +01:00
| Name | Last commit | Date |
| --- | --- | --- |
| acl | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| affinity-group-processors | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| alert-handlers | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| api | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| backup | veeam: fix some issues with restoring volume from backup and attaching it to VM (#8570) | 2024-01-29 11:40:43 +01:00 |
| ca/root-ca | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| database | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| dedicated-resources | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| deployment-planners | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| event-bus | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| ha-planners/skip-heurestics | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| host-allocators/random | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| hypervisors | Change Cryptsetup validation (#8482) | 2024-02-01 09:43:28 +01:00 |
| integrations | CKS: retry if unable to drain node or unable to upgrade k8s node (#8402) | 2024-02-06 11:14:10 +01:00 |
| metrics | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| network-elements | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| outofbandmanagement-drivers | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| storage | linstor: Fix possible NPE if Linstor storage-pool data missing (#8319) | 2023-12-08 17:02:18 +05:30 |
| storage-allocators/random | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| user-authenticators | ldap trust map cleanup on domain delete (#7915) | 2023-09-19 08:01:15 +02:00 |
| user-two-factor-authenticators | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |
| pom.xml | Updating pom.xml version numbers for release 4.18.2.0-SNAPSHOT | 2023-09-12 17:26:53 +02:00 |