containerized-data-importer/automation
Arnon Gilboa edda5abe0f
Add new Prometheus alerts and label existing alerts (#2998)
* Add Prometheus alerts and label existing alerts

- CDINoDefaultStorageClass - not having a default (or virt default)
SC is surely not an OpenShift error, as admins may prefer their cluster
users to only use explicit SC names. However, in the CDI context when
DV is created with default SC but default does not exist, we will fire
an error event and the PVC will be Pending for the default SC, so when
there are such Pending PVCs we will fire an alert.

- CDIDefaultStorageClassDegraded - when the default (or virt default)
SC does not support CSI/Snapshot clone (smart clone) or does not have
ReadWriteMany access mode (for live migration).

- CDIStorageProfilesIncomplete - add storageClass and provisioner
labels.

- CDIDataImportCronOutdated - add dataImportCron namespace and name
labels.

Signed-off-by: Arnon Gilboa <agilboa@redhat.com>

* CR fixes

Signed-off-by: Arnon Gilboa <agilboa@redhat.com>

* Create stub VolumeSnapshotClass for testing

Including the VolumeSnapshot/Class/Content crds for the
CDIDefaultStorageClassDegraded alert func test.

Signed-off-by: Arnon Gilboa <agilboa@redhat.com>

* Add snapshot manifests for tests

Signed-off-by: Arnon Gilboa <agilboa@redhat.com>

* Deploy snapshot CRDs in the hpp destructive lane

Remove stub snapshot CRDs

Signed-off-by: Arnon Gilboa <agilboa@redhat.com>

* Add label explanation to new metric help

Also rename the metric kubevirt_cdi_storageprofile_status to
kubevirt_cdi_storageprofile_info since it always reports value 1,
where the label values provide the details about the storage
class and storage profile.

Signed-off-by: Arnon Gilboa <agilboa@redhat.com>

* Revert NoProvisioner check removal

Signed-off-by: Arnon Gilboa <agilboa@redhat.com>

* CR fixes

Signed-off-by: Arnon Gilboa <agilboa@redhat.com>

* Nicify StorageProfile metric update

Signed-off-by: Arnon Gilboa <agilboa@redhat.com>

---------

Signed-off-by: Arnon Gilboa <agilboa@redhat.com>
2023-12-19 12:29:08 +01:00
..
ceph-gc.sh Add Ceph DV GC lane automation script (#2349) 2022-07-06 18:54:23 +02:00
ceph-wffc.sh Switch CI lanes to 1.28 (#2908) 2023-10-17 03:37:00 +02:00
ceph.sh Switch CI lanes to 1.28 (#2908) 2023-10-17 03:37:00 +02:00
destructive.sh Switch CI lanes to 1.28 (#2908) 2023-10-17 03:37:00 +02:00
git-askpass.sh Add scripts to do releases with PROW (#1705) 2021-03-19 12:36:49 +01:00
istio.sh Switch CI lanes to 1.28 (#2908) 2023-10-17 03:37:00 +02:00
latest-hpp.sh Switch CI lanes to 1.28 (#2908) 2023-10-17 03:37:00 +02:00
nfs.sh Switch CI lanes to 1.28 (#2908) 2023-10-17 03:37:00 +02:00
non-csi-hpp.sh Switch CI lanes to 1.28 (#2908) 2023-10-17 03:37:00 +02:00
postsubmit-main.sh fix broken postsubmit hook (#2002) 2021-10-28 18:34:29 +02:00
previous-hpp.sh Switch CI lanes to 1.28 (#2908) 2023-10-17 03:37:00 +02:00
prow_periodic_push.sh add script that is called from prow for pushing images (#1874) 2021-07-30 03:31:11 +02:00
release.sh Move apis to staging, push to containerized-data-importer-api (#1997) 2021-10-28 13:40:24 +02:00
test.sh Add new Prometheus alerts and label existing alerts (#2998) 2023-12-19 12:29:08 +01:00
upgrade.sh Switch CI lanes to 1.28 (#2908) 2023-10-17 03:37:00 +02:00