Commit Graph

261 Commits

Author SHA1 Message Date
Tuomas Katila
455f6ff212 images: set 0.27.0 tag for the release
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-06-02 09:56:32 +03:00
Tuomas Katila
13097ac78d operator: increase memory resources to 100/120Mi
Fixes: #1416

Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-05-22 08:42:40 +03:00
Mikko Ylinen
e428cd6c19 go.mod: update to k8s 1.27.1 and controller runtime 0.15.x
k8s 1.27.x triggers build errors on controller-runtime 0.14.x
so we will need to update to 0.15.x at the same time.

Changes include:

* k8s e2e framework moved to use Ginkgo context so we add
  test context to all our test nodes.
* adapt Ginkgo parameter modifications.
* adapt SGX admissionwebhook to InjectDecoder removal.
* adapt deviceplugins and FPGA CRDs to controller-runtime
  API changes.

Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-05-09 14:49:24 +03:00
Mikko Ylinen
16724043b2 operator: move to controller-tools v0.12.0
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-05-05 15:02:36 +03:00
Hyeongju Johannes Lee
807949d29e qat_dpdk_app: add test-crypto1 for gen4 devices
Signed-off-by: Hyeongju Johannes Lee <hyeongju.lee@intel.com>
2023-05-02 03:39:21 -07:00
Hyeongju Lee
ed08d11aa3
Merge pull request #1392 from mythi/PR-2023-019
sgx: stop using local source hooks for EPC registration
2023-05-02 12:26:12 +03:00
Mikko Ylinen
6b5e65a137 operator: update kube-rbac-proxy image to v0.14.1
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-05-02 09:16:24 +03:00
Mikko Ylinen
3a4c0e574f sgx: stop using local source hooks for EPC registration
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-04-28 14:59:41 +03:00
Hyeongju Johannes Lee
1a41402903 qat init: make conf optional
2023-04-27 12:48:27 -07:00
Tuomas Katila
974829ff7c gpu: try to fetch PodList from kubelet API
In large clusters and with resource management, the load
from gpu-plugins can become heavy for the api-server.
This change will start fetching pod listings from kubelet
and use api-server as a backup. Any error other than a timeout
will also move the logic back to using api-server.

Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-03-30 12:43:02 +03:00
Mikko Ylinen
934c00f5fc qat: add support for QAT 402xx
Based on
https://lore.kernel.org/linux-crypto/20230303165650.81405-1-damian.muszynski@intel.com/

Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-03-09 15:06:30 +02:00
Mikko Ylinen
1090c12c74 deployments: fix kubectl SGX w/ NFD instructions
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-03-07 09:19:30 +02:00
Mikko Ylinen
4d96c1b49d deployments: update SGX NodeFeatureRules
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-03-07 09:19:30 +02:00
Mikko Ylinen
3dc815cda9 deployments: fix FPGA plugin namespace
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-02-21 21:15:06 +02:00
Mikko Ylinen
eb632f625a deployments: remove unused deviceplugins RBAC rules
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-02-21 20:14:03 +02:00
Mikko Ylinen
5c6e60eeb1 operator: move to controller-tools v0.11.3
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-02-21 20:14:03 +02:00
Tuomas Katila
3a1880ec8b Remove overlays using kube-system
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-02-13 12:47:22 +02:00
Mikko Ylinen
f559d8717d
Merge pull request #1327 from eero-t/nfd-features
Use more generic name for NFD features host directory volume
2023-02-13 11:45:26 +02:00
Eero Tamminen
2f3dc23651 Use more generic name for NFD features host directory volume
NFD hooks are deprecated and going away:
https://github.com/kubernetes-sigs/node-feature-discovery/issues/856

This makes the mount names more future-proof, and shows where later
changes need to be done (to change operator mount directory, and
switch hook-using deployments e.g. to feature files).

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
2023-02-08 18:20:41 +02:00
Mikko Ylinen
c65d4ab896 operator: update to 0.26.0 images
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-01-20 11:49:51 +02:00
Mikko Ylinen
019c6b6cc5 deployments: update to NFD v0.12.1
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-01-19 12:00:06 +02:00
Mikko Ylinen
90aeca48c5 deployments: update SGX configuration
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-01-12 09:41:17 +02:00
Tuomas Katila
3922aa111e xpu-manager sidecar to provide XeLink details to node labels
Fetches xelink topology information from xpu-manager's rest
interface and stores them as labels under NFD's feature.d directory.
NFD then assigns the labels to the node. On exit, sidecar will
remove the label file from disk.

Co-authored-by: Ukri Niemimuukko <ukri.niemimuukko@intel.com>
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-01-11 14:12:48 +02:00
Tuomas Katila
f1af4166e2 Update NFD to 0.12.0
Fixes #1283

Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-01-05 14:13:13 +02:00
Tuomas Katila
d1e8350c6e gpu: add new nfd + monitoring + shared-dev deployment option
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-01-05 14:13:13 +02:00
Mikko Ylinen
10a26b8fd8
Merge pull request #1234 from hj-johannes-lee/qat-cfgServices
qat: add configuration of cfgServices to qat initcontainer
2022-12-13 08:26:13 +02:00
Hyeongju Johannes Lee
a6037eae3c qat: add configuration of cfgServices to qat initcontainer
Signed-off-by: Hyeongju Johannes Lee <hyeongju.lee@intel.com>
2022-12-12 21:48:21 +02:00
Mikko Ylinen
980c9f1c53 deployments: update kube-rbac-proxy image and args
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2022-11-21 19:17:21 +02:00
Mikko Ylinen
afce0ed79c
Merge pull request #1196 from ozhuraki/e2e-operator
operator: Add e2e tests for DSA, IAA
2022-11-17 21:30:33 +02:00
Oleg Zhurakivskyy
ef7954c8e1 operator: Add e2e tests for DSA, IAA
Closes #1230

Signed-off-by: Oleg Zhurakivskyy <oleg.zhurakivskyy@intel.com>
2022-11-17 17:47:21 +02:00
Mikko Ylinen
cc144c36df
Merge pull request #1240 from hj-johannes-lee/operator-update-image-name
operator: update image name and version
2022-11-14 07:26:14 +02:00
Hyeongju Johannes Lee
f85fbd6848 operator: fix the operator image name and update docs in DEVEL.md
https://sdk.operatorframework.io/docs/cli/operator-sdk_run_bundle/
According to the documentation of operator-sdk, docker.io should
be specified.

Signed-off-by: Hyeongju Johannes Lee <hyeongju.lee@intel.com>
2022-11-11 17:04:37 +02:00
Mikko Ylinen
cb3b2a7d9e
Merge pull request #1242 from ozhuraki/operator-dsa-iaa-provisioning
operator: Add DSA, IAA provisioning
2022-11-09 17:02:40 +02:00
Hyeongju Lee
744263de15
Merge pull request #1150 from mythi/PR-2022-062
demo: update idxd-config version to v3.5.0
2022-11-09 16:25:24 +02:00
Oleg Zhurakivskyy
e9d3f683d5 operator: Add DSA, IAA provisioning
Signed-off-by: Oleg Zhurakivskyy <oleg.zhurakivskyy@intel.com>
2022-11-09 15:37:28 +02:00
Mikko Ylinen
c96f79e358 iaa: set max_transfer_size to idxd-config
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2022-11-03 16:48:27 +02:00
Mikko Ylinen
5876882066 operator: add support for Liveness and Readiness probes
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2022-11-03 10:25:07 +02:00
Mikko Ylinen
325bcd0d17 operator: update kube-rbac-proxy image
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2022-10-29 00:04:34 +03:00
Oleg Zhurakivskyy
ad68e998b1 iaa: Switch to device type "iaa"
Signed-off-by: Oleg Zhurakivskyy <oleg.zhurakivskyy@intel.com>
2022-10-28 10:51:18 +03:00
Mikko Ylinen
419a5ab586 operator: update to 0.25.0 images
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2022-10-17 22:45:41 +03:00
Ed Bartosh
9dea92541a
Merge pull request #1088 from hj-johannes-lee/dlb-initcontainer
dlb: add initcontainer to plugin
2022-10-07 14:43:12 +03:00
Hyeongju Johannes Lee
11b04425c2 dlb: add initcontainer to plugin
initcontainer enables vfs and configures vfs
 - only first pf is used to configure a vf
 - only one vf is configured from the pf
add dlb-initcontainer kustomize overlay
update CRD to have initImage
implement operator to run initcontainer
update e2e test to run initcontainer overlay
update envtest to test initimage

Signed-off-by: Hyeongju Johannes Lee <hyeongju.lee@intel.com>
2022-10-06 17:11:03 +03:00
Ukri Niemimuukko
1d09cd6549 align gpu kustomize object naming with operator naming
Operator has used "gpu-manager" as part of the cluster object names
it creates. Kustomize based deployments can be aligned with that.

Signed-off-by: Ukri Niemimuukko <ukri.niemimuukko@intel.com>
2022-09-26 19:50:55 +03:00
Mikko Ylinen
8c0abb54b1 deployments: disable unused serviceAccounts for plugins
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2022-09-23 08:21:25 +03:00
Ed Bartosh
9228d93fbe
Merge pull request #1169 from tkatila/gpu/use-nfd-label-with-fractional
gpu: add nodeSelector to fractional overlay
2022-09-22 14:49:57 +03:00
Tuomas Katila
8ecf258a82 gpu: add nodeSelector to fractional overlay
Updated documentation indicates that fractional overlay
uses nfd so maybe it should.

Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2022-09-22 10:45:11 +03:00
Mikko Ylinen
0f5afc258d operator: move to controller-tools v0.10.0
With the latest version of controller-tools, we get to set
reinvocationPolicy tag so that we no longer have to add that
field manually in our Admission Webhook manifests.

Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2022-09-21 19:37:00 +03:00
Manish Regmi
a888a91d2a add selinux labels for QAT
Signed-off-by: Manish Regmi <manish.regmi@intel.com>
2022-09-19 15:31:55 -07:00
Tuomas Katila
666d516143 Use 'i915' instead of 'drm' kernel mod as requirement for gpu label
6.0.0 kernel doesn't seem to have 'drm' module anymore and it makes
more sense to depend on the i915 module.

Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2022-09-19 15:41:23 +03:00
Manish Regmi
22e9d5f882 add selinux labels for GPU plugins
2022-09-15 14:44:51 -04:00