Commit Graph

61 Commits

Author SHA1 Message Date
Ukri Niemimuukko
1c40eaaa83 Add deprecation notices about GAS 2025-01-23 20:21:36 +02:00
Tuomas Katila
606ac77647 gpu: levelzero: documentation
Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2024-09-19 19:14:15 +03:00
Tuomas Katila
402fb8d9cd gpu: add support for CDI devices
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2024-09-11 09:29:55 +03:00
Tuomas Katila
4946b26018 gpu: doc: monitoring resource notes
Also align xelink-sidecar deployment with the new files in
the xpu manager project.

Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2024-03-13 08:16:16 +02:00
Tuomas Katila
1de1024530 gpu: add xe notes
Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2024-03-12 15:41:44 +02:00
hugo-syn
039865aec8
chore: Fix multiple typos (#1653)
* chore: Fix multiple typos

Signed-off-by: hugo-syn <hugo.vincent@synacktiv.com>
2024-01-25 08:18:48 +02:00
Tuomas Katila
fd3ad4003f gpu: restructure readme
Split readme into smaller chunks, show only one "easy installation"
and hide the rest. Add some notes about tile resources.

Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-12-08 08:42:08 +02:00
Mikko Ylinen
834f598f80 deployments: update to NFD v0.14.1 and drop custom GPU deployment
With the NFD recent versions (v0.13+), it's no longer necessary to
start NFD with custom nfd-master args/rbac settings to get numeric
labels registered as extended resources.

The same can be specified via NodeFeatureRules which also works for
"local" source with feature files.

Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2023-09-20 14:02:52 +03:00
Tuomas Katila
031ee64590 gpu/doc: Add Max Series support and a note about SR-IOV
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-09-14 13:21:30 +03:00
Tuomas Katila
ea659a5e4b nfd: add rules to label nodes with different GPUs
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-09-12 16:20:33 +03:00
Tuomas Katila
8971280215 gpu: add notes about gpu-plugin modes
Fixes: #1381

Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-04-26 14:28:36 +03:00
Tuomas Katila
2a365263b0 gpu: add note about dry-run and yaml output
Fixes: #1059

Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-04-24 09:52:36 +03:00
Eero Tamminen
92b8fe9380 Update GPU plugin README driver information
Fixes: #1382

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
2023-04-20 13:53:13 +03:00
Tuomas Katila
d1e8350c6e gpu: add new nfd + monitoring + shared-dev deployment option
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2023-01-05 14:13:13 +02:00
Lukas Kalbertodt
ae0f9c5334
Fix command in docs by adding single quotes
Otherwise most shells will interpreted `?` in an unintended way.
2022-12-16 12:28:52 +01:00
Mikko Ylinen
75bff62ba1
Merge pull request #1183 from tkatila/gpu-demo-updates
gpu: improve demo run instructions
2022-10-07 13:08:54 +03:00
Eero Tamminen
0b47ebd3e7 Add information on new DKMS kernel GPU driver packages
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
2022-10-06 18:08:53 +03:00
Tuomas Katila
56bc5ebeee Modifications based on Eero's comments
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2022-10-06 17:55:04 +03:00
Tuomas Katila
63cbe808a7 gpu: improve demo run instructions
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2022-10-05 16:10:03 +03:00
Eero Tamminen
647b484e7a Improve GPU drivers installation instructions
- Add note about LTS kernel DKMS source repo
- Correct note about the demo (unlike FPGA demo,
  GPU demo is not in docker hub)

Fixes: 89d3c5a4f3

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
2022-09-28 12:40:30 +03:00
Eero Tamminen
9b3ee06cb1 Add GPU plugin README prerequisites section
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
2022-09-23 20:32:46 +03:00
Tuomas Katila
eac635e439 gpu: fix documentation links
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2022-09-23 20:32:46 +03:00
Tuomas Katila
e375186458
Update cmd/gpu_plugin/README.md
Co-authored-by: Ukri Niemimuukko <ukri.niemimuukko@intel.com>
2022-09-15 15:30:23 +03:00
Tuomas Katila
c562db9b28 gpu: Improve installation options and documentation
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2022-09-15 15:19:23 +03:00
Tuomas Katila
230570f12e gpu: add mentions about data center gpu support
Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2022-09-09 13:07:50 +03:00
Mikko Ylinen
1b3accacc2 docs: rework development guide
Currently, each individual plugin README documents roughly the same
daily development steps to git clone, build, and deploy. Re-purpose
the plugin READMEs more towards cluster admin type of documentation
and start moving all development related documentation to DEVEL.md.

The same is true for e2e testing documentation which is scattered
in places where they don't belong to. Having all day-to-day
development Howtos is good to have in a centralized place.

Finally, the cleanup includes some harmonization to plugins'
table of contents which now follows the pattern:

* [Introduction](#introduction)
(* [Modes and Configuration Options](#modes-and-configuration-options))
* [Installation](#installation)
    (* [Prerequisites](#prerequisites))
    * [Pre-built Images](#pre-built-images)
    * [Verify Plugin Registration](#verify-plugin-registration)
* [Testing and Demos](#testing-and-demos)
    * ...

Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2022-08-31 20:00:15 +03:00
Tuomas Katila
db7e5bfc55 Add support for gas-container-tiles annotation
Adds functionality to convert container's tile annotation
in to corresponding L0 affinity mask. This helps to target
container's workload to specific L0 subdevices.

Signed-off-by: Tuomas Katila <tuomas.katila@intel.com>
2022-03-24 14:13:35 +02:00
Mikko Ylinen
c064bfc4f1 demo: add intel-opencl-icd
Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2022-02-24 11:06:27 +02:00
Eero Tamminen
36046d90a4 Make GPU plugin / resource label limitations more explicit
While the labeling limit is obvious after little thought, IMHO
limitations like this should either be stated out front, or be in
their own section in the README.  Commit does former for the GPU
plugin fractional resources, and latter for the NFD hook / labeling.
2022-01-04 11:43:08 +02:00
Eero Tamminen
599fc18e71 Provide workaround for the media issue and document it
The issue is with VA-API and QSV, not VPL media API.
2021-12-15 18:40:33 +02:00
Xu, Guoshu
e4c4a8f7ac GPU devices resource preferred allocation methods.
1. Implement PreferredAllocator interface.
2. Provide 3 preferred allocation policies: balancedPolicy, packedPolicy and nonePolicy.
3. Provide the cmdline interface: -allocation-policy balanced/packed/none, to select which preferred allocation policy to use.
4. Add operator support.

Co-authored-by: Mikko Ylinen <mikko.ylinen@intel.com>
2021-11-17 22:55:10 +08:00
Mikko Ylinen
e6cf299750 gpu: update READMEs
Commit 00a59e8f7d was not complete in that it didn't update
the corresponding documentation. This commit fixes that.

Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2021-10-08 11:57:16 +03:00
Li Ning
dcc12d9089 documentation: remove deprecated toc section in README
The 'Verify node kubelet config' content was removed in 6b208f8.

Signed-off-by: Li Ning <ning.a.li@transwarp.io>
2021-09-07 19:38:41 +08:00
Eero Tamminen
83e7de0d41 Make GPU plugin intro information more generic & accurate
- Information on specific HW & virtualization types on which GPU plugin
  is tested on, belongs to releases notes, not to README intro
  (where it has already became obsolete)
- HW offloading is provided by driver backends, not frontends
  (e.g. OneVPL is just one of the media driver frontends)
2021-06-22 18:27:17 +03:00
Ukri Niemimuukko
b0130e693f more documentation for fractional resources
This adds a section heading, TOC link, command line flag description
and a short explanation of what other dependendent configuration
changes are needed with fractional resources in order for the command
line flag to achieve something useful.

Signed-off-by: Ukri Niemimuukko <ukri.niemimuukko@intel.com>
2021-06-14 16:25:38 +03:00
Ed Bartosh
98f80b5f47
Merge pull request #652 from uniemimu/hookupdate
add link to gpu_nfdhook and update hook README
2021-06-13 12:15:46 +03:00
Eero Tamminen
a2faa3a8fc Add section on GPU plugin options to its README 2021-06-11 19:55:43 +03:00
Ukri Niemimuukko
cbf7bab114 add link to gpu_nfdhook and update hook README
This adds a link from gpu-plugin README to the nfdhook README, and
updates the nfdhook README with label descriptions.

Signed-off-by: Ukri Niemimuukko <ukri.niemimuukko@intel.com>
2021-06-11 18:54:44 +03:00
skaajas
956154c1db
Updated GPU plugin-specific readme general description. 2021-06-11 15:50:14 +03:00
Ukri Niemimuukko
e3bf21dbe9 gpu_plugin: add documentation links to gpu aware scheduling
Signed-off-by: Ukri Niemimuukko <ukri.niemimuukko@intel.com>
2021-06-10 19:46:35 +03:00
Ukri Niemimuukko
2c4d529d66 gpu_plugin: fractional resource management
Fractional resource management feature

Signed-off-by: Ukri Niemimuukko <ukri.niemimuukko@intel.com>
Signed-off-by: Dmitry Rozhkov <dmitry.rozhkov@intel.com>
2021-06-04 13:06:50 +03:00
DougTW
625b30fd1b Fixes 560. Edited gpu_plugin README. Restored master to line 157
Signed-off-by: DougTW <doug.martin@intel.com>
2021-02-09 16:49:30 -08:00
DougTW
28cbebc81b edited gpu_plugin README; changed 2 instances of master to main. Related to PR 499.
Signed-off-by: DougTW <doug.martin@intel.com>
2021-02-08 18:40:47 -08:00
Ed Bartosh
6b208f8acf documentation: remove kubelet configuration check
Removed device plugin socket check from the documentation as
device plugin support is enabled by default in Kubelet.

Signed-off-by: Ed Bartosh <eduard.bartosh@intel.com>
2021-01-12 13:00:20 +02:00
Kevin Putnam
1d149ffee6 Documentation: Fixes broken links and standardizes headers.
Signed-off-by: Kevin Putnam <kevin.putnam@intel.com>
2020-09-22 08:32:21 -07:00
Dmitry Rozhkov
1b82ab9df6 sync README.md files with the current state of the code
Closes #356
2020-09-16 10:54:39 +03:00
Ukri Niemimuukko
7244bd0f25 gpu_plugin: README.md update
Move remark about GVT-d to end of introduction. Remove remarks
about GVT-g for the time being.

Signed-off-by: Ukri Niemimuukko <ukri.niemimuukko@intel.com>
2020-08-25 13:45:10 +03:00
Mikko Ylinen
f145541caf READMEs: use git clone to get the code
go get'ing does not work due to our k8s.io/kubernetes dependency
so guide users to use git clone to get the code.

Fixes: #290

Signed-off-by: Mikko Ylinen <mikko.ylinen@intel.com>
2020-02-20 08:04:07 +02:00
Antti Kervinen
d568f050c5 gpu_plugin: add kustomizations
- Default deployment: `kubectl apply -k deployments/gpu_plugin`
- Default deployment does not specify namespace anymore
  (was: `kube-system`).
- Variant: deploy only on nodes with Intel GPU label by NFD:
  `kubectl apply -k deployments/gpu_plugin/overlays/nfd_labeled_nodes`
- Variant: deploy to `kube-system` instead of user-defined namespace
  (or "default"):
  `kubectl apply -k deployments/gpu_plugin/overlays/namespace_kube-system`
- GPU plugin README updated.

Signed-off-by: Antti Kervinen <antti.kervinen@intel.com>
2020-02-07 14:56:52 +02:00
Graham Whaley
79a86c10e8 docs: gpu: Add more details, re-arrange section order
Re-arrange the section order a little (such as putting the use
of the DaemonSet before the sudo hand-deploy), and add a lot more
detail of what to expect, and how to check if the pod has launched
correctly.

Signed-off-by: Graham Whaley <graham.whaley@intel.com>
2020-01-17 13:34:13 +00:00