-
Hi @tonypg39 When time-slicing is enabled, each physical GPU in the system is exposed as the configured number of replicas. From the perspective of the kubelet these replicas are independent resources and are allocated as such. Each replica is, however, associated with a specific GPU id (or uuid), and this mapping is handled by the device plugin when it updates the container create response for an allocated pod.

There are some things to keep in mind here. The most important is that the GPU is shared using CUDA time-slicing, meaning that as more applications are launched, each application gets a smaller share of the GPU. There is also a danger of memory oversubscription, since no limits are placed on how much memory an application can allocate. For more details see https://developer.nvidia.com/blog/improving-gpu-utilization-in-kubernetes/

With these taken into consideration, and assuming "well-behaved" applications, you should be able to set the number of replicas in your config so that …
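For reference, a time-slicing section along these lines in the device plugin's config enables this behavior (the replica count of 4 is just an illustrative value; pick one that matches your expected pod count):

```yaml
# Sketch of a time-slicing config for the NVIDIA k8s-device-plugin.
# Each physical GPU is advertised as 4 nvidia.com/gpu resources.
version: v1
sharing:
  timeSlicing:
    resources:
      - name: nvidia.com/gpu
        replicas: 4
```

With this applied, a node with one GPU reports `nvidia.com/gpu: 4`, and up to four pods each requesting one `nvidia.com/gpu` can be scheduled onto that single device.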
-
Hello,
I had a question: if I use the time-slicing option, do the credits assigned to a pod have a particular GPU id associated with them? Or does a credit simply mean access to the whole set of GPUs in the node, so that I would just set the number of replicas to the expected number of pods on the server?
Many thanks for any help,
Toony