
Configure capacity of the worker nodes #877

Open
palade opened this issue Sep 27, 2019 · 19 comments
Labels
kind/design: Categorizes issue or PR as related to design.
kind/feature: Categorizes issue or PR as related to a new feature.
lifecycle/frozen: Indicates that an issue or PR should not be auto-closed due to staleness.
priority/backlog: Higher priority than priority/awaiting-more-evidence.

Comments

@palade

palade commented Sep 27, 2019

Would it be possible to set the capacity of the worker nodes when the cluster is created?

@palade palade added the kind/support Categorizes issue or PR as a support question. label Sep 27, 2019
@aojea
Contributor

aojea commented Sep 27, 2019

can you elaborate a bit more?
what's your use case?

@palade
Author

palade commented Sep 27, 2019

@aojea Doing some scheduler work and would like to consider the CPU and memory capacities of each node. I could use labels for this, but I was wondering if it is possible to do this when the cluster is set up? Also, if labels are the only option, would it be possible to tag each node with particular labels from the initialisation script?

@aojea
Contributor

aojea commented Sep 27, 2019

Well, that seems interesting. @BenTheElder what do you think?
Basically the worker nodes are docker containers, so we should be able to use docker resource constraints to limit them: https://docs.docker.com/config/containers/resource_constraints/
However, I don't know how this will work with nested cgroups 🤔
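For reference, a minimal sketch of the docker-level constraints being discussed, using flags from the linked docs. The container name kind-worker is kind's default for a single worker; whether the kubelet inside the node actually respects these limits is the open question:

# Sketch: apply memory/CPU limits to an existing kind node container.
docker update --memory 100m --memory-swap 100m --cpus 1 kind-worker

# Check what docker thinks the limits are.
docker inspect -f '{{.HostConfig.Memory}} {{.HostConfig.NanoCpus}}' kind-worker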

@WalkerGriggs
Contributor

I don't know how this will work with nested cgroups

I might be wrong, but I don't think setting resource upper bounds will impact the current cgroup architecture. I do see performance issues with starving the node of resources, though.

I'm thinking about the UX side of things too; Docker resource constraints are pretty granular. Maybe we only expose some subset of the constraints, or maybe abstract them all together?

@BenTheElder
Member

Feel free to try this out but IIRC this doesn't work.

Similarly, if swap is enabled on the host, memory limits won't work on your pods either.

@BenTheElder
Member

I'm working on decoupling us from docker's command line. When that is complete and we experiment again with support for ignite and other backends, some of those can actually limit things, because while they are based around running container images they use VMs :+)

@aojea
Contributor

aojea commented Oct 1, 2019

docker resource constraints are working for me with swap, I'll send a PR implementing it.
I have one node limited to 100M in this example:

[screenshot showing one node limited to 100M]

@aojea
Contributor

aojea commented Oct 1, 2019

/assign

@BenTheElder
Member

docker resource constraints are working for me with swap, I'll send a PR implementing it
I have one node limited to 100M in this example

That of course works but ... does it actually limit everything on the node? Have you deployed a pod trying to use more? What does kubelet report?

@aojea
Contributor

aojea commented Oct 1, 2019

kind: Cluster
apiVersion: kind.sigs.k8s.io/v1alpha3
nodes:
# the control plane node
- role: control-plane
- role: worker
  constraints:
    memory: "100m"
    cpu: "1"

from https://kubernetes.io/docs/tasks/configure-pod-container/assign-memory-resource/#specify-a-memory-request-and-a-memory-limit

I modified it to use directly and try to use 1.5G of memory:

apiVersion: v1
kind: Pod
metadata:
  name: memory-demo
  namespace: mem-example
spec:
  containers:
  - name: memory-demo-ctr
    image: polinux/stress
    command: ["stress"]
    args: ["--vm", "1", "--vm-bytes", "1500M", "--vm-hang", "1"]

The pod takes more than 4 mins to be created. It doesn't seem to be a hard limit, maybe we should tweak something on cgroups, but checking inside the node it really does seem to be limiting the memory:

Tasks:  19 total,   1 running,  18 sleeping,   0 stopped,   0 zombie
%Cpu(s):  0.5 us,  2.5 sy,  0.0 ni, 16.7 id, 80.3 wa,  0.0 hi,  0.0 si,  0.0 st
MiB Mem :  32147.3 total,  16816.6 free,   1885.6 used,  13445.2 buff/cache
MiB Swap:   2055.0 total,    901.4 free,   1153.6 used.  29866.1 avail Mem

USER      PR  NI    VIRT    RES    SHR S  %CPU  %MEM     TIME+ COMMAND                
root      20   0  140504   4916      0 S   4.3   0.0   1:16.80 kube-proxy
root      20   0  130236   1720      0 D   3.7   0.0   0:30.99 kindnetd
root      20   0 2214724  70912  60684 S   3.3   0.2   0:37.25 kubelet
root      20   0 1587948  37516     24 D   3.0   0.1   0:36.98 stress
root      20   0 2210024  30812  23940 S   2.7   0.1   0:34.11 containerd
root      20   0    9336   4180   4180 S   1.3   0.0   0:01.93 containerd-shim
root      20   0   10744   4180   4180 S   0.7   0.0   0:01.70 containerd-shim
root      19  -1   22656   6684   6508 S   0.3   0.0   0:01.78 systemd-journal
root      20   0    6024   2756   2648 R   0.3   0.0   0:00.11 top                    
root      20   0   17524   7688   7688 S   0.0   0.0   0:00.53 systemd
root      20   0   10744   4180   4180 S   0.0   0.0   0:02.67 containerd-shim
root      20   0    1024      0      0 S   0.0   0.0   0:00.00 pause
root      20   0    9336   4180   4180 S   0.0   0.0   0:02.23 containerd-shim
root      20   0    1024      0      0 S   0.0   0.0   0:00.00 pause
root      20   0   10744   4608   4564 S   0.0   0.0   0:00.81 containerd-shim
root      20   0    1024      0      0 S   0.0   0.0   0:00.00 pause
root      20   0   10744   3980   3980 S   0.0   0.0   0:00.91 containerd-shim
root      20   0     744      0      0 S   0.0   0.0   0:00.06 stress
root      20   0    4052   2936   2936 S   0.0   0.0   0:00.05 bash
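One way to double-check the limit from inside the node (a sketch; the first path assumes cgroup v1, which is what most hosts ran at the time, the second is the cgroup v2 equivalent):

# cgroup v1: memory limit of the node container, in bytes.
docker exec kind-worker cat /sys/fs/cgroup/memory/memory.limit_in_bytes

# cgroup v2 hosts expose the same limit here instead.
docker exec kind-worker cat /sys/fs/cgroup/memory.max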

@aojea
Contributor

aojea commented Oct 1, 2019

Looking at the kernel docs, it seems that this is throttling: https://www.kernel.org/doc/Documentation/cgroup-v1/blkio-controller.txt. Check the block I/O stats:

CONTAINER ID        NAME                 CPU %               MEM USAGE / LIMIT     MEM %               NET I/O             BLOCK I/O           PIDS
1698a9d1be92        kind-worker          14.64%              99.42MiB / 100MiB     99.42%              4.34MB / 361kB      1.91GB / 1.04GB     155                         
1a1a6fb0f69a        kind-control-plane   6.75%               1.268GiB / 31.39GiB   4.04%               512kB / 2.03MB      0B / 81.7MB         392       

Do we want this, or is the idea to fail if it overcommits?
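For reference, the table above looks like docker stats output; something along these lines should reproduce it:

# One-shot snapshot of the node containers' resource usage.
docker stats --no-stream kind-worker kind-control-plane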

@BenTheElder BenTheElder added kind/design Categorizes issue or PR as related to design. kind/feature Categorizes issue or PR as related to a new feature. labels Oct 2, 2019
@BenTheElder BenTheElder added the priority/backlog Higher priority than priority/awaiting-more-evidence. label Nov 5, 2019
@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 3, 2020
@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Mar 4, 2020
@BenTheElder BenTheElder added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. labels Mar 15, 2020
@kubernetes-sigs kubernetes-sigs deleted a comment from fejta-bot Mar 15, 2020
@kubernetes-sigs kubernetes-sigs deleted a comment from fejta-bot Mar 15, 2020
@aojea
Contributor

aojea commented Sep 9, 2020

I think that there are several options:

  • use a provider that uses VMs for the nodes
  • implement something like lxcfs to "fake" the resources and cheat cadvisor and the kubelet

Otherwise you can set the limit manually as explained here:
#1524

Using container constraints (cgroups) is only valid for limiting the resources, but the kubelet keeps using the whole host memory and CPU resources for its calculations.
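A rough sketch of the manual approach, which may differ in detail from what #1524 describes: reserve most of the host's resources through the kubelet's system-reserved flag so that the node's allocatable capacity looks small (the reserved amounts below are made up):

kind: Cluster
apiVersion: kind.x-k8s.io/v1alpha4
nodes:
- role: control-plane
- role: worker
  kubeadmConfigPatches:
  - |
    kind: JoinConfiguration
    nodeRegistration:
      kubeletExtraArgs:
        # Hide most of the host's resources; only what's left becomes allocatable.
        system-reserved: memory=28Gi,cpu=14

Note this only changes what the scheduler sees as allocatable; it does not stop processes on the node from actually using more.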

@louiznk

louiznk commented Oct 14, 2020

using container constraints (cgroups) is only valid for limiting the resources, but kubelet keeps using the whole host memory and cpu resources for its calculations.

Hello @aojea ,
This PR on cAdvisor addresses this point.
I hope this will help.
Thanks

@aojea
Contributor

aojea commented Oct 14, 2020

using container constraints (cgroups) is only valid for limiting the resources, but kubelet keeps using the whole host memory and cpu resources for its calculations.

Hello @aojea ,
This PR on cAdvisor addresses this point.
I hope this will help.
Thanks

that sounds nice, do you think it has chances to be approved?

@louiznk

louiznk commented Oct 14, 2020

using container constraints (cgroups) is only valid for limiting the resources, but kubelet keeps using the whole host memory and cpu resources for its calculations.

Hello @aojea ,
This PR on cAdvisor addresses this point.
I hope this will help.
Thanks

that sounds nice, do you think it has chances to be approved?

I hope 🤷🏻‍♂️

@BenTheElder
Member

Sadly no re: cAdvisor. This doesn't leave us with spectacular options. Maybe we can trick the kubelet into reading our own "vfs" or something (like lxcfs?) 😬, semi related: #2318's solution.
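For anyone curious about the lxcfs idea: lxcfs serves per-cgroup views of files like /proc/meminfo, and the trick would be to bind-mount those over the node's own /proc files so cadvisor/kubelet read the constrained numbers. A sketch only, untested with kind; the host paths assume lxcfs is already running as lxcfs /var/lib/lxcfs:

kind: Cluster
apiVersion: kind.x-k8s.io/v1alpha4
nodes:
- role: control-plane
- role: worker
  extraMounts:
  # Overlay lxcfs's cgroup-aware proc files onto the node; whether the
  # runtime mount ordering allows this in practice is unverified.
  - hostPath: /var/lib/lxcfs/proc/meminfo
    containerPath: /proc/meminfo
  - hostPath: /var/lib/lxcfs/proc/cpuinfo
    containerPath: /proc/cpuinfo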

@LambertZhaglog

Doing some scheduler work and would like to consider the CPU and memory capacities of each node. I could use labels for this...

@palade Did you mean we can limit a node's CPU and memory capacities provided to the kubernetes cluster by assigning some labels to the node? Which labels do you use? Can you give me an example? Thanks a lot.
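Not necessarily what @palade did, but as an illustration of the labels idea: one can tag nodes with made-up capacity labels after the cluster comes up and have a custom scheduler read them. The kubelet still reports the real capacity, so only something that consumes these labels would honor them (label keys/values here are hypothetical):

# Advertise pretend capacities as labels on a kind worker node.
kubectl label node kind-worker example.io/cpu-capacity=2 example.io/memory-capacity=4Gi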

@hwdef
Member

hwdef commented Oct 20, 2023

Any progress? Will we still be able to do this?

@BenTheElder
Member

kubernetes/kubernetes#120832
