
[CA] Pod pending with Custom Scheduler request for scaling node #506

Closed
suker200 opened this issue Dec 19, 2017 · 4 comments
@suker200 commented Dec 19, 2017

I have written a custom scheduler that places pods based on node workload.

Repo reference

The `--verify-unschedulable-pods=false` option was removed in #189, which means CA will not scale up when pods are handled by a custom scheduler.

IMG reference

Is there any way to trigger CA (an API, etc.)?

@MaciekPytel (Contributor) commented

Currently CA doesn't support custom schedulers, scheduler extenders, or any other custom way of scheduling that is not done via scheduler predicates in the main kubernetes repository. Setting --verify-unschedulable-pods=false (back when the flag existed) did not make it work for custom schedulers. All it achieved was making CA add a completely random node (or a few), with no guarantee whatsoever that it would help the pending pod.

On the most basic level, CA works by simulating how the scheduler would behave if the cluster looked different. This assumes that the scheduler code baked into CA is the same as the code used by the actual scheduler.

Depending on what exactly your scheduler does, you may be able to get it to work with CA by modifying https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/simulator/predicates.go (perhaps by implementing and injecting your own predicate?) and building your own CA image.
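To illustrate the shape of such an injected predicate, here is a minimal self-contained Go sketch. The type names and the `workload-class` label are purely illustrative assumptions; the real predicates in predicates.go use the scheduler's own pod/node types from the kubernetes repo, not these stand-ins.

```go
package main

import "fmt"

// Illustrative stand-ins for the scheduler's pod and node-info types;
// the real predicate signature in predicates.go uses types from
// the kubernetes scheduler packages.
type Pod struct {
	Name   string
	Labels map[string]string
}

type NodeInfo struct {
	Name   string
	Labels map[string]string
}

// customPredicate is a hypothetical predicate: a pod requesting a
// "workload-class" only fits nodes carrying the matching label.
// Returning false tells the simulation the pod would stay pending
// on that (possibly not-yet-existing) node.
func customPredicate(pod *Pod, node *NodeInfo) (bool, error) {
	want, ok := pod.Labels["workload-class"]
	if !ok {
		return true, nil // pod has no constraint; any node fits
	}
	return node.Labels["workload-class"] == want, nil
}

func main() {
	pod := &Pod{Name: "web-1", Labels: map[string]string{"workload-class": "cpu"}}
	cpuNode := &NodeInfo{Name: "node-a", Labels: map[string]string{"workload-class": "cpu"}}
	gpuNode := &NodeInfo{Name: "node-b", Labels: map[string]string{"workload-class": "gpu"}}

	fits, _ := customPredicate(pod, cpuNode)
	fmt.Println(fits) // true
	fits, _ = customPredicate(pod, gpuNode)
	fmt.Println(fits) // false
}
```

The key point is that CA evaluates predicates like this against simulated nodes, so a custom predicate compiled into the binary influences scale-up decisions without any external calls.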

@suker200 (Author) commented

Hi, thanks for your comment. I will give it a try.

Are there any plans to support custom schedulers as a feature?

@MaciekPytel (Contributor) commented

Not really, at least not in the near future. There are two main issues we would have to solve:

  1. The scheduler works in a real cluster. You don't pass the state of the cluster to it; it just takes it from an informer (more generally: the apiserver). That's not true for our use case: we need to know what it would do in a simulated cluster, with some nodes added or removed and some pods shuffled around. And we need to run against different simulations all the time, so it must be stateless.
  2. Performance. We need to do much, much more scheduling than the scheduler does. We only run scheduler predicates, not priorities, and we even have a bunch of custom optimizations for predicates (for example: custom handling for the MatchInterPodAffinity predicate, #257). The main issue here goes back to point 1: how would we pass the complete cluster state for all of those predicates we need to run? Serializing the complete cluster state and sending it to a separate process for each query doesn't seem feasible. Unless we can figure out a way around this, we won't be able to support anything outside of the CA binary.
     Perhaps some sort of Go interface that could be implemented by users and compiled into CA could be an option? Either way, it's not something we're working on right now.
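The compiled-in interface idea above could be sketched as follows. Everything here is speculative: none of these names (`SchedulerPlugin`, `ClusterSnapshot`, `FitsAnyNode`) exist in the CA codebase; the sketch only shows how an in-process plugin would keep all predicate queries inside the binary, avoiding the serialization cost described in point 2.

```go
package main

import "fmt"

// Hypothetical stand-ins for the cluster state CA simulates against.
type Pod struct{ Name string }
type Node struct{ Name string }

// ClusterSnapshot represents a simulated cluster, which may contain
// nodes that do not exist yet (candidate scale-up nodes).
type ClusterSnapshot struct {
	Nodes []Node
}

// SchedulerPlugin is the kind of interface speculated about above:
// users would implement it and compile it into the CA binary, so every
// query is an in-process call with direct access to the snapshot.
type SchedulerPlugin interface {
	// FitsAnyNode reports whether the pod could schedule anywhere
	// in the simulated snapshot.
	FitsAnyNode(pod Pod, snapshot ClusterSnapshot) bool
}

// naivePlugin is a trivial implementation: a pod fits as long as the
// simulated cluster has at least one node.
type naivePlugin struct{}

func (naivePlugin) FitsAnyNode(pod Pod, snapshot ClusterSnapshot) bool {
	return len(snapshot.Nodes) > 0
}

func main() {
	var p SchedulerPlugin = naivePlugin{}
	empty := ClusterSnapshot{}
	grown := ClusterSnapshot{Nodes: []Node{{Name: "simulated-node-1"}}}
	fmt.Println(p.FitsAnyNode(Pod{Name: "web-1"}, empty)) // false
	fmt.Println(p.FitsAnyNode(Pod{Name: "web-1"}, grown)) // true
}
```

Because the plugin receives the snapshot as an argument rather than reading it from the apiserver, it stays stateless, addressing the requirement in point 1.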

@suker200 (Author) commented

Thanks, I will close this issue and try your suggestion. I will report back if there is any news.

yaroslava-serdiuk pushed a commit to yaroslava-serdiuk/autoscaler that referenced this issue Feb 22, 2024
Define priority by value in testing wrappers