Our team got hit by #658 today. The proposal in #659 would help a lot.
Additionally, I think that the controller should fail fast(er).
We use helm to install the controller, with the atomic: true option set; the rationale is that if the pods can't become ready, helm rolls back to the previous release.
Currently, the controller becomes ready, but then fails after a couple of minutes and goes into CrashLoopBackOff.
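Having the controller check its prerequisites before reporting ready would prevent this behavior: the pods would never become ready, so helm would roll back to the previous release instead of leaving a crash-looping controller in place.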