-
Notifications
You must be signed in to change notification settings - Fork 340
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HOSTEDCP-2169: Add aro scheduler #5417
HOSTEDCP-2169: Add aro scheduler #5417
Conversation
@Patryk-Stefanski: This pull request references HOSTEDCP-2169 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.19.0" version, but no target version was set. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
c6cf373
to
93ce409
Compare
f07a91e
to
51f3c10
Compare
8583818
to
0ffcb3a
Compare
/retest-required |
/test e2e-aws |
0ffcb3a
to
31c0214
Compare
31c0214
to
0fbc2c3
Compare
can you please include in the PR desc or godoc the purpose and outcome of this new controller? e.g. |
/test e2e-aks-4-18 |
@Patryk-Stefanski: This pull request references HOSTEDCP-2169 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.19.0" version, but no target version was set. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
0fbc2c3
to
5b6a739
Compare
/lgtm |
/retest-required |
"sigs.k8s.io/controller-runtime/pkg/reconcile" | ||
) | ||
|
||
func TestAzureSchedulerReconcile(t *testing.T) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: this is within the scheduler/azure, so TestReconcile seems should be sufficient
sizingConfig: sizingConfig, | ||
expectError: false, | ||
expectRequeue: false, | ||
expectAnnotations: map[string]string{ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
any reason to not validate the rest of the effects?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ill add test cases for the rest of the effects to TestSetHostedClusterSchedulingAnnotations
can we can an e2e similar to TestCreateClusterRequestServingIsolation? |
lgtm other than #5417 (comment) |
Yep I was planning on creating a follow up task to e2e this. |
5b6a739
to
f7764f5
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just a nit on the doc, otherwise lgtm
|
||
- AKS cluster with cluster-autoscaler enabled and using Standard\_D4s\_v4 VMs for this example. (--enable-cluster-autoscaler flag when installing AKS cluster, with --min-count 2 --max-count 6) | ||
- Hypershift operator with size tagging enabled. (--enable-size-tagging flag when installing hypershift operator) | ||
- HostedCluster with the `ClusterSizingConfiguration` resource created. (Default resource is created by the hypershift operator) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is a little confusing. What is a HostedCluster with ClusterSizingConfiguration?
I would call them out as separate prereqs.
It would also be good to specify the name of the HostedCluster and NodePool that you're using throughout your example.
increase: 0s | ||
``` | ||
|
||
3. Scale nodepool up to 3 nodes: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Before this there is no mention of nodepool pstefans-3
f7764f5
to
f343ade
Compare
/approve |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: csrwng, Patryk-Stefanski The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
@Patryk-Stefanski: The following test failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
/lgtm |
55c2de5
into
openshift:main
[ART PR BUILD NOTIFIER] Distgit: hypershift |
What this PR does / why we need it:
The reason for implementing this scheduler is to help Azure Red Hat OpenShift (ARO) scale to 500 nodes. With this scheduler, ARO can dynamically control the CPU and memory requests for all pods and the Kube API Server (KAS) GoMem limits based on the number of nodes a
HostedCluster
has. This capability is essential for efficiently scaling aHostedCluster
to 500 nodes.Which issue(s) this PR fixes (optional, use
fixes #<issue_number>(, fixes #<issue_number>, ...)
format, where issue_number might be a GitHub issue, or a Jira story:Fixes # HOSTEDCP-2169
Checklist