Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

agent loadbalancing doesn't work in MSses #9640

Open
DaanHoogland opened this issue Sep 5, 2024 · 7 comments
Open

agent loadbalancing doesn't work in MSses #9640

DaanHoogland opened this issue Sep 5, 2024 · 7 comments

Comments

@DaanHoogland
Copy link
Contributor

ISSUE TYPE
  • Bug Report
COMPONENT NAME
MS-cluster
CLOUDSTACK VERSION

CONFIGURATION
OS / ENVIRONMENT
SUMMARY

image

STEPS TO REPRODUCE
2 MS
2 hosts+SSVM+CPVM == 4 agents
EXPECTED RESULTS
on 4 agents at most 3 would be on one of the two MSses
ACTUAL RESULTS
all 4 agents on one MS
@weizhouapache
Copy link
Member

what's the value of global setting indirect.agent.lb.algorithm ?

@DaanHoogland
Copy link
Contributor Author

what's the value of global setting indirect.agent.lb.algorithm ?

that is only for indirect agents is it? not for agents in general. It would explain, as it is static and there is only one ip in host but that still would constitude a bug as the other MS could easily take responsibility for the SVMs, if all hosts are with the first MS.

@weizhouapache
Copy link
Member

what's the value of global setting indirect.agent.lb.algorithm ?

that is only for indirect agents is it? not for agents in general. It would explain, as it is static and there is only one ip in host but that still would constitude a bug as the other MS could easily take responsibility for the SVMs, if all hosts are with the first MS.

yep.
You can put all mgmt server IPs in host, change indirect.agent.lb.algorithm to roundrobin or shuffle
then restart mgmt servers (or agents)

@DaanHoogland
Copy link
Contributor Author

what's the value of global setting indirect.agent.lb.algorithm ?

that is only for indirect agents is it? not for agents in general. It would explain, as it is static and there is only one ip in host but that still would constitude a bug as the other MS could easily take responsibility for the SVMs, if all hosts are with the first MS.

yep. You can put all mgmt server IPs in host, change indirect.agent.lb.algorithm to roundrobin or shuffle then restart mgmt servers (or agents)

This will configure the agents but not let the MSses renegotiate their load. These are two different functionalities. I will do some testing with direct attached host agents (vmware or xen) to see what and when this broke.

@weizhouapache
Copy link
Member

what's the value of global setting indirect.agent.lb.algorithm ?

that is only for indirect agents is it? not for agents in general. It would explain, as it is static and there is only one ip in host but that still would constitude a bug as the other MS could easily take responsibility for the SVMs, if all hosts are with the first MS.

yep. You can put all mgmt server IPs in host, change indirect.agent.lb.algorithm to roundrobin or shuffle then restart mgmt servers (or agents)

This will configure the agents but not let the MSses renegotiate their load. These are two different functionalities. I will do some testing with direct attached host agents (vmware or xen) to see what and when this broke.

ok.
I only know the feature introduced by #2469 .
Is there another functionality ? sorry I do not remember it ..

@vishesh92
Copy link
Member

There is also agent.load.threshold global setting which load balances the agents (only hosts [type == Routing]). Check for balancing is done at regular intervals. And once balancing has been triggered, it is not executed again.

This balancing is done per cluster i.e. if migrating all the hosts of the cluster will balance the cluster, then all hosts are migrated to another MS. The balancing happens per cluster probably to support balancing across different hypervisors.

I am not sure about the effort, but we should allow this migration to happen at host level as well depending on the hypervisor.

@DaanHoogland
Copy link
Contributor Author

@vishesh92 , do you think this is a major effort? It seems a bit heavy for a .1 release, agree?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: No status
Development

No branches or pull requests

3 participants