-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DRS improvements #10397
Comments
Thanks, @DaanHoogland, for opening this issue and outlining the scope! I agree that focusing on actual metrics instead of allocated metrics is a solid first step. It will provide a more accurate understanding of system load and help improve DRS efficiency. I'll start working on refining this further and share updates soon. Also, incorporating historical data for VM migration planning sounds like a great enhancement—I'll explore potential approaches for this. Looking forward to your feedback as I progress! |
Hi @DaanHoogland, I have reviewed the current DRS implementation and would like to propose a few improvements based on our discussions: 1️⃣ Enhanced Load Distribution Implement zone-wide DRS balancing, ensuring that loads are distributed not just within a cluster but across multiple zones when necessary. Instead of using only allocated metrics, we can introduce time-based usage trends (e.g., CPU/memory/disk utilization over the last X hours/days). Move from allocated resource metrics to real usage statistics for migration and load balancing. Thanks and regards, |
@iishitahere , I discussed with a colleague and we think this will be a six month project. This is fine in it self. But to reduce the risk xould you define phases, here? Or do you think creating sub issues for the three items you mentioned is a good idea? These can be addressed as separate projects don't you think? |
@DaanHoogland, that makes sense. To manage risk effectively, I propose breaking this into phases: 1️⃣ Phase 1 – Metrics-Driven Decision Making (real-time tracking & threshold-based alerts). Alternatively, we can create sub-issues for each, treating them as separate projects. Let me know your preference. Best, |
The required feature described as a wish
As a Operator I would like to have the loads on my systems more evenly/centrally distributed. At the moment there is a simple DRS for clusterwide distribution of loads, this is however not applying zone wide distribution or based on automated queries/improvements.
In addition we should add historic data for the VM in planning possible migrations.
At the moment allocated metrics are used. An first improvement would be to use actual metrics.
The text was updated successfully, but these errors were encountered: