New feature: Reconcile commands (CopyCommand, MigrateCommand, MigrateVolumeCommand) #10514
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## main #10514 +/- ##
============================================
- Coverage 16.17% 16.15% -0.02%
- Complexity 13291 13326 +35
============================================
Files 5668 5687 +19
Lines 498179 499972 +1793
Branches 60290 60562 +272
============================================
+ Hits 80581 80791 +210
- Misses 408578 410137 +1559
- Partials 9020 9044 +24
5dcdcb8 to f9bba6e
@blueorangutan package
@weizhouapache a [SL] Jenkins job has been kicked to build packages. It will be bundled with KVM, XenServer and VMware SystemVM templates. I'll keep you posted as I make progress.
Packaging result [SF]: ✔️ el8 ✔️ el9 ✔️ debian ✔️ suse15. SL-JID 12678
@blueorangutan test matrix
@weizhouapache a [SL] Trillian-Jenkins matrix job (EL8 mgmt + EL8 KVM, Ubuntu22 mgmt + Ubuntu22 KVM, EL8 mgmt + VMware 7.0u3, EL9 mgmt + XCP-ng 8.2) has been kicked to run smoke tests
[SF] Trillian Build Failed (tid-12593)
[SF] Trillian Build Failed (tid-12594)
[SF] Trillian test result (tid-12592)
[SF] Trillian test result (tid-12595)
Description
This PR improves how some agent commands and their answers are processed.
Current process
Many CloudStack operations require communication between the management server and the CloudStack agent.
The normal process is:
management server --> sends commands to agents --> agents process the commands -->
agents send the answers to the management server --> management server processes the answers
Each operation may involve one or more of these round trips.
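The round trip described above can be sketched roughly as follows. This is only an illustration: `SketchCommand`, `SketchAnswer`, `SketchAgent` and `SketchManagementServer` are hypothetical stand-ins invented for this example, not CloudStack's real classes, and the agent is called in-process rather than over the network. The command names are taken from the PR title.

```java
import java.util.ArrayList;
import java.util.List;

public class RoundTripSketch {

    /** Hypothetical command sent by the management server (not a CloudStack class). */
    static final class SketchCommand {
        final long sequence;
        final String name;

        SketchCommand(long sequence, String name) {
            this.sequence = sequence;
            this.name = name;
        }
    }

    /** Hypothetical answer returned by the agent (not a CloudStack class). */
    static final class SketchAnswer {
        final long sequence;
        final boolean success;
        final String details;

        SketchAnswer(long sequence, boolean success, String details) {
            this.sequence = sequence;
            this.success = success;
            this.details = details;
        }
    }

    /** Agent side: processes a command and produces the matching answer. */
    static final class SketchAgent {
        SketchAnswer process(SketchCommand command) {
            return new SketchAnswer(command.sequence, true, "processed " + command.name);
        }
    }

    /** Management-server side: sends a command, then processes the answer. */
    static final class SketchManagementServer {
        private final SketchAgent agent;
        private final List<String> events = new ArrayList<>();
        private long nextSequence = 1;

        SketchManagementServer(SketchAgent agent) {
            this.agent = agent;
        }

        SketchAnswer send(String commandName) {
            SketchCommand command = new SketchCommand(nextSequence++, commandName);
            events.add("sent " + command.name + " #" + command.sequence);
            // The agent processes the command and sends back its answer.
            SketchAnswer answer = agent.process(command);
            // The management server processes the answer (here it only records it).
            events.add("answer #" + answer.sequence + " success=" + answer.success);
            return answer;
        }

        List<String> events() {
            return events;
        }
    }

    public static void main(String[] args) {
        SketchManagementServer server = new SketchManagementServer(new SketchAgent());
        // One operation may need several round trips.
        server.send("CopyCommand");
        server.send("MigrateCommand");
        server.events().forEach(System.out::println);
    }
}
```

In this sketch the management server always receives an answer because the call is synchronous and local. In a real deployment the two sides are separate processes, so either one can stop or restart between the send step and the answer step.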
Issues in some scenarios
This process normally works fine, but it has issues in some scenarios.
Consider the following examples:
Operations to address
This FR focuses on the following operations
The backend processes can be found at
https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=337678693#AsyncAgentCommandReconciliation-4.1BackendcommandsofVMandvolumemigrations
Main changes
Design doc: https://cwiki.apache.org/confluence/display/CLOUDSTACK/Async+Agent+Command+Reconciliation
Global settings
New terminology: Reconcile commands
How it works
For reconcile commands, during stop/start of the management server and agent
Improvement on the management server when waiting for the answer to reconcile commands
Improvement on VM migration with/without volumes
Fixes after volume migration
Improvement on Agent
Test results
This has been tested by the developer on NFS and PowerFlex.
Refer to https://cwiki.apache.org/confluence/display/CLOUDSTACK/Async+Agent+Command+Reconciliation#AsyncAgentCommandReconciliation-4.3Summaryoftestresults