CON-11819 spread sync of load balancers #802
Conversation
@@ -85,6 +86,14 @@ func (s *tickerSyncer) Sync(name string, period time.Duration, stopCh <-chan struct{}
 		klog.Errorf("%s failed: %s", name, err)
 	}

+	initialDelayTicker := time.NewTicker(initialDelay)
+	defer initialDelayTicker.Stop()
(nit) could use time.After() (https://pkg.go.dev/time#After) instead of a ticker
Using time.After() instead. Thanks
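A minimal sketch of what the loop could look like with `time.After()` in place of the initial-delay ticker. The `Sync` signature is inferred from the two diff hunks in this PR, and the `tickerSyncer` type and `k8s.io/klog/v2` import path are assumptions for illustration, not the exact upstream code:

```go
package syncer

import (
	"time"

	"k8s.io/klog/v2" // assumed import path for the klog calls shown in the diff
)

// tickerSyncer is a placeholder for the controller's syncer type.
type tickerSyncer struct{}

// Sync waits once for initialDelay, then runs fn every period until
// stopCh closes. time.After fits a one-shot wait better than a Ticker:
// there is nothing to Stop once the delay has elapsed.
func (s *tickerSyncer) Sync(name string, period, initialDelay time.Duration, stopCh <-chan struct{}, fn func() error) {
	select {
	case <-time.After(initialDelay):
	case <-stopCh:
		return
	}

	ticker := time.NewTicker(period)
	defer ticker.Stop()
	for {
		if err := fn(); err != nil {
			klog.Errorf("%s failed: %s", name, err)
		}
		select {
		case <-ticker.C:
		case <-stopCh:
			return
		}
	}
}
```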
@@ -126,7 +135,7 @@ func (r *ResourcesController) Run(stopCh <-chan struct{}) {
 		klog.Info("No cluster ID configured -- skipping cluster dependent syncers.")
 		return
 	}
-	go r.syncer.Sync("tags syncer", controllerSyncTagsPeriod, stopCh, r.syncTags)
+	go r.syncer.Sync("tags syncer", controllerSyncTagsPeriod, time.Second*time.Duration(rand.Int31n(300)), stopCh, r.syncTags)
Now that it's an initial delay, maybe let's increase the spread to ~10 min?
Bumped to 600 (10 minutes).
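Presumably the call site then becomes something like the following (inferred from the reply above; not quoted from the final diff):

```go
go r.syncer.Sync("tags syncer", controllerSyncTagsPeriod,
	time.Second*time.Duration(rand.Int31n(600)), stopCh, r.syncTags)
```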
Spread the sync period of the load balancers unevenly to prevent a spike of load balancer API calls at the same time. We saw an issue where multiple clusters hit their maintenance window at the same time and the CCM component is restarted simultaneously in each of them. The sync tag periods then line up closely across all of those CCM pods, so all of those clusters issue LIST API calls at nearly the same time, producing a spike every 15 minutes. This PR mitigates the issue by adding some randomness (up to 5 minutes) to the sync schedule as an initial delay. The sync period itself remains 15 minutes, but each pod's subsequent tag syncs are offset by 0 to 300 seconds.
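To make the effect concrete, here is a small, self-contained sketch of the schedule a single pod would follow under this scheme (the 15-minute period and 300-second jitter bound come from the description; the review thread later widened the bound to 600 seconds):

```go
package main

import (
	"fmt"
	"math/rand"
	"time"
)

func main() {
	const period = 15 * time.Minute
	// Each CCM pod draws its own jitter once at startup, so pods that
	// restart at the same moment no longer issue LIST calls in lockstep.
	jitter := time.Second * time.Duration(rand.Int31n(300)) // 0–5 min
	for i := 0; i < 3; i++ {
		fmt.Printf("sync %d fires at t = %s\n", i+1, jitter+time.Duration(i)*period)
	}
}
```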