XAPI throttling proof of concept #6778
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: feature/throttling
Conversation
The token bucket library implements the token bucket algorithm, to be used for rate-limiting. This commit implements basic token buckets, which contain tokens that are refilled over time according to their refill parameter, up to a maximum determined by the burst parameter. Tokens can be consumed in a thread-safe way - consuming returns false when there are not enough tokens available, and true when the operation was successful. Signed-off-by: Christian Pardillo Laursen <[email protected]>
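A minimal sketch of the scheme this commit describes, under assumed names (create, consume) and a float-seconds clock rather than the PR's actual API. The real implementation guards consumption with a mutex for thread safety; that is omitted here for brevity.

```ocaml
type t = {
    burst_size: float           (* maximum tokens the bucket can hold *)
  ; fill_rate: float            (* tokens added per second *)
  ; mutable tokens: float       (* tokens currently available *)
  ; mutable last_refill: float  (* time of the last refill, in seconds *)
}

let create ~burst_size ~fill_rate now =
  {burst_size; fill_rate; tokens= burst_size; last_refill= now}

(* Top up the bucket according to the time elapsed, capped at burst_size. *)
let refill now tb =
  let elapsed = now -. tb.last_refill in
  tb.tokens <- Float.min tb.burst_size (tb.tokens +. (elapsed *. tb.fill_rate)) ;
  tb.last_refill <- now

(* Deduct [amount] tokens and return true if enough were available,
   otherwise leave the token count unchanged and return false. *)
let consume ~now amount tb =
  refill now tb ;
  if tb.tokens >= amount then (
    tb.tokens <- tb.tokens -. amount ;
    true
  ) else
    false
```

A client that exhausts its burst is refused until enough time has passed for the bucket to refill.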
Bucket tables map client identifiers to their token buckets, and are the main data structure for rate limiting. Signed-off-by: Christian Pardillo Laursen <[email protected]>
To be replaced with a proper datamodel. Bucket tables are used for mapping requests to their respective token bucket so that they can be rate limited. Signed-off-by: Christian Pardillo Laursen <[email protected]>
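The lookup described above can be sketched as follows; the function name and the use of a plain Hashtbl keyed by a string client identifier are assumptions for illustration, with the bucket type left abstract.

```ocaml
(* Map a client identifier to its rate-limiting state, creating the state on
   first sight via [make]. *)
let find_or_create (tbl : (string, 'a) Hashtbl.t) client ~(make : unit -> 'a) =
  match Hashtbl.find_opt tbl client with
  | Some bucket ->
      bucket
  | None ->
      let bucket = make () in
      (* replace rather than add: avoids stacking duplicate bindings
         for the same client *)
      Hashtbl.replace tbl client bucket ;
      bucket
```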
It's a bit tough to understand how the data structure does rate-limiting. I had to find documentation online to understand it. As such, I would appreciate a more in-depth explanation in the header of the .mli file. For example, it would be good to explain when the "refill" timestamp changes, and to note that some of the methods are only meant to be used for testing.
    (** Create token bucket with given parameters and supplied initial timestamp
        @param timestamp Initial timestamp
        @param burst_size Maximum number of tokens that can fit in the bucket
        @param fill_rate Number of tokens added to the bucket per second
fill_rate has an implicit unit (Hz); I think it would be worth representing the fill rate as the amount of time it takes the bucket to go from empty to full. This is a timespan, and we can use a known datatype with an explicit unit here: Mtime.span.
How some operations would change:

    let peek_with_delta time_delta tb =
      let fill_time = Mtime.Span.to_float_ns tb.fill_time in
      let time_delta = Mtime.Span.to_float_ns time_delta in
      min tb.burst_size (tb.tokens +. (time_delta /. fill_time))

    let delay_until_available_with_delta delta tb amount =
      let current_tokens = peek_with_delta delta tb in
      let required_tokens = max 0. (amount -. current_tokens) in
      required_tokens *. Mtime.Span.to_float_ns tb.fill_time
      |> Float.to_int64
      |> Mtime.Span.of_uint64_ns
I can see the appeal of adding units to the fill_rate but I'd rather keep the burst size decoupled from fill rate, and I think it's more intuitive as tokens per second than seconds per token.
How can this fail, i.e., return None?
I think it can only fail in some extreme cases, if you have an arithmetic overflow, although that would require an Mtime span of roughly 584 years.
      burst_size: float
    ; fill_rate: float
    ; mutable tokens: float
    ; mutable last_refill: Mtime.span
Is there any reason this cannot be a counter? They return the time difference directly: https://ocaml.org/p/mtime/latest/doc/mtime.clock/Mtime_clock/index.html#counters
This would mean that the peek functions would turn into:
    let peek_with_delta time_delta tb =
      let time_delta_seconds = Mtime.Span.to_float_ns time_delta *. 1e-9 in
      min tb.burst_size (tb.tokens +. (time_delta_seconds *. tb.fill_rate))

    let peek tb = peek_with_delta (Mtime_clock.count tb.last_refill) tb
Keeping a counter does simplify things a bit, but if I understand correctly it forces us to make two system-time calls when consuming tokens: one to obtain the difference from the counter, and another to produce a new counter. It probably won't make much of a difference; I can try profiling.
Signed-off-by: Christian Pardillo Laursen <[email protected]>
Added some documentation to the token bucket module to explain the rate limit application.
Zero or negative rate limits can cause issues in the behaviour of rate limiting. In particular, zero fill rate leads to a division by zero in time calculations. Rather than account for this, we forbid the creation of token buckets with a bad fill rate by returning None. Signed-off-by: Christian Pardillo Laursen <[email protected]>
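The validated construction described in this commit can be sketched as below; the record fields and the rejection of non-positive burst sizes alongside non-positive fill rates are illustrative assumptions, not necessarily the PR's exact checks.

```ocaml
type t = {burst_size: float; fill_rate: float}

(* A zero or negative fill rate would divide by zero (or count backwards) in
   the delay computation, so creation refuses such parameters with None. *)
let create ~burst_size ~fill_rate =
  if fill_rate <= 0. || burst_size <= 0. then
    None
  else
    Some {burst_size; fill_rate}
```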
Make token bucket type abstract to hide Hashtbl.t Use `replace` rather than `add` for adding a new bucket Signed-off-by: Christian Pardillo Laursen <[email protected]>
The current implementation of rate limiting had severe fairness issues. These have been resolved through the addition of a request queue, to which rate limited requests are added. A worker thread sleeps until its associated token bucket has enough tokens to handle the request at the head of the queue, calls it, and sleeps until the next request is ready. Signed-off-by: Christian Pardillo Laursen <[email protected]>
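The worker-thread scheme this commit describes might look roughly like the sketch below. The helper names (sleep, delay_until_available, consume) and the single-shot loop are assumptions for illustration; the PR's worker presumably blocks indefinitely on an empty queue rather than exiting.

```ocaml
(* Drain a FIFO of (cost, handler) requests in strict arrival order: the
   worker sleeps until the token bucket can cover the head request, consumes
   its cost, then runs it. FIFO order is what restores fairness. *)
let worker_loop ~(sleep : float -> unit)
    ~(delay_until_available : float -> float) ~(consume : float -> unit)
    (queue : (float * (unit -> unit)) Queue.t) =
  while not (Queue.is_empty queue) do
    let cost, handle = Queue.pop queue in
    let delay = delay_until_available cost in
    if delay > 0. then sleep delay ;
    consume cost ;
    handle ()
  done
```

Because only the head of the queue is ever served, a burst from one client can no longer starve requests that arrived earlier.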
     * GNU Lesser General Public License for more details.
     *)

    type rate_limit_data = {
Could use a simpler name like bucket, consumer or similar.
Creating a token bucket fails if the rate limit supplied is 0 or negative - this can lead to unexpected and undesirable behaviour, such as division by 0 or negative token counts. Signed-off-by: Christian Pardillo Laursen <[email protected]>
        "An identifier for the rate limited client" ~ignore_foreign_key:true
        ~default_value:(Some (VString ""))
    ; field ~qualifier:StaticRO ~ty:Float ~lifecycle "burst_size"
        "Amount of tokens that can be consumed in one burst"
I don't think the idl should mention tokens or buckets at all; instead I would try to communicate the meaning of the parameters in a way that allows users to form a mental model of how rate limiting works:
    - "Amount of tokens that can be consumed in one burst"
    + "Amount of RPC calls that the client can do in burst"
I agree, we shouldn't talk about token buckets and I'll change that. The plan is to assign higher token costs to more expensive calls, e.g. VM create, so we can't simplify to the level of RPC calls, but I'll figure out how to document this for users.
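The per-call cost idea mentioned in this reply could be sketched as a weight function; the call names and cost values below are purely illustrative assumptions, not anything the PR defines.

```ocaml
(* Heavier calls drain the bucket faster, so an expensive operation like
   VM.create counts as several "ordinary" calls against the rate limit. *)
let cost_of_call = function
  | "VM.create" | "VM.clone" -> 10. (* expensive: large hypothetical weight *)
  | "VM.get_record" -> 1.           (* cheap read *)
  | _ -> 1.                         (* default cost per call *)
```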
        "Amount of tokens that can be consumed in one burst"
        ~ignore_foreign_key:true ~default_value:(Some (VFloat 0.))
    ; field ~qualifier:StaticRO ~ty:Float ~lifecycle "fill_rate"
        "Tokens added to token bucket per second" ~ignore_foreign_key:true
    - "Tokens added to token bucket per second" ~ignore_foreign_key:true
    + "Calls per second afforded to the client" ~ignore_foreign_key:true
Rate limits can no longer be set from xapi_globs. Instead, the rate limiter is now initialised from the database on startup. Signed-off-by: Christian Pardillo Laursen <[email protected]>
    type rate_limit_data = {
        bucket: Token_bucket.t
      ; process_queue:
Would it be possible to let the caller know how long it should delay before moving ahead?
For example:
consume t amount |> Option.iter Thread.delay ;
handle ()
The schedules of different threads can be partially sorted by the delays returned to them.
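One way to realise the suggested signature is to let the bucket go into token debt, so consume always succeeds but reports how long the caller should sleep. This is a sketch under that assumption; the field names and the debt-based design are illustrative, not the reviewer's mandated implementation.

```ocaml
type t = {fill_rate: float; mutable tokens: float}

(* Returns None when the tokens were available immediately, or Some delay
   (in seconds) the caller should wait before proceeding. *)
let consume tb amount =
  let delay =
    if tb.tokens >= amount then None
    else Some ((amount -. tb.tokens) /. tb.fill_rate)
  in
  (* going negative records the debt; it is repaid as the bucket refills *)
  tb.tokens <- tb.tokens -. amount ;
  delay
```

Callers then use exactly the pattern from the comment: `consume t amount |> Option.iter Thread.delay`.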
Proof of concept for XAPI throttling.
This allows users to specify, in xapi.conf, the user agents of rate-limited clients. Requests from those clients consume from a token bucket, and must wait for it to refill once they exceed its capacity.