Skip to content

fix(runner): add 10s cooldown between concurrent and serial search ph…#801

Open
zpatronus wants to merge 1 commit into
zilliztech:mainfrom
zpatronus:fix/serial-search-cooldown
Open

fix(runner): add 10s cooldown between concurrent and serial search ph…#801
zpatronus wants to merge 1 commit into
zilliztech:mainfrom
zpatronus:fix/serial-search-cooldown

Conversation

@zpatronus

Copy link
Copy Markdown
Contributor

…ases

The serial search phase measures single-query latency under idle conditions. Previously it launched immediately after the concurrent phase, while the end-to-end path may still be in a saturated state. This significantly affects the accuracy of serial latency, especially p99/p95, because the first few queries experience an environment similar to the concurrent test rather than the intended single-user condition.


This improves serial latency accuracy for all database backends.

…ases

The serial search phase measures single-query latency under idle conditions. Previously it launched immediately after the concurrent phase, while the end-to-end path may still be in a saturated state. This significantly affects the accuracy of serial latency, especially p99/p95, because the first few queries experience an environment similar to the concurrent test rather than the intended single-user condition.
@sre-ci-robot

Copy link
Copy Markdown

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: zpatronus
To complete the pull request process, please assign xuanyang-cn after the PR has been reviewed.
You can assign the PR to them by writing /assign @xuanyang-cn in a comment when ready.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@zpatronus

Copy link
Copy Markdown
Contributor Author

/assign @XuanYang-cn

@XuanYang-cn

Copy link
Copy Markdown
Collaborator

@zpatronus I'd suggest closing this. The PR assumes serial latency should reflect a single-user / idle condition, and a cooldown brings it closer to that.

But serial isn't measuring idle — it measures warm best-case latency. That's why serial runs after concurrent on purpose: the concurrent phase replays the same queries to warm the server, and serial then measures that warm state. So post-concurrent isn't contamination to recover from — it's exactly the intended condition. A cooldown only pushes the server back toward cold, away from the target.

Also, 10s is an arbitrary constant — nothing verifies the system settled, and one value can't fit all backends.

@zpatronus

zpatronus commented Jun 18, 2026

Copy link
Copy Markdown
Contributor Author

@XuanYang-cn Fair point on the warm state. The cooldown isn't meant to evict data caches or similar thing, just to drain transient things like in-flight requests, network buffers, etc.
Agreed that 10s is arbitrary. Even 1s would be sufficient for what I'm describing. Alternatively, would a configurable --serial-cooldown-ms parameter (default 0) work? That way backends that need it can opt in. Happy to implement either.

@zpatronus

Copy link
Copy Markdown
Contributor Author

#803 implements the configurable parameter.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants