Base/staging branch for GuideLLM refactor for all PRs to eventually be merged into before landing on main #351

markurtz · 2025-09-19T03:21:52Z

Summary

TODO

Details

TODO

Test Plan

TODO

Related Issues

TODO

… refactor branch Signed-off-by: Mark Kurtz <[email protected]>

…ng config.py to settings.py due to later config additions and potential conflicts in naming Signed-off-by: Mark Kurtz <[email protected]>

Signed-off-by: Mark Kurtz <[email protected]>

…view Signed-off-by: Mark Kurtz <[email protected]>

Signed-off-by: Mark Kurtz <[email protected]>

… for plural Signed-off-by: Mark Kurtz <[email protected]>

…p to avoid conflicts Signed-off-by: Mark Kurtz <[email protected]>

Signed-off-by: Mark Kurtz <[email protected]>

Signed-off-by: jaredoconnell <[email protected]>

## Summary  TODO --- - [x] "I certify that all code in this PR is my own, except as noted below." ## Use of AI - [ ] Includes AI-assisted code completion - [ ] Includes code generated by an AI application - [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

## Summary This PR ports the new functionality from `benchmark run` to `benchmark from-file`, and does so in a way that reuses as much code as practical to have one source of truth. ## Details  - Fixes from-file by making it to use the new output format. - Moves code related to the new output formats to separate functions that are called from both benchmark entrypoints. - Moves additional chunks of code out of the large benchmark run entrypoint function for modularity. ## Test Plan Run a benchmark with an output of json or yaml, and use `from-file` to re-import it and export it. You can select any output type supported by `benchmark run`. `guidellm benchmark from-file ./result.json --output-formats console` `guidellm benchmark from-file ./result.yaml --output-formats yaml` ## Related Issues --- - [x] "I certify that all code in this PR is my own, except as noted below." ## Use of AI - [x] Includes AI-assisted code completion - [ ] Includes code generated by an AI application - [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`) --------- Signed-off-by: Jared O'Connell <[email protected]>

## Summary Reintroduces a few changes from main --------- Signed-off-by: Samuel Monson <[email protected]>

Replace scenario entrypoint with a decorator Forward-port get_default and from_file to Scenario Apply scenario args as an update to kwargs Readd scenario support to CLI Signed-off-by: Samuel Monson <[email protected]>

Signed-off-by: Samuel Monson <[email protected]>

Signed-off-by: Jared O'Connell <[email protected]>

Signed-off-by: Samuel Monson <[email protected]>

Co-authored-by: Samuel Monson <[email protected]> Signed-off-by: Jared O'Connell <[email protected]>

## Summary This PR gets the CSV output to a state comparable to pre-refactor. ## Details Implements the functions to export the required data to the CSV format. The goal is to include the required information in the CSV without cluttering it, but also without creating too much of a burden to the future maintainers resulting from referencing specific schema elements. The following columns are new: - Profile (an entire JSON dump of the profile) - Backend (the entire JSON dump of the internal data structure) - Generator Data You can view these files in the attached output generated by the following test: `guidellm benchmark run --output-path result_7.csv --max-seconds 2 --target=http://localhost:8000 --data "prompt_tokens=256,output_tokens=128" --rate-type constant --rate 1 --output-formats csv` [result_7.csv](https://github.com/user-attachments/files/22222220/result_7.csv) ## Test Plan Run GuideLLM with the following additional args: `--output-path result.csv --output-formats csv` The generated file should have all info required. ## Related Issues This is a part of the scheduler refactor. --- - [x] "I certify that all code in this PR is my own, except as noted below." ## Use of AI - [ ] Includes AI-assisted code completion - [ ] Includes code generated by an AI application - [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

## Summary  Bumps the minimum python version to 3.10 and updates all dependent tooling (CI, tox, ruff, mypy). --- - [x] "I certify that all code in this PR is my own, except as noted below." ## Use of AI - [ ] Includes AI-assisted code completion - [ ] Includes code generated by an AI application - [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

Fixes a issue in metric calculation that caused incorrect statistics at extreme changes in concurrency and an issue where the first decode token was not counted in total tokens per second.  - [x] Fixed issue where merged concurrency change events would double-count concurrency - [x] Ensure first decode token is counted when calculating total tokens per second  - Run unit tests: `tox -e test-unit -- -m "regression and sanity"` --- - [x] "I certify that all code in this PR is my own, except as noted below." - [x] Includes AI-assisted code completion - [ ] Includes code generated by an AI application - [x] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`) --------- Signed-off-by: Samuel Monson <[email protected]>

Signed-off-by: Samuel Monson <[email protected]>

## Summary  Fixes failing unit tests. Most were failing due to changes in functionality but a couple were regressions. --- - [x] "I certify that all code in this PR is my own, except as noted below." ## Use of AI - [x] Includes AI-assisted code completion - [ ] Includes code generated by an AI application - [x] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

## Summary This PR fixes all type errors in the utils package. Only a few were ignored. ## Details - A lot of these changes are reflecting that values can be None, and the associated None checks. - Others are incorrect type annotations - Others are asserting with cast that we know for certain that the type is correct. - Plus other minor changes ## Test Plan Run the tests and look through the changes to make sure the logic is equivalent or better to the original code. --- - [x] "I certify that all code in this PR is my own, except as noted below." ## Use of AI - [x] Includes AI-assisted code completion - [ ] Includes code generated by an AI application - [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

Many of the quality errors are due to using the older union style, and have appeared due to the upgrade of the minimum Python version from 3.9 to 3.10 Signed-off-by: Jared O'Connell <[email protected]>

Signed-off-by: Jared O'Connell <[email protected]>

## Summary This fixes quality errors in the code. ## Details The recent switch to Python 3.10 means that the linter can apply 3.10 specific linters, so this fixes errors that occur now that backwards compatibility is no longer required. The biggest change is the switch from explicit `Union` references to `|`. I also took advantage of the version change to use `match`. Also, it appears that the `from __future__ import annotations` lines are masking a lot of circular imports. So I can't remove those yet. ## Test Plan Run GuideLLM as normal, make sure the tests pass, and confirm that there are no more quality errors. --- - [x] "I certify that all code in this PR is my own, except as noted below." ## Use of AI - [x] Includes AI-assisted code completion - [ ] Includes code generated by an AI application - [x] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

Signed-off-by: Samuel Monson <[email protected]>

Signed-off-by: Jared O'Connell <[email protected]>

Signed-off-by: Samuel Monson <[email protected]>

## Summary  Default to marking tests that timeout as XFAIL. ## Details  - [ ] ## Test Plan  - ## Related Issues  - Closes #404 --- - [x] "I certify that all code in this PR is my own, except as noted below." ## Use of AI - [ ] Includes AI-assisted code completion - [ ] Includes code generated by an AI application - [x] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)

Base version update to what is pushed as latest to enable PR for base…

a2d19cd

… refactor branch Signed-off-by: Mark Kurtz <[email protected]>

markurtz force-pushed the features/refactor/base branch from ebdb6ce to a2d19cd Compare September 19, 2025 03:22

markurtz and others added 19 commits September 19, 2025 03:50

Base version update to what is pushed as latest to enable PR for base…

cd5a92d

… refactor branch Signed-off-by: Mark Kurtz <[email protected]>

core changes for refactor including pyproject.toml updates and renami…

8d6e19a

…ng config.py to settings.py due to later config additions and potential conflicts in naming Signed-off-by: Mark Kurtz <[email protected]>

remove improper readdition of pyhumps

669848d

Signed-off-by: Mark Kurtz <[email protected]>

refactors for the utility modules

6b6ed98

Signed-off-by: Mark Kurtz <[email protected]>

Remove old pydantic file that is now replaced

d15cf17

Signed-off-by: Mark Kurtz <[email protected]>

fixes from copilot review

5b83c2d

Signed-off-by: Mark Kurtz <[email protected]>

add refactored scheduler package and tests

c84299b

Signed-off-by: Mark Kurtz <[email protected]>

Standardize on plural for modules/packages and update from copilot re…

a7ae737

…view Signed-off-by: Mark Kurtz <[email protected]>

backend refactor implementations

02554b0

Signed-off-by: Mark Kurtz <[email protected]>

fixes from copilot review and standardize backend package to backends…

a88605e

… for plural Signed-off-by: Mark Kurtz <[email protected]>

remove renaming changes from benchmark package til after that PR is u…

452eb65

…p to avoid conflicts Signed-off-by: Mark Kurtz <[email protected]>

Add in benchmark package refactor

7829fb8

Signed-off-by: Mark Kurtz <[email protected]>

fixes and rebase

4834767

Signed-off-by: Mark Kurtz <[email protected]>

fixes from copilot review

61736f5

Signed-off-by: Mark Kurtz <[email protected]>

Mock server implementation for guidellm

a28bbe3

fixes from copilot review

bb98193

Signed-off-by: Mark Kurtz <[email protected]>

Any missing changes / working state for refactor

a9a082a

Signed-off-by: Mark Kurtz <[email protected]>

add in the perf extras

6d0d4c2

Signed-off-by: Mark Kurtz <[email protected]>

Complete CSV output

bfc8e50

Signed-off-by: jaredoconnell <[email protected]>

sjmonson mentioned this pull request Sep 23, 2025

Fix start_token is not correct #361

Open

4 tasks

sjmonson and others added 8 commits September 24, 2025 12:57

[GuideLLM Refactor] Entrypoint: Reintroduce changes from main (#363)

78615f7

## Summary Reintroduces a few changes from main --------- Signed-off-by: Samuel Monson <[email protected]>

Update GenerativeTextScenario to match current def

3ac1537

Replace scenario entrypoint with a decorator Forward-port get_default and from_file to Scenario Apply scenario args as an update to kwargs Readd scenario support to CLI Signed-off-by: Samuel Monson <[email protected]>

Add workaround for pydantic/pydantic#9541

c47a1f6

Signed-off-by: Samuel Monson <[email protected]>

Rename rate_type -> profile in builtin scenarios

965aca2

Signed-off-by: Samuel Monson <[email protected]>

Always parse rate as list[float]

d9a4df2

Signed-off-by: Samuel Monson <[email protected]>

Fix bug where empty constraints in sweep caused error

03f9085

Signed-off-by: Jared O'Connell <[email protected]>

sjmonson and others added 30 commits October 9, 2025 11:05

Bump minimal python version to 3.10

eb8e84e

Signed-off-by: Samuel Monson <[email protected]>

Bump min python to 3.10 in CI

9292e38

Signed-off-by: Samuel Monson <[email protected]>

Update docs to reflect bumping min pyhton

d40a657

Signed-off-by: Samuel Monson <[email protected]>

Add force update stratagy to lockfile script

eff0f46

Signed-off-by: Samuel Monson <[email protected]>

Optimize use of lowercase in src/guidellm/utils/registry.py

bde3ae8

Co-authored-by: Samuel Monson <[email protected]> Signed-off-by: Jared O'Connell <[email protected]>

Disable base class initialization

e1fb966

Signed-off-by: Samuel Monson <[email protected]>

Test cleanup

a9aad63

Signed-off-by: Samuel Monson <[email protected]>

Fix backend tests

440b4e3

Signed-off-by: Samuel Monson <[email protected]>

Initial scheduler test fixes

272304c

Signed-off-by: Samuel Monson <[email protected]>

Fix MeasuredRequestTimings tests

544c888

Signed-off-by: Samuel Monson <[email protected]>

Patch time.time in workgroup lifecycle test

5032e9e

Signed-off-by: Samuel Monson <[email protected]>

Tear down worker process group in instance fixture

4971e56

Signed-off-by: Samuel Monson <[email protected]>

Match main tests to current CLI

5676895

Signed-off-by: Samuel Monson <[email protected]>

Various small fixes to utils tests

5f36174

Signed-off-by: Samuel Monson <[email protected]>

Fix typing import for python3.10

1556236

Signed-off-by: Samuel Monson <[email protected]>

Fixed quality errors

87ba006

Many of the quality errors are due to using the older union style, and have appeared due to the upgrade of the minimum Python version from 3.9 to 3.10 Signed-off-by: Jared O'Connell <[email protected]>

Run auto-formatter

1bd8846

Signed-off-by: Jared O'Connell <[email protected]>

Fix remaining ruff errors

1e8974c

Signed-off-by: Jared O'Connell <[email protected]>

Fix unit tests

d0dad5a

Signed-off-by: Jared O'Connell <[email protected]>

Move asyncio timeout to common location

b243664

Signed-off-by: Samuel Monson <[email protected]>

Fix duplicate timeout in openai backend tests

cfcbd13

Signed-off-by: Samuel Monson <[email protected]>

Force time zone in tests

9ca2dba

Signed-off-by: Jared O'Connell <[email protected]>

Fix function doc

8d20525

Signed-off-by: Samuel Monson <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Base/staging branch for GuideLLM refactor for all PRs to eventually be merged into before landing on main #351

Base/staging branch for GuideLLM refactor for all PRs to eventually be merged into before landing on main #351

Uh oh!

markurtz commented Sep 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Base/staging branch for GuideLLM refactor for all PRs to eventually be merged into before landing on main #351

Are you sure you want to change the base?

Base/staging branch for GuideLLM refactor for all PRs to eventually be merged into before landing on main #351

Uh oh!

Conversation

markurtz commented Sep 19, 2025

Summary

Details

Test Plan

Related Issues

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants