test: Add responses test suite vllm-qwen35 with recordings by msager27 · Pull Request #5216 · llamastack/llama-stack

msager27 · 2026-03-19T16:01:27Z

What does this PR do?

Add a new responses test suite vllm-qwen35. Recordings for these tests are included. I know some changes are in the works for the recording system. As such, I'm making this a draft PR for discussion purposes.

vLLM (version 0.17.1) was deployed on an AWS EC2 instance with GPUs, serving the model Qwen/Qwen3.5-35B-A3B. Requiring a GPU complicates things if we need to re-record; that's being looked into within the community. I used a Qwen3.5 model because it supports image input.

I included one small fix to the recording system. A couple of the web search tool tests were overwriting the corresponding GPT recordings. I added test_id in normalize_tool_request so that these recordings no longer collide.
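To illustrate why scoping a recording key by test avoids the overwrite, here is a minimal sketch. It is not the llama-stack implementation: `recording_key`, its parameters, the provider and tool names, and the test IDs are all hypothetical stand-ins, and the sketch simply assumes recordings are keyed by a hash of the normalized request.

```python
import hashlib
import json


def recording_key(provider, tool_name, kwargs, test_id=None):
    """Hypothetical stand-in for a request-normalizing hash (not llama-stack's code)."""
    normalized = {
        "provider": provider,
        "tool_name": tool_name,
        "kwargs": kwargs,
    }
    if test_id is not None:
        # Including the test ID keeps identical requests from different tests apart.
        normalized["test_id"] = test_id
    # sort_keys makes the serialization, and therefore the hash, deterministic.
    payload = json.dumps(normalized, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


query = {"query": "example search"}

# Without test_id: two tests issuing the same tool call share one key,
# so the later recording would overwrite the earlier one.
collide_a = recording_key("web-search", "web_search", query)
collide_b = recording_key("web-search", "web_search", query)

# With test_id: the same tool call yields a distinct key per test.
distinct_a = recording_key("web-search", "web_search", query, test_id="tests/a.py::test_gpt")
distinct_b = recording_key("web-search", "web_search", query, test_id="tests/b.py::test_vllm")

print(collide_a == collide_b)
print(distinct_a == distinct_b)
```

The trade-off with this kind of scoping is that recordings can no longer be shared across tests that issue the same request, which is the intent here.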

Test Plan

Run the responses test suite with the above-mentioned vLLM + Qwen model in record mode. Then verify everything works in replay mode with:

uv run ./scripts/integration-tests.sh --stack-config server:ci-tests --setup vllm-qwen35 --inference-mode replay --subdirs responses

github-actions · 2026-03-19T16:05:26Z

✅ Recordings committed successfully

Recordings from the integration tests have been committed to this PR.


mergify · 2026-03-25T08:47:05Z

This pull request has merge conflicts that must be resolved before it can be merged. @msager27 please rebase it. https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

What does this PR do?

Updates a few responses integration tests based on testing with vLLM.

Some context: I initially tested with vLLM + Qwen3.5 as part of #5216. That PR was more of a staging effort and will be mostly obsoleted by #5297 and get closed. However, there are a few changes from that PR that I've pulled into separate PRs:

1. This PR which makes one of the web search tests more flexible in its validation (plus a couple skips when provider is vllm)
2. #5233

Test Plan

Rerun the responses web search tests and verify they work as expected

Co-authored-by: Sébastien Han <seb@redhat.com>

msager27 · 2026-04-03T21:36:30Z

Closing this PR. It included some initial staging work for discussion, but this work will be superseded by PR #5297. New vLLM recordings are expected to be created as either part of that PR or a new one.

This PR also included a couple fixes. These have been submitted as separate PRs. One was merged. The other is awaiting review.

test: Add responses test suite vllm-qwen35 with recordings

976c17f

meta-cla bot added the CLA Signed label Mar 19, 2026

msager27 mentioned this pull request Mar 19, 2026

test: Add new test suite responses-vllm #5039

Closed

test: add vllm-qwen35 ci job

ce47a1a

msager27 mentioned this pull request Mar 20, 2026

fix: add test_id to normalize_tool_request() to avoid hash collision #5233

Open

mergify bot added the needs-rebase label Mar 25, 2026

msager27 mentioned this pull request Mar 26, 2026

test: Update responses tests based on vllm testing #5328

Merged

msager27 closed this Apr 3, 2026

msager27 wants to merge 2 commits into llamastack:main from msager27:responses_vllm_qwen35
