
multi-lora batching #14249

Closed · Answered by jeejeelee
Diffizle asked this question in Q&A
Mar 5, 2025 · 1 comment · 1 reply


Does vLLM support batching of prompts that use different LoRA adapters?

vLLM supports this feature.

The example in examples/offline_inference/multilora_inference.py does not seem to demonstrate this feature.

multilora_inference.py does show this feature: each request carries its own LoRA ID, and different IDs identify different LoRAs within the same batch.
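
For reference, here is a minimal sketch of that pattern, modeled on multilora_inference.py; the base model name, adapter paths, and prompts below are placeholders:

```python
from vllm import EngineArgs, LLMEngine, SamplingParams
from vllm.lora.request import LoRARequest

# Build an engine with LoRA support enabled. max_loras controls how many
# adapters can be active in a single batch.
engine = LLMEngine.from_engine_args(
    EngineArgs(
        model="meta-llama/Llama-2-7b-hf",  # placeholder base model
        enable_lora=True,
        max_loras=2,
        max_lora_rank=8,
    )
)

params = SamplingParams(temperature=0.0, max_tokens=64)

# Two requests with *different* adapters. The distinct lora_int_ids
# (second LoRARequest argument) tell the engine these are different
# LoRAs, so the requests can still be batched together.
engine.add_request(
    "req-0", "Translate to SQL: list all users", params,
    lora_request=LoRARequest("sql-lora", 1, "/path/to/sql_lora"))
engine.add_request(
    "req-1", "Summarize: vLLM batches multiple LoRAs", params,
    lora_request=LoRARequest("sum-lora", 2, "/path/to/sum_lora"))

# The engine loop decodes both requests in the same batches.
while engine.has_unfinished_requests():
    for out in engine.step():
        if out.finished:
            print(out.request_id, out.outputs[0].text)
```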

Replies: 1 comment 1 reply


1 reply
@Diffizle

Answer selected by Diffizle
Category: Q&A
Labels: none yet
2 participants