[POC] Image generation multi-concurrency idea #2113


Open · wants to merge 10 commits into master
Conversation

dkalinowski (Collaborator) commented Apr 25, 2025

Increased the number of infer requests for each model
Added a way to select an infer request by request_idx
Added a GenerationRequest class to manage assignment of request_idx
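The idea above can be sketched as a pool of infer-request slots plus an RAII holder. This is an illustrative sketch only; `RequestIdxPool` and the member names are placeholders, not the actual GenAI API, and the real `GenerationRequest` in the PR may differ.

```cpp
#include <cstddef>
#include <mutex>
#include <queue>

// Hypothetical pool of N infer-request slots: each concurrent
// generation acquires a free request_idx and releases it when done.
class RequestIdxPool {
public:
    explicit RequestIdxPool(std::size_t num_requests) {
        for (std::size_t i = 0; i < num_requests; ++i)
            m_free.push(i);
    }

    // Acquire a free slot; returns false if all requests are busy.
    bool acquire(std::size_t& request_idx) {
        std::lock_guard<std::mutex> lock(m_mutex);
        if (m_free.empty())
            return false;
        request_idx = m_free.front();
        m_free.pop();
        return true;
    }

    void release(std::size_t request_idx) {
        std::lock_guard<std::mutex> lock(m_mutex);
        m_free.push(request_idx);
    }

private:
    std::mutex m_mutex;
    std::queue<std::size_t> m_free;
};

// RAII wrapper mirroring the GenerationRequest role: holds one
// request_idx for the lifetime of a single generation.
class GenerationRequest {
public:
    explicit GenerationRequest(RequestIdxPool& pool) : m_pool(pool) {
        m_valid = pool.acquire(m_idx);
    }
    ~GenerationRequest() {
        if (m_valid)
            m_pool.release(m_idx);
    }
    bool valid() const { return m_valid; }
    std::size_t request_idx() const { return m_idx; }

private:
    RequestIdxPool& m_pool;
    std::size_t m_idx = 0;
    bool m_valid = false;
};
```

Two concurrent pipeline calls would each hold a distinct `request_idx`, so they never share an infer request.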

likholat (Contributor) commented May 7, 2025

@dkalinowski, we've discussed this POC with @Wovchena and @ilya-lavrenov. It looks like the simplest and most transparent solution is to create new inference_requests using a clone() method.

The clone() method should be implemented at the following levels:

  1. Tasks: Text2ImagePipeline, Image2ImagePipeline, InpaintingPipeline
  2. Pipelines: StableDiffusionPipeline, StableDiffusionXLPipeline, ...
  3. Models: SD3Transformer2DModel, UNet2DConditionModel, AutoencoderKL, ...

Example: Text2ImagePipeline:

When calling Text2ImagePipeline::clone(), internally it will:

  1. Call clone() on the corresponding diffusion pipeline (e.g. StableDiffusionPipeline::clone(...)), which:
    1. Reconstructs a new Scheduler from the config of the original
    2. Calls clone() for each model in the pipeline (e.g. CLIPTextModel::clone(...)), where a new inference_request is created internally
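The cascade described above can be sketched with stub classes. All class bodies, member names, and the `Stub` suffixes below are placeholders for illustration, not the real GenAI implementation; the key point is that each clone() delegates one level down and the model-level clone() creates a fresh infer request while sharing everything else.

```cpp
// Stands in for ov::InferRequest: only an id, to show that clones
// get distinct requests.
struct InferRequestStub {
    int id;
};

// Model level (e.g. CLIPTextModel): clone() shares the compiled
// model state but creates a new infer request internally.
class CLIPTextModelStub {
public:
    CLIPTextModelStub() : m_request{next_id()++} {}

    CLIPTextModelStub clone() const {
        CLIPTextModelStub copy(*this);           // share config/weights
        copy.m_request = InferRequestStub{next_id()++};  // fresh request
        return copy;
    }
    int request_id() const { return m_request.id; }

private:
    static int& next_id() { static int id = 0; return id; }
    InferRequestStub m_request;
};

// Diffusion-pipeline level (e.g. StableDiffusionPipeline):
// rebuilds the Scheduler from the original's config (elided here)
// and clones every model it owns.
class StableDiffusionPipelineStub {
public:
    StableDiffusionPipelineStub clone() const {
        StableDiffusionPipelineStub copy;
        copy.m_text_encoder = m_text_encoder.clone();
        return copy;
    }
    CLIPTextModelStub m_text_encoder;
};

// Task level (e.g. Text2ImagePipeline): delegates to the
// underlying diffusion pipeline.
class Text2ImagePipelineStub {
public:
    Text2ImagePipelineStub clone() const {
        Text2ImagePipelineStub copy;
        copy.m_impl = m_impl.clone();
        return copy;
    }
    StableDiffusionPipelineStub m_impl;
};
```

After cloning, the original and the clone can run inference concurrently because no infer request is shared between them.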

dkalinowski (Collaborator, Author) commented
Thank you for the review. Here is the corrected version with the clone idea: #2190

Could you review before I proceed with remaining pipeline types? @likholat

Labels

  category: continuous batching (Continuous batching)
  category: CPP API (Changes in GenAI C++ public headers)
  category: Image generation samples (GenAI Image generation samples)
  category: image generation (Image generation pipelines)
  category: Python API (Python API for GenAI)
  category: tokenizers (Tokenizer class or submodule update)
  no-match-files
  WIP
3 participants