
[REQUEST] Support Offload deepspeed engine in RLHF training #7013

Open
hijkzzz opened this issue Feb 7, 2025 · 3 comments
Labels
enhancement New feature or request

Comments

hijkzzz commented Feb 7, 2025

See vLLM sleep mode: vllm-project/vllm#11743
This feature is very important in RLHF, e.g. for the hybrid engine mode in OpenRLHF: OpenRLHF/OpenRLHF@c10f02b

@hijkzzz hijkzzz added the enhancement New feature or request label Feb 7, 2025
tjruwase (Contributor) commented Feb 7, 2025

@hijkzzz, can you be more specific about this request? DeepSpeed was probably the first framework to offer offloading in RLHF training:
https://github.com/deepspeedai/DeepSpeedExamples/blob/8075143d922e0a25c8217ed4f72ef7121cad423a/applications/DeepSpeed-Chat/training/step3_rlhf_finetuning/main.py#L237-L251

hijkzzz (Author) commented Feb 7, 2025

> @hijkzzz can you be more specific on this request. DeepSpeed is probably the first framework to offer offloading in RLHF training: https://github.com/deepspeedai/DeepSpeedExamples/blob/8075143d922e0a25c8217ed4f72ef7121cad423a/applications/DeepSpeed-Chat/training/step3_rlhf_finetuning/main.py#L237-L251

RLHF training needs to:
- offload the model and optimizer states to CPU while vLLM generates samples, so a larger inference batch size fits on the GPU;
- load the model and optimizer states back to GPU before training, to avoid PCIe data exchange during every training step;
i.e., the same behavior as vllm-project/vllm#11743
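The requested flow can be sketched with a toy engine that only tracks where its states live (all names here are hypothetical stand-ins, not a real framework API); real code would move actual tensors:

```python
# Toy illustration of the requested RLHF flow: offload model/optimizer
# states before generation, reload them before training.

class ToyEngine:
    """Stand-in for a ZeRO-style engine; tracks where each state lives."""

    def __init__(self):
        self.devices = {"params": "gpu", "grads": "gpu", "optimizer": "gpu"}

    def offload_states(self):
        # Free GPU memory so the inference engine can use a larger batch.
        self.devices = {k: "cpu" for k in self.devices}

    def reload_states(self):
        # Bring states back once, before training, instead of paging
        # them over PCIe during every optimizer step.
        self.devices = {k: "gpu" for k in self.devices}


def rlhf_iteration(engine, generate, train_step, prompts):
    engine.offload_states()
    samples = generate(prompts)   # rollout runs with most of the GPU free
    engine.reload_states()
    return train_step(samples)    # training runs with states on the GPU
```

During `generate` all states sit on CPU; by the time `train_step` runs they are back on GPU, which is exactly the two bullet points above.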

tjruwase (Contributor) commented Feb 7, 2025

@hijkzzz, it seems you are requesting on-demand and fine-grained offloading, is that correct? Would the following APIs work:
https://deepspeed.readthedocs.io/en/latest/zero3.html#offload-states
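The linked docs describe `offload_states()` / `reload_states()` on the ZeRO-3 engine. A hedged sketch of wrapping them around the generation phase (the context-manager helper is an assumption for illustration, not a DeepSpeed API; it works against any engine exposing those two methods):

```python
from contextlib import contextmanager


@contextmanager
def states_offloaded(engine):
    """Hypothetical helper: offload engine states on entry
    (ZeRO-3 engine.offload_states()), reload them on exit."""
    engine.offload_states()
    try:
        yield engine
    finally:
        # reload even if generation raises, so training can resume
        engine.reload_states()
```

Usage would then be `with states_offloaded(engine): samples = llm.generate(prompts)`, giving on-demand offloading scoped to the rollout.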

@tohtana, FYI
