volcengine / verl Public

Notifications You must be signed in to change notification settings
Fork 363
Star 4k

Code
Issues 94
Pull requests 33
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Issues: volcengine/verl

[Roadmap] veRL Development Roadmap

#22 opened Nov 22, 2024 by PeterSH6

Open 3

[RFC] Megatron-LM and MCore maintaining issues for veRL

#15 opened Nov 19, 2024 by PeterSH6

Open

verl v0.2.1 & v0.3 release checklist

#354 opened Feb 23, 2025 by eric-haibin-lin

Open 13

Labels 19 Milestones 0

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

94 Open 85 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

[Question] How to call reward model in rule based reward function?

#433 opened Mar 1, 2025 by zsychina

[Bug?] Attention Mask Not Rearranged in Critic Model When use_dynamic_bsz is Enabled

#431 opened Mar 1, 2025 by Viol2000

Maybe a memory check tool is need.

#430 opened Mar 1, 2025 by Jinyi6

Ray OOM causes the process to be killed

#429 opened Mar 1, 2025 by PKU-Fgx

请问训练完的actor model 里边的参数文件怎么加载测试呀

#422 opened Feb 28, 2025 by aaa559

运行你们的demo遇到这个报错是咋回事啊

#421 opened Feb 28, 2025 by xzagit

[Feature Request] Save checkpoint with huggingface format.

#420 opened Feb 28, 2025 by wenxueru

optim.warmup_style do not take effect

#418 opened Feb 28, 2025 by wenxueru

Does it support the update of the ref model and how to control its update frequency?

#415 opened Feb 28, 2025 by nomadlx

Load weight issue when world_size==1 and vllm>=0.7.0

#413 opened Feb 28, 2025 by wenxueru

SFT with Megatron backend

#412 opened Feb 28, 2025 by kechunFIVE

SPMD mode fail when VLLM_USE_V1=1 and VLLM_ENABLE_V1_MULTIPROCESSING=0 with vllm 0.7.4.dev nightly

#410 opened Feb 27, 2025 by SwordFaith

RuntimeError: CUDA error: uncorrectable ECC error encountered！！！

#407 opened Feb 27, 2025 by asirgogogo

[Feature Request]: Support Parallel Reward Calculation for Time-consuming Methods

#406 opened Feb 27, 2025 by AIBionics

How many H20 (96GB) GPUs are needed to train Qwen7B with the GRPO algorithm?

#401 opened Feb 27, 2025 by Tuziking

TP rollout + FSDP / TP actor call for contribution enhancement

New feature or request

#400 opened Feb 27, 2025 by jeromeku

Support multi-turn rollout

#398 opened Feb 26, 2025 by casper-hansen

reward_model.micro_batch_size_per_gpu not work

#395 opened Feb 26, 2025 by yuki-666

vllm 0.6.3 error

#393 opened Feb 26, 2025 by CurryxIaoHu

[Question] How to use customized chat template

#390 opened Feb 26, 2025 by zpqiu

Using Megatron backend, OOM occurs when running the PPO of qwen25-32b model on 4-node H800

#388 opened Feb 26, 2025 by echo-valor

PPO Training Hangs at Step 0 when use_remove_padding

#387 opened Feb 26, 2025 by maksimstw

Support for mutliturn online RL training

#385 opened Feb 25, 2025 by UbeCc

Startup times are slow

#384 opened Feb 25, 2025 by casper-hansen

ray.exceptions.ActorDiedError: The actor died unexpectedly before finishing this task.

#383 opened Feb 25, 2025 by fengyang95

Previous 1 2 3 4 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly