Issues: volcengine/verl
[Question] How to call reward model in rule based reward function?
#433
opened Mar 1, 2025 by
zsychina
[Bug?] Attention Mask Not Rearranged in Critic Model When use_dynamic_bsz is Enabled
#431
opened Mar 1, 2025 by
Viol2000
Does it support the update of the ref model and how to control its update frequency?
#415
opened Feb 28, 2025 by
nomadlx
SPMD mode fail when VLLM_USE_V1=1 and VLLM_ENABLE_V1_MULTIPROCESSING=0 with vllm 0.7.4.dev nightly
#410
opened Feb 27, 2025 by
SwordFaith
RuntimeError: CUDA error: uncorrectable ECC error encountered!!!
#407
opened Feb 27, 2025 by
asirgogogo
[Feature Request]: Support Parallel Reward Calculation for Time-consuming Methods
#406
opened Feb 27, 2025 by
AIBionics
How many H20 (96GB) GPUs are needed to train Qwen7B with the GRPO algorithm?
#401
opened Feb 27, 2025 by
Tuziking
TP rollout + FSDP / TP actor
Labels: call for contribution, enhancement
#400
opened Feb 27, 2025 by
jeromeku
Using Megatron backend, OOM occurs when running the PPO of qwen25-32b model on 4-node H800
#388
opened Feb 26, 2025 by
echo-valor
ray.exceptions.ActorDiedError: The actor died unexpectedly before finishing this task.
#383
opened Feb 25, 2025 by
fengyang95