Pinned Loading
Repositories
Showing 10 of 523 repositories
- slime Public Forked from THUDM/slime
slime is an LLM post-training framework for RL Scaling.(Fork for contributing. All changes intended for upstream PRs.)
alibaba/slime’s past year of commit activity - ROLL Public
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
alibaba/ROLL’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…