v2.0

Latest

sunzeyeah released this 26 May 01:56

5ec3e3a

Pipelined implementation of SFT, Reward and RLHF training based on transformers, DeepSpeed and DeepSpeedChat. List of supported models: Pangu, GLM, ChatGLM

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v2.0

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!