Skip to content

v2.0

Latest

Choose a tag to compare

@sunzeyeah sunzeyeah released this 26 May 01:56

Pipelined implementation of SFT, Reward and RLHF training based on transformers, DeepSpeed and DeepSpeedChat. List of supported models: Pangu, GLM, ChatGLM