更新！！！

train_tokenizer

train_tokenizer代码在train_tokenizer.py中，用bash step_01.sh运行，这里使用的是char-based的方法，后面看时间情况改成使用bpe的方法（看来是没时间弄了）。

pretrain

pretrain代码在pretrain.py中，用bash step_02.sh运行。注意：

--train_type pretrain来指定模型进行预训练。

Evaluation

用python pretrain.py运行。注意：

在pretrain.py代码中自己修改想要评测模型的位置。
修改eval_type变量来选择是对pretrain测试还是对sft测试

sft

用bash step_02.sh运行。注意：

--train_type sft来指定模型进行supervised finetune。

ceval

执行

cd ceval
bash ceval.sh

注意：

在ceval.sh中修改自己的模型路径和参数。
修改了generate.py，没懂为什么老师是用logits=logits[0][0]，不应该是根据最近生成的token获得的词典大小的logits来预测ABCD吗？

总结

以上代码都能在我服务器上运行，且看起来效果还不错，如果有问题，可以Q我 or Email [email protected]。

更新！！！

本次实验更新了DPO和Application；
DPO分别对自己的sft模型和Qwen系列模型进行了DPO更新；
Application选择的是文本分类，直接使用Qwen系列的模型，分别实现使用zero-shot的prompt进行对话生成的文本分类，和直接使用Qwen Model代码中的AutoModelForSequenceClassification进行直接分类训练。

DPO

分别在DPO_MyGPT和DPO_Qwen文件夹中(训练数据自己准备)

cd DPO_MyGPT 或者 cd DPO_Qwen
训练：
python train.py
测试
python inference.py

自己的sft模型dpo后好像训崩了(应该是代码写的依托，但是懒得改了)，但是Qwen模型dpo后效果very good；

Application

在Text_classification文件夹中，因为我对数据集也做了一些处理，所以直接传上去了，在toutiao_cat_data文件夹中；
zero-shot的对话生成进行分类：

cd Text_classification
训练：
python train.py
测试
python inference.py

直接使用Qwen定制的AutoModelForSequenceClassification进行文本分类：

cd Text_classification
训练：
python train_v2.py
测试
python inference_v2.py

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
DPO_MyGPT		DPO_MyGPT
DPO_Qwen		DPO_Qwen
Qwen_ceval		Qwen_ceval
Text_classification		Text_classification
ceval		ceval
model		model
README.md		README.md
bpe_tokenizer.py		bpe_tokenizer.py
check_paramters.py		check_paramters.py
data.py		data.py
generator.py		generator.py
inference.py		inference.py
mygpt.py		mygpt.py
optim.py		optim.py
pretrain.py		pretrain.py
sft_train_loss.png		sft_train_loss.png
step_01.sh		step_01.sh
step_02.sh		step_02.sh
tokenizer.py		tokenizer.py
train_loss.png		train_loss.png
train_tokenizer.py		train_tokenizer.py
transformer.py		transformer.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

train_tokenizer

pretrain

Evaluation

sft

ceval

总结

更新！！！

DPO

Application

About

Releases

Packages

Languages

sorrystopper/MyGPT

Folders and files

Latest commit

History

Repository files navigation

train_tokenizer

pretrain

Evaluation

sft

ceval

总结

更新！！！

DPO

Application

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages