Model

Steven10429/qwen14-2wc1p-eos-3-merge

Details:

This iteration

本轮训练

  • 开启eos
  • 降低lr
  • 3 epoch
  • 尝试改善模型生成长度问题

下一步尝试:

  • 随机EOS出现,0.3概率出现EOS
  • 降低EOS的概率,修改GUFF保存文件
Downloads last month
11
Safetensors
Model size
15B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Steven10429/qwen14-2wc1p-eos-3-merge