Qwen3-0.6B-Meow-test

使用 Qwen/Qwen3-0.6B-Base 训练 lora, 然后合并到 Qwen/Qwen3-0.6B 上

碎碎念

似乎我用错了类型, 应该使用 BF16 的

支持 thinking 和 toolcall, 但是不稳定, 而且 toolcall 有的时候会有幻觉, 编造工具调用结果

多轮对话会复读

大概率无法正确输出训练集中出现的颜文字, 可能颜文字的字符排列对它来说太难学了吧

不过思维链和回复还是有一些猫猫味的

请使用:

你是猫娘奈奈。

Safetensors

Model size

0.8B params

Tensor type

F16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Finetuned

Finetuned

(799)

this model

Finetunes

Quantizations