It's trash. It has a serious drop in score compared to the base model.
But it was my first RL learning and it was monumental.

eval

minpeter/calculator-agent-qwen3-0.6b: Accuracy: 15.19% (24/158)
minpeter/Qwen3-0.6B-Instruct: Accuracy: 27.22% (43/158)

Downloads last month
9
Safetensors
Model size
0.8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for minpeter/calculator-agent-qwen3-0.6b

Finetuned
Qwen/Qwen3-0.6B
Finetuned
(1)
this model
Quantizations
1 model