distill_o46_2930

Qwen3-8B fine-tuned for autonomous coding with the OpenHands agent framework.

Training Details

  • Base model: Qwen3-8B
  • Data: o46 (2930 training examples)
  • Format: OpenHands free-text function calling with <think> reasoning
  • Best val loss: 0.2417
  • Checkpoint: global_step_46
  • LR: 2e-5, cosine schedule
  • Epochs: 3, batch size: 128, precision: bf16
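
As a rough sanity check on the numbers above, the sketch below works out the step count and where global_step_46 falls in the run. It assumes that 128 is the global batch size, that the final partial batch of each epoch is kept, and that the cosine schedule decays to zero with no warmup. None of these is stated in this card, so treat the result as an estimate.

```python
import math

# Values taken from the training details above.
num_examples = 2930
batch_size = 128       # assumed to be the global (effective) batch size
epochs = 3
peak_lr = 2e-5
checkpoint_step = 46

# Assumption: the last partial batch of each epoch is kept (no drop_last).
steps_per_epoch = math.ceil(num_examples / batch_size)
total_steps = steps_per_epoch * epochs


def cosine_lr(step: int) -> float:
    """Cosine decay from peak_lr to 0 over total_steps, no warmup (assumed)."""
    return 0.5 * peak_lr * (1 + math.cos(math.pi * step / total_steps))


print("steps per epoch:", steps_per_epoch)
print("total steps:", total_steps)
print("epochs completed at checkpoint:", checkpoint_step / steps_per_epoch)
print("estimated LR at checkpoint:", cosine_lr(checkpoint_step))
```

Under these assumptions there are 23 steps per epoch and 69 in total, so global_step_46 would fall at the end of the second epoch, where the learning rate has decayed to roughly a quarter of its peak.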

Usage with vLLM

python -m vllm.entrypoints.openai.api_server \
    --model sqy201x/distill_o46_2930 \
    --max-model-len 32768 \
    --enforce-eager

The --enable-auto-tool-choice flag is not needed: the model emits <function=...> calls as plain text.
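
Because the calls arrive as plain text in the completion, the client has to extract them itself. The following is a minimal sketch of such a parser, not the OpenHands implementation. It assumes the output uses a `<function=NAME>` ... `<parameter=KEY>VALUE</parameter>` ... `</function>` layout, which this card does not spell out. The helper `parse_function_calls` and the sample string are hypothetical illustrations.

```python
import re

# Assumed layout (not specified in this card):
#   <function=NAME>
#   <parameter=KEY>VALUE</parameter>
#   </function>
FUNCTION_RE = re.compile(r"<function=([^>]+)>(.*?)</function>", re.DOTALL)
PARAM_RE = re.compile(r"<parameter=([^>]+)>(.*?)</parameter>", re.DOTALL)
THINK_RE = re.compile(r"<think>.*?</think>", re.DOTALL)


def parse_function_calls(text: str) -> list[dict]:
    """Hypothetical helper: extract function calls from raw model output."""
    # Drop <think> reasoning so tags quoted inside it are not parsed as calls.
    visible = THINK_RE.sub("", text)
    calls = []
    for name, body in FUNCTION_RE.findall(visible):
        params = {key.strip(): value.strip() for key, value in PARAM_RE.findall(body)}
        calls.append({"name": name.strip(), "parameters": params})
    return calls


# Made-up sample output, for illustration only.
sample = (
    "<think>I should list the files first.</think>\n"
    "<function=execute_bash>\n"
    "<parameter=command>ls -la</parameter>\n"
    "</function>"
)
print(parse_function_calls(sample))
```

In practice the input would be the completion text returned by the vLLM server started above. An agent framework such as OpenHands normally handles this parsing, so a helper like this is only useful when calling the model directly.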
