distill_o46_2930
Qwen3-8B fine-tuned for autonomous coding via OpenHands agent framework.
Training Details
- Base model: Qwen3-8B
- Data: o46 (2930 training examples)
- Format: OpenHands free-text function calling with
<think>reasoning - Best val loss: 0.2417
- Checkpoint: global_step_46
- LR: 2e-5, cosine schedule
- Epochs: 3, batch=128, bf16
Usage with vLLM
python -m vllm.entrypoints.openai.api_server \
--model sqy201x/distill_o46_2930 \
--max-model-len 32768 \
--enforce-eager
No --enable-auto-tool-choice needed — model outputs <function=...> as plain text.
- Downloads last month
- 11
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support