Qwen3-4B Finetune with SFT + Offline DPO + Online GRPO
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for lly0571/Qwen3-4B-2507-FC
Base model
Qwen/Qwen3-4B-Instruct-2507Qwen3-4B Finetune with SFT + Offline DPO + Online GRPO
Base model
Qwen/Qwen3-4B-Instruct-2507