koutch/qwen3-thinking-4b_train_grpo_v1_train_no_think Text Generation • 4B • Updated Dec 20, 2025 • 1
koutch/qwen3-instruct-4b_train_grpo_v1_train_no_think Text Generation • 4B • Updated Dec 19, 2025 • 1