xiao45791/Qwen3-VL-8B-Instruct-SFT-Gemini-Distill-100k Image-Text-to-Text • 9B • Updated 5 days ago • 36
xiao45791/Qwen3-VL-8B-Instruct-SFT-Gemini-Distill-100k Image-Text-to-Text • 9B • Updated 5 days ago • 36
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-dapo-1144steps 5B • Updated 18 days ago • 53
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-dapo-1144steps 5B • Updated 18 days ago • 53
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-grpo-1320steps 5B • Updated 18 days ago • 50
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-grpo-1320steps 5B • Updated 18 days ago • 50
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-grpo-1500steps 5B • Updated 19 days ago • 63
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-grpo-1500steps 5B • Updated 19 days ago • 63
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-DAPO-720steps 5B • Updated 21 days ago • 56
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-DAPO-720steps 5B • Updated 21 days ago • 56
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-DAPO-960steps 5B • Updated 21 days ago • 65
xiao45791/Qwen3-VL-4B-Instruct-SFT-Gemini-Distill-after500steps-step2-DAPO-960steps 5B • Updated 21 days ago • 65
xiao45791/Qwen3-VL-4B-Instruct-SFT-MMR1-TechAI-Gemini-Distill-DAPO-30steps 5B • Updated 29 days ago • 83
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published about 1 month ago • 185
xiao45791/Qwen3-VL-4B-Instruct-SFT-MMR1-TechAI-Gemini-Distill-DAPO-30steps 5B • Updated 29 days ago • 83