lihaoxin2020/qwen3-4B-refiner-sft-rl-balanced-resume-step100 Text Generation • 196k • Updated about 18 hours ago
lihaoxin2020/qwen3-4B-refiner-sft-rl-balanced-step50 Text Generation • 196k • Updated 2 days ago • 168
lihaoxin2020/qwen3-4B-refiner-3201-rl-balanced-step100 Text Generation • 196k • Updated 2 days ago • 117
lihaoxin2020/qwen3-4B-refiner-3201-rl-balanced-step50 Text Generation • 196k • Updated 3 days ago • 1