Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated 10 days ago • 589k • 2.67k
Qwopus3.5-v3 Collection 🌟Qwopus3.5-v3 is the latest model in the Claude series. • 12 items • Updated 6 days ago • 85
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF Image-Text-to-Text • 27B • Updated 10 days ago • 398k • 566
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 85 items • Updated 4 days ago • 525
view post Post 3910 We collaborated with NVIDIA to teach you about Reinforcement Learning and RL environments. 💚 Learn:• Why RL environments matter + how to build them• When RL is better than SFT• GRPO and RL best practices• How verifiable rewards and RLVR workBlog: https://unsloth.ai/blog/rl-environments See translation 4 replies · 🔥 9 9 🤝 2 2 ❤️ 1 1 + Reply