bigatuna/Qwen3.5-9b-Sushi-Coder-RL-GGUF Text Generation • 9B • Updated 13 days ago • 16.7k • 44
unsloth/Nemotron-3-Nano-30B-A3B-GGUF Text Generation • 32B • Updated Dec 31, 2025 • 214k • 290
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation • 32B • Updated 29 days ago • 1.52M • 711
Nanbeige/Nanbeige4-3B-Thinking-2511 Text Generation • 4B • Updated Dec 17, 2025 • 1.81k • 204
At the close of the National Holiday 🇨🇳, Ant Group drops a new SoTA model.
Ling-1T 🔥 the trillion-parameter flagship of the Ling 2.0 series.
inclusionAI/Ling-1T
✨ 1T total / 50B active params per token
✨ 20T+ reasoning-dense tokens (Evo-CoT)
✨ 128K context via YaRN
✨ FP8 training: 15%+ faster, same precision as BF16
✨ Hybrid Syntax-Function-Aesthetics reward for front-end & visual generation