unsloth/NVIDIA-Nemotron-3-Nano-Omni-30B-A3B-Reasoning-GGUF Text Generation • 32B • Updated 9 days ago • 55.4k • 109
Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows Paper • 2604.28139 • Published 8 days ago • 39
cyankiwi/gemma-4-26B-A4B-it-AWQ-4bit Image-Text-to-Text • 27B • Updated about 22 hours ago • 2.08M • 59
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published Apr 6 • 112
LLM-Perf Leaderboard 🏆 Explore LLM performance across hardware configurations • Running • Agents • Featured • 587