TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 9 days ago • 106
TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate Paper • 2504.19874 • Published Apr 28, 2025 • 32
Jackrong/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2 Image-Text-to-Text • 10B • Updated 9 days ago • 48.1k • 156
Jackrong/Qwen3.5-2B-Claude-4.6-Opus-Reasoning-Distilled-GGUF Text Generation • 2B • Updated about 1 month ago • 72k • 147
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated 9 days ago • 589k • 2.64k
freddm/Voxtral-Mini-4B-Realtime-2602-GGUF Automatic Speech Recognition • 4B • Updated Feb 14 • 439 • 7