Article Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp Jan 30 • 17
Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 12 days ago • 839
bartowski/arcee-ai_Trinity-Large-Thinking-GGUF Text Generation • 399B • Updated 13 days ago • 3.12k • 10
Running Qwen3.5 Omni Offline Demo 🌍 119 Chat with a multimodal AI using text, images, audio, or video