🚀 Frontier MoE — 128B–1T - a batiai Collection

updated 2 days ago

The largest open-weight LLMs, quantized by BatiAI. Runnable on Macs from an M4 Max (128 GB) up to a Mac Studio M3 Ultra (512 GB).

batiai/DeepSeek-V4-Flash-GGUF

Text Generation • 284B • Updated 3 days ago • 1.2k • 3

Note: 284B-A13B MoE • CSA+HCA hybrid attention • SWE-Bench Pro top tier • via batiai/bati.cpp

batiai/Kimi-K2.6-GGUF

Text Generation • 1T • Updated 6 days ago • 6.11k

batiai/MiniMax-M2.7-GGUF

Text Generation • 229B • Updated 22 days ago • 1.31k

batiai/Mistral-Medium-3.5-128B-GGUF

Text Generation • 125B • Updated 2 days ago • 659

Note: 128B Dense • SWE-Bench Verified 77.6% • Modified MIT • measured 6.8 tokens/s on M4 Max at IQ3
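
To sanity-check whether a quantized model might fit in a given Mac's unified memory, a rough back-of-envelope estimate is parameters × bits per weight ÷ 8. The sketch below is illustrative only: the function names, the 3.5 bits-per-weight figure, and the 75% usable-memory fraction are assumptions, not values published by BatiAI, and real GGUF files also carry metadata and need extra room for the KV cache.

```python
# Rough memory estimate for quantized weights.
# All names and default values here are hypothetical, for illustration only.

def estimated_weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB (10^9 bytes): params * bits / 8."""
    return params_billion * bits_per_weight / 8.0


def fits_in_memory(params_billion: float, bits_per_weight: float,
                   ram_gb: float, usable_fraction: float = 0.75) -> bool:
    """True if the estimated weights fit in the assumed usable share of RAM.

    usable_fraction is an assumption: some memory must stay free for the
    OS, the KV cache, and other processes.
    """
    budget_gb = ram_gb * usable_fraction
    return estimated_weights_gb(params_billion, bits_per_weight) <= budget_gb


# Parameter counts (in billions) as shown in the listing above.
MODELS = {
    "DeepSeek-V4-Flash": 284,
    "Kimi-K2.6": 1000,
    "MiniMax-M2.7": 229,
    "Mistral-Medium-3.5-128B": 125,
}

ASSUMED_BITS_PER_WEIGHT = 3.5  # assumed average for a ~3-bit quant

for name, params_b in MODELS.items():
    size_gb = estimated_weights_gb(params_b, ASSUMED_BITS_PER_WEIGHT)
    print(f"{name}: ~{size_gb:.0f} GB of weights at "
          f"{ASSUMED_BITS_PER_WEIGHT} bits/weight")
```

Actual file sizes depend on the specific quantization mix used for each tensor, so the numbers on each model page take precedence over this estimate.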