Frontier MoE: 128B to 1T
Collection
Largest open-weight LLMs, BatiAI-quantized. Mac-runnable from M4 Max 128GB to Mac Studio M3 Ultra 512GB. • 5 items
Quantizations of Mistral Medium 3.5 128B (Dense, Modified MIT): a frontier coding model that runs on a Mac. Built and verified by BatiAI for BatiFlow.
ollama pull batiai/mistral-medium-3.5:iq4
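Once the model is pulled, it can be queried from a script through Ollama's local REST API. The following is a minimal sketch, assuming an Ollama server is running on the default port 11434; the helper names `build_request` and `generate` are hypothetical and exist only for this illustration.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"
# Tag taken from the pull command above.
MODEL_TAG = "batiai/mistral-medium-3.5:iq4"


def build_request(prompt: str, model: str = MODEL_TAG) -> urllib.request.Request:
    """Build a non-streaming POST request for the /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = MODEL_TAG) -> str:
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("Write a function that reverses a linked list.")` returns the completion as a single string. With `"stream": False` the server replies with one JSON object rather than a stream of chunks, which keeps the sketch short.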
| Quant | Size | VRAM target | Recommended For |
|---|---|---|---|
| IQ3_XXS | ~50GB | ~64GB | 96GB+ Mac (compact) |
| IQ4_XS | ~70GB | ~84GB | 128GB Mac (recommended) |
| Q5_K_M | ~90GB | ~108GB | 192GB+ Mac (highest quality) |
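The sizes in the table can be sanity-checked with simple arithmetic: parameter count times average bits per weight, divided by 8. The sketch below assumes approximate bits-per-weight figures for each quant type (about 3.06 for IQ3_XXS, 4.25 for IQ4_XS, and 5.5 for Q5_K_M); these are rough assumptions, not values published in this card, and real files differ slightly because some tensors are kept at higher precision.

```python
PARAMS = 128e9  # parameter count implied by the model name (128B)

# Assumed average bits per weight per quant type (approximate, not from this card).
BITS_PER_WEIGHT = {"IQ3_XXS": 3.06, "IQ4_XS": 4.25, "Q5_K_M": 5.5}


def estimated_size_gb(quant: str, params: float = PARAMS) -> float:
    """Estimate the weight file size in GB (1 GB = 1e9 bytes)."""
    return params * BITS_PER_WEIGHT[quant] / 8 / 1e9


for quant in BITS_PER_WEIGHT:
    print(f"{quant}: ~{estimated_size_gb(quant):.0f} GB")
```

Under these assumptions the estimates come out at roughly 49, 68, and 88 GB, within a few GB of the listed sizes. The VRAM target column is larger than the file size because the KV cache and runtime buffers need room on top of the weights.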
| Your Mac RAM | IQ3 (~50GB) | IQ4 (~70GB) | Q5 (~90GB) |
|---|---|---|---|
| 64GB | ⚠️ Heavy swap | ❌ | ❌ |
| 96GB | ✅ Fast | ⚠️ Tight | ❌ |
| 128GB | ✅ | ✅ Recommended | ⚠️ Tight |
| 192GB+ | ✅ | ✅ | ✅ Best |
| 512GB | ✅ | ✅ | ✅ Headroom |
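The RAM guidance can be turned into a small selection helper. This is a sketch only: the minimum RAM values follow the "Recommended For" column of the quant table (96GB+, 128GB, 192GB+), the pairing of quant names with the `iq3`, `iq4`, and `q5` tag suffixes is assumed from the tags used in this card, and `recommend_tag` is a hypothetical name.

```python
from typing import Optional

# (quant name, tag suffix, minimum recommended RAM in GB), best quality first.
QUANTS = [
    ("Q5_K_M", "q5", 192),
    ("IQ4_XS", "iq4", 128),
    ("IQ3_XXS", "iq3", 96),
]


def recommend_tag(ram_gb: int) -> Optional[str]:
    """Return the highest-quality tag recommended for this much RAM, or None."""
    for _name, suffix, min_ram in QUANTS:
        if ram_gb >= min_ram:
            return f"batiai/mistral-medium-3.5:{suffix}"
    return None  # below 96GB no quant of this model is recommended


print(recommend_tag(128))
```

The list is ordered from largest to smallest quant so the first match is always the best quality that fits. Machines below 96GB get `None`, matching the table's warning that 64GB leads to heavy swapping even with the smallest quant.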
| Benchmark | Mistral Medium 3.5 |
|---|---|
| SWE-Bench Verified | 77.6% |
| Tau3-Telecom (agentic) | 91.4% |
| Context window | 256K |
A coding model that goes head-to-head with cloud frontier models (Gemini 3.1 Pro) and runs entirely on-device on a 128GB MacBook Pro.
| Your Mac | Best Pick | Speed | Use Case |
|---|---|---|---|
| 32GB | batiai/nemotron3-nano:iq4 | n/a | General agentic |
| 96GB | batiai/mistral-medium-3.5:iq3 (this) | n/a | Coding agents |
| 128GB | batiai/mistral-medium-3.5:iq4 (this) | n/a | Frontier coding |
| 128GB | batiai/minimax-m2.7:iq3 | 36.7 t/s | 229B Dense, GDPval |
| 192GB+ | batiai/mistral-medium-3.5:q5 | n/a | Max quality |
| 512GB | batiai/kimi-k2.6:iq4 | n/a | 1T MoE, SWE-Bench Pro 58.6 |

| | BatiAI | Third-party |
|---|---|---|
| Source | Official Mistral weights | Re-quantized |
| imatrix | ✅ wikitext-2 200 chunks | Standard |
| Vision | (text-only quants) | (text-only quants) |
| BatiAI signed | ✅ | ❌ |
BatiFlow: free, on-device AI automation for Mac. 5MB app, 100% local, unlimited. 60+ tools, including KakaoTalk, iMessage, Slack, Calendar, browser, and file system.
Quantized from mistralai/Mistral-Medium-3.5-128B. License: Modified MIT.
| Machine | Quant | Cold start | Prompt eval | Token gen | Tested |
|---|---|---|---|---|---|
| MacBook Pro M4 Max 128GB | IQ3_XXS | 35.316s | 36.7 t/s | 6.82 t/s | 2026-05-04 |
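To put the measured rates in practical terms, response time can be estimated from the prompt and output lengths. The sketch below uses the figures from the benchmark row and assumes, as a simplification, that both rates stay constant regardless of context length; in practice throughput tends to drop as the context grows. The function name `estimated_seconds` is hypothetical.

```python
COLD_START_S = 35.316   # seconds to load the model, from the table
PROMPT_EVAL_TPS = 36.7  # prompt tokens processed per second
TOKEN_GEN_TPS = 6.82    # output tokens generated per second


def estimated_seconds(prompt_tokens: int, output_tokens: int, cold: bool = False) -> float:
    """Estimate wall-clock time for one request on the tested machine and quant."""
    total = prompt_tokens / PROMPT_EVAL_TPS + output_tokens / TOKEN_GEN_TPS
    return total + COLD_START_S if cold else total


print(f"{estimated_seconds(2000, 500):.0f} s warm")
print(f"{estimated_seconds(2000, 500, cold=True):.0f} s cold")
```

For a 2,000-token prompt and a 500-token answer this works out to about 128 seconds once the model is loaded, with generation accounting for the larger share. Long agentic loops on this configuration are therefore bounded mainly by the 6.82 t/s generation rate.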