batiai's Collections
🚀 Frontier MoE — 128B–1T

updated 2 days ago

Largest open-weight LLMs, BatiAI-quantized. Mac-runnable from M4 Max 128GB to Mac Studio M3 Ultra 512GB.


  • batiai/DeepSeek-V4-Flash-GGUF

    Text Generation • 284B • Updated 3 days ago • 1.2k • 3

    Note: 284B-A13B MoE • CSA+HCA hybrid attention • SWE-Bench Pro top tier • via batiai/bati.cpp


  • batiai/Kimi-K2.6-GGUF

    Text Generation • 1T • Updated 6 days ago • 6.11k

  • batiai/MiniMax-M2.7-GGUF

    Text Generation • 229B • Updated 22 days ago • 1.31k

  • batiai/Mistral-Medium-3.5-128B-GGUF

    Text Generation • 125B • Updated 2 days ago • 659

    Note: 128B Dense • SWE-Bench Verified 77.6% • Modified MIT • measured 6.8 t/s on M4 Max IQ3
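
The collection description says these models run on Macs with 128GB to 512GB of unified memory. A rough way to sanity-check that range is to estimate the size of the quantized weights from the parameter counts listed above. The sketch below is a back-of-envelope calculation only: the function name and the `ASSUMED_BPW` value are hypothetical, the bits-per-weight figure is an assumption rather than the quantization each repository actually ships, and the estimate ignores KV cache, runtime overhead, and any OS limit on GPU-usable memory.

```python
def quantized_weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# Parameter counts (in billions) as shown on the collection page.
MODELS = {
    "DeepSeek-V4-Flash": 284,
    "Kimi-K2.6": 1000,
    "MiniMax-M2.7": 229,
    "Mistral-Medium-3.5-128B": 125,
}

# Assumed average bits per weight for a roughly 3-bit quantization.
# This is an illustrative value, not a figure taken from the page.
ASSUMED_BPW = 3.5

if __name__ == "__main__":
    for name, params in MODELS.items():
        size = quantized_weight_gib(params, ASSUMED_BPW)
        print(f"{name}: about {size:.0f} GiB of weights at {ASSUMED_BPW} bpw")
```

Comparing each estimate with the machine's unified memory gives a first indication of which quantization level might fit; the actual file sizes in each repository are the authoritative numbers.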
