Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 26 items • Updated 1 day ago • 151
Qwen3.5-122B-A10B Collection MINT quantized versions of Qwen3.5-122B-A10B at multiple budget targets (MLX & GGUF) • 4 items • Updated 11 days ago • 2
APEX Quants (GGUF) Collection MoE models quantized with the APEX Quantization technique ( https://github.com/mudler/apex-quant ) • 24 items • Updated about 16 hours ago • 50
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated 1 day ago • 145
Qwen-3.5-unsloth-mlx Collection AWQ-style pre-scaling using Unsloth's imatrix calibration data, then 3-6-bit affine quantization with the Unsloth mixed-precision recipe via MLX • 20 items • Updated 19 days ago • 20
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 503
MS3.2-PaintedFantasy-v4-24B MLX Collection MLX Quants of zerofata's MS3.2-PaintedFantasy-v4-24B • 6 items • Updated Mar 6 • 1