bad quality code generation

by saadsafi - opened Dec 16, 2025

Dec 16, 2025 • edited Dec 16, 2025

Q5_K_M gives low-quality code on the standard tests I run. Also, Q5_K_S is significantly smaller than Q4_K_M, which is unexpected.
I get the same low-quality code from the "unsloth" quants (both UD-Q4_K_XL and UD-Q3_K_XL)
and from the "lmstudio-community" Q4_K_M.
I'm using the latest llama.cpp release(s) on Windows (CUDA 13.1).
There seems to be something special about this model.
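
For anyone who wants to check the size oddity on their own downloads, here is a minimal sketch. It assumes the usual llama.cpp ordering where Q4_K_M < Q5_K_S < Q5_K_M by file size. The helper names and the sizes in the example are hypothetical placeholders, not measurements from this model, and file size includes metadata, so the bits-per-weight figure is only approximate.

```python
def bits_per_weight(file_size_bytes: int, n_params: int) -> float:
    """Approximate effective bits per weight: total file bits / parameter count."""
    return file_size_bytes * 8 / n_params


def size_order_anomalies(sizes: dict, expected_order: list) -> list:
    """Return (a, b) pairs where quant b is expected to be larger than a but is not.

    `sizes` maps quant name -> file size in bytes.
    `expected_order` lists quant names from smallest to largest (an assumption).
    """
    present = [q for q in expected_order if q in sizes]
    return [(a, b) for a, b in zip(present, present[1:]) if sizes[b] <= sizes[a]]


if __name__ == "__main__":
    # Placeholder sizes in bytes, for illustration only.
    example_sizes = {"Q4_K_M": 10_000, "Q5_K_S": 8_000, "Q5_K_M": 11_000}
    order = ["Q4_K_M", "Q5_K_S", "Q5_K_M"]
    print(size_order_anomalies(example_sizes, order))
```

In practice the sizes could come from `os.path.getsize` on the downloaded GGUF files. A flagged pair only says the file sizes are out of the usual order; it does not by itself explain the low-quality output.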

ilintar

Dec 17, 2025

This model has a very weird architecture, so it might be that the standard techniques for optimizing quants don't work very well here.
