- Do NOT use CUDA 13.2 (pinned, #4, opened 3 days ago by danielhanchen)
- Highest performance inference on <8 RTX 6000 Pros setups (#6, opened 2 days ago by curiouspp8)
- Are these the final GGUFs, or are you working on revisions? (1 reply, #5, opened 3 days ago by spanspek)
- IQ4_NL Gibberish in llama.cpp (12 replies, #3, opened 4 days ago by jpsequeira)
- Speed inference UD-IQ2_M (🤯 1, 1 reply, #2, opened 4 days ago by Ukro)
- Any possibility to Re-Quantize GLM-5 quants? (👀 1, 2 replies, #1, opened 4 days ago by elpirater312)