I'm getting this error when running Q4_K_M

#1
by daniel7789 - opened

I'm getting this error using llama.cpp: missing tensor 'blk.64.ssm_conv1d.weight'

Are you using the standard llama.cpp? Give it a try with the turbo-tan/llama.cpp-tq3 fork. I think it's because of the MTP layer, which I don't think is supported by upstream llama.cpp for qwen3.5 models.

I tried that too; it showed the exact same error.

Can also confirm it's missing the 'blk.64.ssm_conv1d.weight' tensor. Tried on both the TQ3 fork and vanilla llama.cpp and got the same error.

Hey, sorry everyone. I'm experimenting with MTP and I uploaded the models with MTP included. The Q4_K model should now be without MTP and should work on default llama.cpp (not tested). You could try redownloading the file and testing. If you want to experiment with MTP, I've also renamed the old model for you to play with.
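If you want to check whether a downloaded GGUF actually contains the reported tensor before loading it, you can inspect the tensor names in the file. A minimal sketch, assuming you list the names with the `gguf` Python package that ships with llama.cpp (the sample tensor list below is illustrative, not taken from the actual model):

```python
# Sketch: find which block indices contain a given tensor, so you can tell
# whether e.g. 'blk.64.ssm_conv1d.weight' is actually present in the file.
# With a real file you would obtain the names via:
#   from gguf import GGUFReader
#   names = [t.name for t in GGUFReader("model-Q4_K_M.gguf").tensors]

def blocks_with(names, suffix):
    """Return sorted block indices whose tensor name ends with `suffix`."""
    return sorted(
        int(n.split(".")[1])
        for n in names
        if n.startswith("blk.") and n.endswith(suffix)
    )

# Illustrative tensor list: blocks 0, 63 and 64 carry ssm_conv1d weights.
names = [
    "token_embd.weight",
    "blk.0.ssm_conv1d.weight",
    "blk.0.attn_q.weight",
    "blk.63.ssm_conv1d.weight",
    "blk.64.ssm_conv1d.weight",  # extra MTP block in this hypothetical file
]
print(blocks_with(names, "ssm_conv1d.weight"))  # -> [0, 63, 64]
```

If the block index from the error message (64 here) is missing from the output, the file genuinely lacks that tensor and redownloading the fixed upload should help.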

Runs great! Thanks for the re-upload.

Please share the results; it's my first experiment with abliteration.

Still experimenting with what I can/can't do, so not much to report back on that.

I am, however, only getting about 13-14 tps vs about 25 on the 4-bit quant on 1x 3090. Not sure if it's relevant or useful, but thought I'd add it.

Are you using the TQ version with the turbo-tan llama.cpp and comparing it to standard llama.cpp with Q4? I'm running a local fork with a lot of modifications, and on DGX I'm now at 14.5 tok/sec almost flat (vs 9-11 tok/sec with default Q4_K).