Size mismatch for model.decoder.layers
#4
by IngvarJackal - opened
Tried to run https://github.com/0cc4m/KoboldAI with this model, following the steps from the README.md:
- checked out this repo into `./models/OPT-30B-Erebus-4bit-128g`
- renamed the `.pt` and `.safetensors` files to `4bit.pt` and `4bit.safetensors`
- enabled the experimental UI
- selected `True` in "4-bit" on model load
On loading the model, I got this error:
```
RuntimeError: Error(s) in loading state_dict for OPTForCausalLM:
    size mismatch for model.decoder.layers.0.self_attn.k_proj.qzeros: copying a param with shape torch.Size([56, 896]) from checkpoint, the shape in current model is torch.Size([1, 896]).
    size mismatch for model.decoder.layers.0.self_attn.k_proj.scales: copying a param with shape torch.Size([56, 7168]) from checkpoint, the shape in current model is torch.Size([1, 7168]).
    ...
```
Question: what am I doing wrong?
cc: @SquidHominid maybe you know, since your post indicates you managed to run the model successfully?
This isn't the error I got, but in case it helps: when I tried running the model in KoboldAI, my problem was that mainline KoboldAI doesn't support 4-bit quantized models. For that, you need Oobabooga or Occam's KoboldAI fork.
I found the mistake I made -- the model file should be named `4bit-128g.pt`, not `4bit.pt`.
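For context, the shapes in the error are consistent with the group-size arithmetic. Below is a rough sketch of where the numbers 56 and 896 come from; the exact packing layout is an assumption based on GPTQ-style quantization (4-bit values packed into int32), not taken from the loader's code:

```python
# Hypothetical reconstruction of the checkpoint's qzeros/scales shapes.
# Assumption: GPTQ-style 4-bit quantization with per-group zero-points
# and scales, where eight 4-bit zero-points pack into one int32.
hidden_dim = 7168        # OPT-30B hidden size (k_proj is 7168 x 7168)
group_size = 128         # the "-128g" suffix in the model name
packed = 32 // 4         # 4-bit values packed into int32 -> 8 per word

groups = hidden_dim // group_size               # 7168 / 128 = 56
qzeros_shape = (groups, hidden_dim // packed)   # (56, 896) -- the checkpoint
scales_shape = (groups, hidden_dim)             # (56, 7168) -- the checkpoint

# Without "-128g" in the filename, the loader apparently assumed no
# grouping (a single group), giving shapes (1, 896) and (1, 7168) --
# exactly the "current model" shapes in the size-mismatch error.
print(qzeros_shape, scales_shape)
```

So the filename suffix is how the loader learns the group size; with it missing, the model it builds has the wrong parameter shapes for this checkpoint.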
IngvarJackal changed discussion status to closed