This PR transposes the W_dec weights in 36 layer files so that they match the expected shape (features x hidden_dim).
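
For reference, the shape fix can be sketched as follows. This is a minimal illustration, not the script used for this PR: it uses NumPy arrays as stand-ins for the stored tensors, and the helper name `ensure_features_by_hidden` and the sizes `n_features = 8`, `hidden_dim = 4` are hypothetical.

```python
import numpy as np


def ensure_features_by_hidden(w_dec, n_features, hidden_dim):
    """Return W_dec with shape (n_features, hidden_dim), transposing if needed.

    Hypothetical helper for illustration. If n_features == hidden_dim the
    orientation cannot be inferred from the shape alone, and the array is
    returned unchanged.
    """
    if w_dec.shape == (n_features, hidden_dim):
        return w_dec  # already in the expected layout
    if w_dec.shape == (hidden_dim, n_features):
        # .T is only a view; copy so the saved array is contiguous in memory.
        return np.ascontiguousarray(w_dec.T)
    raise ValueError(
        f"unexpected W_dec shape {w_dec.shape}; "
        f"expected ({n_features}, {hidden_dim}) or its transpose"
    )


# Assumed toy sizes, chosen only to keep the example small.
n_features, hidden_dim = 8, 4

# A decoder matrix stored in the wrong orientation: (hidden_dim, n_features).
stored = np.arange(hidden_dim * n_features, dtype=np.float32).reshape(
    hidden_dim, n_features
)

fixed = ensure_features_by_hidden(stored, n_features, hidden_dim)
print(stored.shape, "->", fixed.shape)
```

Checking the shape before transposing keeps the operation idempotent, so running it over a layer file that is already in the expected layout leaves the weights unchanged.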