running "MIXED" gguf with latest llama.cpp gave this error:

#1
by saadsafi - opened

llama.cpp/ggml/src/ggml-backend.cpp:1367: GGML_ASSERT(n_inputs < GGML_SCHED_MAX_SPLIT_INPUTS) failed


Hi, thanks for the report. Unfortunately, I couldn't reproduce the error on the newest Metal build of llama.cpp, where the model ran fine. I hope you can get it working.
