Post
1364
Good news, llama.cpp seems to be close to supporting MTP on qwen models. Bad news, every single gguf will have to be redone when it is.
Join the community of Machine Learners and AI enthusiasts.
Sign UpSeems like they could find a way to add backwards compatibility