GGUF Please
#2
by anubhav200 - opened
Do you plan to publish GGUFs as well ?
Please!!
Tried quantizing but SarvamMoE is a new architecture it seems. The llamacpp script (convert_hf_to_gguf) failed. Would need help mapping the tensors appropriately.
Tried quantizing but SarvamMoE is a new architecture it seems. The llamacpp script (convert_hf_to_gguf) failed. Would need help mapping the tensors appropriately.
Yeah same error