very slow gguf with latest llama.cpp server
2
#3 opened 11 days ago
by
subbur
Install & run this model easily using llmpm
#2 opened about 1 month ago
by
sarthak-saxena
Performance report with 2 GPUs: 85 t/s
👍 4
1
#1 opened 2 months ago
by
SlavikF