MyAi
#11 opened about 10 hours ago by Xqm-QaD-KNR-ry5
RuntimeError: expected mat1 and mat2 to have the same dtype, but got: float != c10::Half
#10 opened about 11 hours ago by mancub
It is fast, but after running for a while it easily hits an error that causes vLLM to stop.
1
#9 opened 1 day ago by james0010
Full Local Hands-on Step-by-Step Demo Video
#8 opened 6 days ago by fahdmirzac
Why is there a 4k ctx limit?
2
#7 opened 10 days ago by crazyi
Gibberish output with the latest sglang
2
#6 opened 10 days ago by cudaoom
Would this work with the FP8 version of the model?
4
#5 opened 11 days ago by pathosethoslogos
I used it on Omlx, but it showed thinking as content.
3
#4 opened 12 days ago by BeCreated
Thank you!
3
#3 opened 13 days ago by xneoenx
Average draft acceptance rate is low.
17
#2 opened 14 days ago by fouvy
LLAMA.CPP + ROCm + DFlash on 7900 XTX
🔥 4
3
#1 opened 14 days ago by flamme-demon