"KLD reduced by ~10%."
#8 opened 3 days ago
by
vgoklani
Next-level!
#7 opened 4 days ago
by
mayhem4markets
2 DGX Spark cluster recipe
🤗 1
#6 opened 5 days ago
by
susni
tokenizer component mismatch and w1_weight_scale_2 must match w3_weight_scale_2. Accuracy may be affected issue
1
#5 opened 5 days ago
by
mtcl
Working configuration for Nvidia Blackwell
7
#4 opened 5 days ago
by
luismiguelsaez
Calibration Dataset Mixture
1
#3 opened 6 days ago
by
vgoklani
Thanks, thanks and more thanks. Many thanks.
13
#2 opened 6 days ago
by
aaron-newsome
w1 not matching w3 weight scales
12
#1 opened 6 days ago
by
dareposte