Add ParseBench evaluation results
#17 opened 2 days ago
by
boyang-runllama
fix chat template to avoid empty historical `<think>` blocks
1
#14 opened 11 days ago
by
latent-variable
Can we have a FP8 version?
#13 opened 13 days ago
by
drguolai
Add ScreenSpot-Pro evaluation result
#12 opened about 1 month ago
by
merve
Recipe for full tuning using trl?
#11 opened about 1 month ago
by
celsowm
is this genuinely just overfitting with brittleness pro max or what
1
#10 opened about 1 month ago
by
unokayish182
Instruct
2
#9 opened about 1 month ago
by
karouswissem
Create generation_config.json
#8 opened about 1 month ago
by
jalola
QORA-4B is a 4-billion parameter language model with built-in vision. Pure Rust multimodal inference engine build on Qwen3.5-4B
#7 opened about 1 month ago
by
drdraq
openai.APIConnectionError: Connection error.
#5 opened about 1 month ago
by
kfranic
Installation Video and Testing - Step by Step
#3 opened about 2 months ago
by
fahdmirzac
MMLU EVAL DGX SPARK
#2 opened about 2 months ago
by
RGMC98
Add MMLU-Pro evaluation result
#1 opened about 2 months ago
by
SaylorTwift