reasoning-degeneration-dev/ttt-discover-circle_packing_26-qwen3-8b-v2 Viewer • Updated 14 days ago • 24 • 41
reasoning-degeneration-dev/ttt-discover-circle_packing_26-qwen3-8b-v1 Viewer • Updated 14 days ago • 20 • 68 • 1
Algorithmic SFT vs Distillation Collection 10 LoRA adapters + 6 datasets. Algo template SFT vs QwQ distillation on Qwen2.5-1.5B-Instruct across 4 reasoning domains. • 16 items • Updated 17 days ago