vanilla GRPO: backport EvidenceCallback for live evidence/*.csv + plots 11307a1 verified anugrahhu commited on 12 days ago
fix: disable fast_inference (vLLM not installed) in training/evaluate.py 8f805e2 verified anugrahhu commited on 13 days ago
fix: disable fast_inference (vLLM not installed) in training/training_unsloth.py f82f913 verified anugrahhu commited on 13 days ago