physix-3b-rl-ckpt / sft /tokenizer.json

Commit History

SFT merged_16bit: Qwen/Qwen2.5-3B-Instruct | epochs=3 lora_r=32
9a4e08a
verified

Pratyush-01 commited on