Naahraf27/npo_llama-3.2-1b-instruct_forget10_ep10_lr5e-5_alpha1.0_beta0.1 Text Generation • 1B • Updated 2 days ago • 654
Naahraf27/npo_llama-3.2-3b-instruct_forget10_ep5_lr2e-5_alpha2.0_beta0.1 Text Generation • 3B • Updated about 24 hours ago • 467
Naahraf27/npo_llama-3.1-8b-instruct_forget10_ep5_lr5e-5_alpha2.0_beta0.1 Text Generation • 8B • Updated 2 days ago • 657