Raghav-Singhal/tulu3sft-normal-smollm-1p7b-500B-30n-2048sl-960gbsz Text Generation • 2B • Updated 1 day ago • 282
Raghav-Singhal/normal-smollm-1p7b-500B-30n-2048sl-960gbsz Text Generation • 2B • Updated 1 day ago • 274
Raghav-Singhal/dpo-tulu3-lr1e-6-beta0.05-tulu3sft-100B-normal-fixed-off-policy-if 2B • Updated 10 days ago • 22
Raghav-Singhal/dpo-tulu3-lr1e-6-beta0.1-tulu3sft-100B-normal-fixed-off-policy-if 2B • Updated 10 days ago • 155
Raghav-Singhal/dpo-tulu3-lr5e-7-tulu3sft-100B-no-bad-data-off-policy-if Text Generation • 2B • Updated 11 days ago • 414
Raghav-Singhal/dpo-tulu3-lr5e-7-tulu3sft-100B-normal-fixed-off-policy-if Text Generation • 2B • Updated 11 days ago • 428
Raghav-Singhal/tulu3sft-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-no-bad-data 2B • Updated 11 days ago • 110
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-no-bad-data 2B • Updated 12 days ago • 42
Raghav-Singhal/tulu3-normal-fixed-smollm-1p7b-100B-20n-2048sl-960gbsz-4n-gbs128 2B • Updated 12 days ago • 466
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-sft-tulu3sft Text Generation • 2B • Updated 13 days ago • 185
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz Text Generation • 2B • Updated 13 days ago • 39