Raghav-Singhal/tulu3sft-normal-smollm-1p7b-500B-30n-2048sl-960gbsz Text Generation • 2B • Updated 3 days ago • 293
Raghav-Singhal/tulu3sft-normal-smollm-1p7b-500B-30n-2048sl-960gbsz Text Generation • 2B • Updated 3 days ago • 293
Raghav-Singhal/normal-smollm-1p7b-500B-30n-2048sl-960gbsz Text Generation • 2B • Updated 3 days ago • 276
Raghav-Singhal/normal-smollm-1p7b-500B-30n-2048sl-960gbsz Text Generation • 2B • Updated 3 days ago • 276
Raghav-Singhal/dpo-tulu3-lr1e-6-beta0.05-tulu3sft-100B-normal-fixed-off-policy-if 2B • Updated 12 days ago • 22
Raghav-Singhal/dpo-tulu3-lr1e-6-beta0.1-tulu3sft-100B-normal-fixed-off-policy-if 2B • Updated 12 days ago • 161
Raghav-Singhal/dpo-tulu3-lr1e-6-beta0.1-tulu3sft-100B-normal-fixed-off-policy-if 2B • Updated 12 days ago • 161
Raghav-Singhal/dpo-tulu3-lr1e-6-beta0.05-tulu3sft-100B-normal-fixed-off-policy-if 2B • Updated 12 days ago • 22
Raghav-Singhal/dpo-tulu3-lr5e-7-tulu3sft-100B-no-bad-data-off-policy-if Text Generation • 2B • Updated 12 days ago • 416
Raghav-Singhal/dpo-tulu3-lr5e-7-tulu3sft-100B-normal-fixed-off-policy-if Text Generation • 2B • Updated 12 days ago • 431
Raghav-Singhal/dpo-tulu3-lr5e-7-tulu3sft-100B-no-bad-data-off-policy-if Text Generation • 2B • Updated 12 days ago • 416
Raghav-Singhal/dpo-tulu3-lr5e-7-tulu3sft-100B-normal-fixed-off-policy-if Text Generation • 2B • Updated 12 days ago • 431
Raghav-Singhal/tulu3sft-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-no-bad-data 2B • Updated 13 days ago • 132
Raghav-Singhal/tulu3sft-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-no-bad-data 2B • Updated 13 days ago • 132
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-no-bad-data 2B • Updated 13 days ago • 42
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-no-bad-data 2B • Updated 13 days ago • 42
Raghav-Singhal/tulu3-normal-fixed-smollm-1p7b-100B-20n-2048sl-960gbsz-4n-gbs128 2B • Updated 14 days ago • 508
Raghav-Singhal/tulu3-normal-fixed-smollm-1p7b-100B-20n-2048sl-960gbsz-4n-gbs128 2B • Updated 14 days ago • 508
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-sft-tulu3sft Text Generation • 2B • Updated 15 days ago • 187
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-sft-tulu3sft Text Generation • 2B • Updated 15 days ago • 187