ali-elganzory/1.7b-MixtureVitae-100BT-longsft_16k Feature Extraction • 2B • Updated 8 days ago • 35 • 1
ali-elganzory/1.7b-MixtureVitae-curated_instruct-100BT-longsft_16k Feature Extraction • 2B • Updated 8 days ago • 28
ali-elganzory/1.7b-MixtureVitae-web_curated-100BT-longsft_16k Feature Extraction • 2B • Updated 8 days ago • 27
ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-16k Feature Extraction • 2B • Updated 8 days ago • 29
ali-elganzory/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096-4096-longsft_16k Text Generation • 2B • Updated 8 days ago • 428 • 1
ali-elganzory/open-sci-ref-v0.02-1.7b-nemotron-hq-300B-4096-DPO-Tulu3-decontaminated Text Generation • 2B • Updated 10 days ago • 345
ali-elganzory/open-sci-ref-v0.02-1.7b-nemotron-hq-300B-16k-DPO-Tulu3-decontaminated Text Generation • 2B • Updated 10 days ago • 342
ali-elganzory/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096-DPO-Tulu3-decontaminated Text Generation • 2B • Updated 10 days ago • 338
ali-elganzory/Baguettotron-DPO-Tulu3-decontaminated Text Generation • 0.3B • Updated 10 days ago • 342
ali-elganzory/0.4b-mixturevitae-v1-decontaminated-300B-4096-DPO-Tulu3-decontaminated Text Generation • 0.4B • Updated 10 days ago • 341
ali-elganzory/open-sci-ref-v0.02-1.7b-nemotron-hq-300B-4096-SFT-Tulu3-decontaminated Text Generation • 2B • Updated 11 days ago • 287
ali-elganzory/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096-SFT-Tulu3-decontaminated Text Generation • 2B • Updated 11 days ago • 277
ali-elganzory/0.4b-mixturevitae-v1-decontaminated-300B-4096-SFT-Tulu3-decontaminated Text Generation • 0.4B • Updated 11 days ago • 280
ali-elganzory/open-sci-ref-v0.02-1.7b-nemotron-hq-300B-16k-SFT-Tulu3-decontaminated Text Generation • 2B • Updated 12 days ago • 298
ali-elganzory/Baguettotron-SFT-Tulu3-decontaminated Text Generation • 0.3B • Updated 12 days ago • 300
ali-elganzory/open-sci-ref-v0.02-1.7b-nemotron-hq-300B-16384-rope_theta-1M-long_sft_16k Text Generation • 2B • Updated 14 days ago • 210
ali-elganzory/ablation-model-fineweb-edu-DPO-Tulu3-decontaminated Text Generation • 2B • Updated Feb 1 • 3
ali-elganzory/ablation-model-fineweb-edu-SFT-Tulu3-decontaminated Text Generation • 2B • Updated Jan 31 • 4
ali-elganzory/1.7b-Comma0.1-300BT-SFT-Tulu3-decontaminated Text Generation • 2B • Updated Jan 31 • 12
ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-DPO-Tulu3-decontaminated Text Generation • 2B • Updated Jan 26 • 12
ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-SFT-Tulu3-decontaminated Text Generation • 2B • Updated Jan 26 • 13