ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-16k-merged Text Generation • 2B • Updated about 5 hours ago
ali-elganzory/open-sci-ref-v0.02-1.7b-dclm-300B-4096-longsft_16k Text Generation • 2B • Updated about 5 hours ago
ali-elganzory/Baguettotron-longsft_16k-DPO-Tulu3-decontaminated Text Generation • 0.3B • Updated 4 days ago • 248
ali-elganzory/1.7b-MixtureVitae-web_curated-100BT-longsft_16k-DPO-Tulu3-decontaminated Feature Extraction • 2B • Updated 4 days ago • 37
ali-elganzory/0.4b-mixturevitae-v1-decontaminated-300B-4096-longsft_16k-DPO-Tulu3-decontaminated Feature Extraction • 0.4B • Updated 4 days ago • 38
ali-elganzory/Baguettotron-longsft_16k-SFT-Tulu3-decontaminated Text Generation • 0.3B • Updated 4 days ago • 403
ali-elganzory/1.7b-MixtureVitae-web_curated-100BT-longsft_16k-SFT-Tulu3-decontaminated Text Generation • 2B • Updated 4 days ago • 431
ali-elganzory/0.4b-mixturevitae-v1-decontaminated-300B-4096-longsft_16k-SFT-Tulu3-decontaminated Feature Extraction • 0.4B • Updated 4 days ago • 44
ali-elganzory/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096-longsft_16k-DPO-Tulu3-decontaminated Text Generation • 2B • Updated 4 days ago • 526
ali-elganzory/1.7b-MixtureVitae-curated_instruct-100BT-longsft_16k-DPO-Tulu3-decontaminated Feature Extraction • 2B • Updated 4 days ago • 44
ali-elganzory/1.7b-MixtureVitae-100BT-longsft_16k-DPO-Tulu3-decontaminated Feature Extraction • 2B • Updated 4 days ago • 45
ali-elganzory/1.7b-Comma0.1-300BT-longsft_16k-DPO-Tulu3-decontaminated Text Generation • 2B • Updated 4 days ago • 527
ali-elganzory/1.7b-MixtureVitae-curated_instruct-100BT-longsft_16k-SFT-Tulu3-decontaminated Feature Extraction • 2B • Updated 4 days ago • 47
ali-elganzory/1.7b-MixtureVitae-100BT-longsft_16k-SFT-Tulu3-decontaminated Feature Extraction • 2B • Updated 4 days ago • 44
ali-elganzory/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096-longsft_16k-SFT-Tulu3-decontaminated Text Generation • 2B • Updated 4 days ago • 448
ali-elganzory/1.7b-Comma0.1-300BT-longsft_16k-SFT-Tulu3-decontaminated Text Generation • 2B • Updated 4 days ago • 434
ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-16k-DPO-Tulu3-decontaminated Feature Extraction • 2B • Updated 4 days ago • 20
ali-elganzory/SmolLM2-1.7B-16k-DPO-Tulu3-decontaminated Text Generation • 2B • Updated 4 days ago • 133
ali-elganzory/SmolLM2-1.7B-16k-SFT-Tulu3-decontaminated Text Generation • 2B • Updated 4 days ago • 134