ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-16k-merged Text Generation • 2B • Updated 3 days ago • 737
ali-elganzory/open-sci-ref-v0.02-1.7b-dclm-300B-4096-longsft_16k Text Generation • 2B • Updated 3 days ago • 300
ali-elganzory/Baguettotron-longsft_16k-DPO-Tulu3-decontaminated Text Generation • 0.3B • Updated 7 days ago • 257
ali-elganzory/1.7b-MixtureVitae-web_curated-100BT-longsft_16k-DPO-Tulu3-decontaminated Feature Extraction • 2B • Updated 7 days ago • 39
ali-elganzory/0.4b-mixturevitae-v1-decontaminated-300B-4096-longsft_16k-DPO-Tulu3-decontaminated Feature Extraction • 0.4B • Updated 7 days ago • 41
ali-elganzory/Baguettotron-longsft_16k-SFT-Tulu3-decontaminated Text Generation • 0.3B • Updated 7 days ago • 411
ali-elganzory/1.7b-MixtureVitae-web_curated-100BT-longsft_16k-SFT-Tulu3-decontaminated Text Generation • 2B • Updated 7 days ago • 441
ali-elganzory/0.4b-mixturevitae-v1-decontaminated-300B-4096-longsft_16k-SFT-Tulu3-decontaminated Feature Extraction • 0.4B • Updated 7 days ago • 48
ali-elganzory/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096-longsft_16k-DPO-Tulu3-decontaminated Text Generation • 2B • Updated 7 days ago • 539
ali-elganzory/1.7b-MixtureVitae-curated_instruct-100BT-longsft_16k-DPO-Tulu3-decontaminated Feature Extraction • 2B • Updated 7 days ago • 47
ali-elganzory/1.7b-MixtureVitae-100BT-longsft_16k-DPO-Tulu3-decontaminated Feature Extraction • 2B • Updated 7 days ago • 49
ali-elganzory/1.7b-Comma0.1-300BT-longsft_16k-DPO-Tulu3-decontaminated Text Generation • 2B • Updated 7 days ago • 537
ali-elganzory/1.7b-MixtureVitae-curated_instruct-100BT-longsft_16k-SFT-Tulu3-decontaminated Feature Extraction • 2B • Updated 7 days ago • 50
ali-elganzory/1.7b-MixtureVitae-100BT-longsft_16k-SFT-Tulu3-decontaminated Feature Extraction • 2B • Updated 7 days ago • 48
ali-elganzory/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096-longsft_16k-SFT-Tulu3-decontaminated Text Generation • 2B • Updated 7 days ago • 459
ali-elganzory/1.7b-Comma0.1-300BT-longsft_16k-SFT-Tulu3-decontaminated Text Generation • 2B • Updated 7 days ago • 444
ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-16k-DPO-Tulu3-decontaminated Feature Extraction • 2B • Updated 7 days ago • 21
ali-elganzory/SmolLM2-1.7B-16k-DPO-Tulu3-decontaminated Text Generation • 2B • Updated 7 days ago • 143
ali-elganzory/SmolLM2-1.7B-16k-SFT-Tulu3-decontaminated Text Generation • 2B • Updated 7 days ago • 144
ali-elganzory/1.7b-MixtureVitae-300BT-v1-decontaminated-16k-SFT-Tulu3-decontaminated Feature Extraction • 2B • Updated 7 days ago • 24
ali-elganzory/0.4b-mixturevitae-v1-decontaminated-300B-4096 Text Generation • 0.4B • Updated 7 days ago • 315
ali-elganzory/open-sci-ref-v0.02-1.7b-fineweb-edu-1.4t-300B-4096 Text Generation • 2B • Updated 7 days ago • 256
ali-elganzory/open-sci-ref-v0.02-1.7b-nemotron-hq-300B-4096 Text Generation • 2B • Updated 7 days ago • 260
ali-elganzory/0.4b-mixturevitae-v1-decontaminated-300B-4096-longsft_16k Feature Extraction • 0.4B • Updated 7 days ago • 39
ali-elganzory/1.7b-MixtureVitae-100BT-longsft_16k Feature Extraction • 2B • Updated 7 days ago • 35 • 1
ali-elganzory/1.7b-MixtureVitae-curated_instruct-100BT-longsft_16k Feature Extraction • 2B • Updated 7 days ago • 28
ali-elganzory/1.7b-MixtureVitae-web_curated-100BT-longsft_16k Feature Extraction • 2B • Updated 7 days ago • 27