·
AI & ML interests
None yet
Organizations
None yet
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-fhm600-batch32-epoch3-8192
Text Generation
• 3B • Updated • 2
Lansechen/Qwen2.5-3B-Distill-ot114k-batch32-epoch3-8192
Text Generation
• 3B • Updated • 1
Lansechen/Qwen2.5-3B-Distill-bs17k-batch32-epoch3-8192
Text Generation
• 3B • Updated • 3
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192-addthinktoken-new
Text Generation
• 3B • Updated • 2
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192-addthinktoken
Text Generation
• 3B • Updated • 1
Lansechen/OLMoE-1B-7B-012-Distill-or-math220k-batch32-epoch3-8192
Text Generation
• 7B • Updated • 6
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192
Text Generation
• 3B • Updated • 5
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch16-epoch3-8192
Updated
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-16384
Updated
Lansechen/OLMoE-1B-7B-0125-Distill-ot114k-batch32-epoch3-8192
Text Generation
• 7B • Updated • 3
Lansechen/OLMoE-1B-7B-0125-Distill-or-math220k-batch32-epoch1-8192
Text Generation
• 7B • Updated • 3
Lansechen/OLMoE-1B-7B-0125-Distill-ot114k-batch32-epoch1-8192
Text Generation
• 7B • Updated • 4
Lansechen/OLMoE-1B-7B-0125-Distill-bs17k-batch32-epoch5-8192
Text Generation
• 7B • Updated • 6
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-or-math220k-batch32
Text Generation
• 7B • Updated • 2
Lansechen/OLMoE-1B-7B-0125-Distill-bs17k-batch32-epoch1-8192
Text Generation
• 7B • Updated • 2
• 1
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-bs17k-batch32-epoch5-8192
Text Generation
• 7B • Updated • 1
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-bs17k-batch32-epoch1-8192
Text Generation
• 7B • Updated • 5
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-ot114k-batch32-epoch2
Updated
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-ot114k-batch32
Text Generation
• 7B • Updated • 6
• 1
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-bs17k-batch32
Text Generation
• 16B • Updated • 9
• 1
Lansechen/Qwen2.5-3B-Instruct-Distill-om220k-batch32
Text Generation
• 3B • Updated • 6
Lansechen/Qwen2.5-3B-Instruct-Distill-ot114k-batch32
Text Generation
• 3B • Updated • 5
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-bs17k-batch32-epoch5
Text Generation
• 7B • Updated • 1
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-batch16-lora-numinamath
Text Generation
• Updated • 6
• 1
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-batch8-numinamath
Text Generation
• 16B • Updated • 7
• 1
Lansechen/Qwen2.5-7B-Open-R1-Distill
Text Generation
• 8B • Updated • 22
Lansechen/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
• 1