Jackrong/Qwen3.5-2B-Claude-4.6-Opus-Reasoning-Distilled-GGUF Text Generation • 2B • Updated 30 days ago • 72k • 147
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated 8 days ago • 589k • 2.63k
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better Paper • 2602.05393 • Published Feb 5 • 8
Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models Paper • 2512.06266 • Published Dec 6, 2025 • 8
DavidAU/ERNIE-4.5-37B-A3B-Thinking-Brainstorm20x Text Generation • 37B • Updated Sep 17, 2025 • 20 • 4
DavidAU/MN-CaptainErisNebula-Chimera-v1.1-THINKING-ClaudeOpus4.5-12B-heretic-uncensored Text Generation • 12B • Updated Jan 7 • 122 • 9