Qwen/Qwen3-Coder-480B-A35B-Instruct Text Generation • 480B • Updated Aug 21, 2025 • 78.1k • 1.32k
Alibaba-NLP/gte-Qwen2-7B-instruct Sentence Similarity • 8B • Updated Mar 24, 2025 • 280k • 479
Running • 3.78k • The Ultra-Scale Playbook 🌌 • The ultimate guide to training LLM on large GPU Clusters