Running 3.79k The Ultra-Scale Playbook ๐ 3.79k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/deepseek-coder-6.7b-instruct Text Generation โข 7B โข Updated Feb 2, 2024 โข 138k โข 490