Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs Paper • 2503.05139 • Published Mar 7, 2025 • 6
Granite 2.0 Code Models Collection Code models for generation, understanding, and instruction-following tasks. • 22 items • Updated 13 days ago • 202