view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 17 days ago • 864
Mistral Small 4 Collection A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated Mar 16 • 66
Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World Clusters Paper • 2405.18093 • Published May 28, 2024 • 1