Cerebras REAP Collection Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated Feb 25 • 136
✨SimpleChat Collection The SimpleChat series represents our new exploration into Non-Chain-of-Thought (Non-CoT) models, designed to be concise, rational, and empathetic. • 4 items • Updated Mar 11 • 3
Article Memory-efficient Diffusion Transformers with Quanto and Diffusers Jul 30, 2024 • 68