view article Article 2x Faster on a 229B MoE: EAGLE3 Speculative Decoding for MiniMax-M2.5 4 days ago • 1
view article Article Google Released Gemma-4 Four Days Ago. We Already Made It 1.72× Faster. 7 days ago • 2
view article Article Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs 11 days ago • 6