nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation • 124B • Updated 3 days ago • 1.07M • 229
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models Paper • 2601.07372 • Published Jan 12 • 47
Article Introducing smolagents: simple agents that write actions in code. • Dec 31, 2024 • 1.19k