The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook Paper • 2604.02029 • Published 18 days ago • 142
LeWM Collection Official checkpoints and datasets related to LeWM paper. • 9 items • Updated 23 days ago • 24
Qwen-3.5-unsloth-mlx Collection AWQ-style pre-scaling using Unsloth's imatrix calibration data, then 3-6-bit affine quantization with the Unsloth mixed-precision recipe via MLX • 20 items • Updated 21 days ago • 20
AHN Collection Artificial Hippocampus Networks (AHNs) for Efficient Long-Context Modeling • 9 items • Updated Oct 9, 2025 • 7
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 14 items • Updated 5 days ago • 54
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 5 days ago • 270
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 12 items • Updated 5 days ago • 139
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 5 days ago • 123
XiYanSQL Models Collection The XiYanSQL series, are foundational SQL models available in various sizes, including 3B, 7B, 14B, and 32B. • 8 items • Updated Mar 15 • 9
Qwen3 Next Collection Alibaba's first hybrid model, designed to cut resources and speed things up. • 8 items • Updated Sep 15, 2025 • 5