view article Article Nucleus-Image: Scaling Text-to-Image with Sparse Mixture of Experts 3 days ago • 8
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 309
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2604.12374 • Published 3 days ago • 26
MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 9 days ago • 38
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning Paper • 2604.04746 • Published 9 days ago • 70
Query-Kontext: An Unified Multimodal Model for Image Generation and Editing Paper • 2509.26641 • Published Sep 30, 2025 • 4
Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers Paper • 2603.27666 • Published 19 days ago • 18
Salt: Self-Consistent Distribution Matching with Cache-Aware Training for Fast Video Generation Paper • 2604.03118 • Published 14 days ago • 6
SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing Paper • 2604.04911 • Published 11 days ago • 35
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 11 days ago • 107
Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision Paper • 2604.04934 • Published 11 days ago • 42
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 15 days ago • 852
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers Paper • 2603.24414 • Published 22 days ago • 183
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 15 days ago • 40