view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 17 days ago • 862
Cosmos-Predict2.5 Collection Improved World Simulation with Video Foundation Models for Physical AI • 2 items • Updated 3 days ago • 20
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation Paper • 2509.19296 • Published Sep 23, 2025 • 28
Running on Zero Agents Featured 1.88k Qwen3-TTS Demo 🎙 1.88k Generate speech audio from text with custom or cloned voices
Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation Paper • 2511.20714 • Published Nov 25, 2025 • 50
Running on Zero Agents Featured 843 FLUX.2 [dev] 💻 843 Generate or edit images from text prompts with optional pictures
PS3: Scaling Vision Pre-Training to 4K Resolution Collection Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/ • 15 items • Updated 3 days ago • 9