view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 16 days ago • 854
LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels Paper • 2603.19312 • Published Mar 13 • 28
LeWM Collection Official checkpoints and datasets related to LeWM paper. • 9 items • Updated 21 days ago • 24
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published Mar 3 • 103
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 309
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 255
view article Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix Nov 3, 2025 • 65
view article Article We’re open-sourcing our text-to-image model and the process behind it Nov 12, 2025 • 97