Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 16 days ago • 857
MolmoWeb Collection This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 8 items • Updated 4 days ago • 24
🛰️🌍 Geospatial Datasets Collection A curated collection of diverse geospatial and satellite imagery datasets. • 52 items • Updated Mar 2 • 31
OlmoEarth Collection OlmoEarth pre-trained and fine-tuned foundation models for remote sensing • 10 items • Updated Dec 23, 2025 • 17
InternVL3.5 Collection This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 45 items • Updated Mar 2 • 107
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated Mar 2 • 79
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces Paper • 2506.00123 • Published May 30, 2025 • 35
D-FINE Collection State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5, 2025 • 56
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality +2 Mar 4, 2025 • 78