-
Emu3.5: Native Multimodal Models are World Learners
Paper • 2510.26583 • Published • 114 -
Cambrian-S: Towards Spatial Supersensing in Video
Paper • 2511.04670 • Published • 39 -
MagicWorld: Interactive Geometry-driven Video World Exploration
Paper • 2511.18886 • Published • 19 -
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling
Paper • 2512.14614 • Published • 73
Collections
Discover the best community collections!
Collections including paper arxiv:2510.26583
-
Emu3.5: Native Multimodal Models are World Learners
Paper • 2510.26583 • Published • 114 -
RECALL: REpresentation-aligned Catastrophic-forgetting ALLeviation via Hierarchical Model Merging
Paper • 2510.20479 • Published • 12 -
A Definition of AGI
Paper • 2510.18212 • Published • 36 -
Video-As-Prompt: Unified Semantic Control for Video Generation
Paper • 2510.20888 • Published • 50
-
LightMem: Lightweight and Efficient Memory-Augmented Generation
Paper • 2510.18866 • Published • 115 -
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders
Paper • 2510.19779 • Published • 62 -
Emu3.5: Native Multimodal Models are World Learners
Paper • 2510.26583 • Published • 114
-
Emu3.5: Native Multimodal Models are World Learners
Paper • 2510.26583 • Published • 114 -
Cambrian-S: Towards Spatial Supersensing in Video
Paper • 2511.04670 • Published • 39 -
MagicWorld: Interactive Geometry-driven Video World Exploration
Paper • 2511.18886 • Published • 19 -
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling
Paper • 2512.14614 • Published • 73
-
Emu3.5: Native Multimodal Models are World Learners
Paper • 2510.26583 • Published • 114 -
RECALL: REpresentation-aligned Catastrophic-forgetting ALLeviation via Hierarchical Model Merging
Paper • 2510.20479 • Published • 12 -
A Definition of AGI
Paper • 2510.18212 • Published • 36 -
Video-As-Prompt: Unified Semantic Control for Video Generation
Paper • 2510.20888 • Published • 50
-
LightMem: Lightweight and Efficient Memory-Augmented Generation
Paper • 2510.18866 • Published • 115 -
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders
Paper • 2510.19779 • Published • 62 -
Emu3.5: Native Multimodal Models are World Learners
Paper • 2510.26583 • Published • 114