- Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models
  Paper • 2603.17051 • Published • 109
- Versatile Editing of Video Content, Actions, and Dynamics without Training
  Paper • 2603.17989 • Published • 17
- Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding
  Paper • 2603.19235 • Published • 95
- 3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model
  Paper • 2603.18524 • Published • 58
Collections including paper arxiv:2603.17051
- Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models
  Paper • 2603.17051 • Published • 109
- facebook/sam3
  Mask Generation • 0.9B • Updated • 2.18M • 1.9k
- Voxtral TTS Demo
  ⚡ 199 • Generate realistic speech from text with custom or preset voices
- google/gemma-4-31B-it
  Image-Text-to-Text • 33B • Updated • 4M • 2.19k

- Canvas-to-Image: Compositional Image Generation with Multimodal Controls
  Paper • 2511.21691 • Published • 36
- Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
  Paper • 2511.21678 • Published • 12
- ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction
  Paper • 2511.20937 • Published • 16
- Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation
  Paper • 2512.10949 • Published • 47

- Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
  Paper • 2501.18585 • Published • 61
- RWKV-7 "Goose" with Expressive Dynamic State Evolution
  Paper • 2503.14456 • Published • 154
- DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
  Paper • 2503.15265 • Published • 46
- Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
  Paper • 2503.15558 • Published • 50