-
SkeletonGaussian: Editable 4D Generation through Gaussian Skeletonization
Paper • 2602.04271 • Published • 1 -
MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting
Paper • 2508.17811 • Published • 7 -
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
Paper • 2507.15454 • Published • 7 -
DepthMaster: Taming Diffusion Models for Monocular Depth Estimation
Paper • 2501.02576 • Published • 15
Collections
Discover the best community collections!
Collections including paper arxiv:2508.17811
-
AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models
Paper • 2506.19851 • Published • 60 -
SeqTex: Generate Mesh Textures in Video Sequence
Paper • 2507.04285 • Published • 10 -
Yume: An Interactive World Generation Model
Paper • 2507.17744 • Published • 92 -
STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer
Paper • 2508.10893 • Published • 31
-
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models
Paper • 2503.10437 • Published • 34 -
Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k
Paper • 2503.09642 • Published • 20 -
VGGT: Visual Geometry Grounded Transformer
Paper • 2503.11651 • Published • 37 -
1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering
Paper • 2503.16422 • Published • 15
-
Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation
Paper • 2508.07981 • Published • 63 -
CharacterShot: Controllable and Consistent 4D Character Animation
Paper • 2508.07409 • Published • 39 -
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
Paper • 2508.10881 • Published • 53 -
Puppeteer: Rig and Animate Your 3D Models
Paper • 2508.10898 • Published • 33
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 196 • 99 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 39 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
SkeletonGaussian: Editable 4D Generation through Gaussian Skeletonization
Paper • 2602.04271 • Published • 1 -
MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting
Paper • 2508.17811 • Published • 7 -
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
Paper • 2507.15454 • Published • 7 -
DepthMaster: Taming Diffusion Models for Monocular Depth Estimation
Paper • 2501.02576 • Published • 15
-
Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation
Paper • 2508.07981 • Published • 63 -
CharacterShot: Controllable and Consistent 4D Character Animation
Paper • 2508.07409 • Published • 39 -
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
Paper • 2508.10881 • Published • 53 -
Puppeteer: Rig and Animate Your 3D Models
Paper • 2508.10898 • Published • 33
-
AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models
Paper • 2506.19851 • Published • 60 -
SeqTex: Generate Mesh Textures in Video Sequence
Paper • 2507.04285 • Published • 10 -
Yume: An Interactive World Generation Model
Paper • 2507.17744 • Published • 92 -
STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer
Paper • 2508.10893 • Published • 31
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 196 • 99 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 39 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models
Paper • 2503.10437 • Published • 34 -
Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k
Paper • 2503.09642 • Published • 20 -
VGGT: Visual Geometry Grounded Transformer
Paper • 2503.11651 • Published • 37 -
1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering
Paper • 2503.16422 • Published • 15