Collections
Collections including paper arxiv:2603.03283
-
OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation
Paper • 2601.15369 • Published • 21
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model
Paper • 2601.15892 • Published • 53
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
Paper • 2601.16208 • Published • 55
NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems
Paper • 2601.11004 • Published • 30
-
MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds
Paper • 2508.14879 • Published • 69
VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space
Paper • 2508.19247 • Published • 43
Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels
Paper • 2508.17437 • Published • 37
Multi-View 3D Point Tracking
Paper • 2508.21060 • Published • 23
-
Utonia: Toward One Encoder for All Point Clouds
Paper • 2603.03283 • Published • 185
SIMART: Decomposing Monolithic Meshes into Sim-ready Articulated Assets via MLLM
Paper • 2603.23386 • Published • 40
Learn2Fold: Structured Origami Generation with World Model Planning
Paper • 2603.29585 • Published • 16
ReconPhys: Reconstruct Appearance and Physical Attributes from Single Video
Paper • 2604.07882 • Published • 9
-
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
Paper • 2509.26507 • Published • 550
mHC: Manifold-Constrained Hyper-Connections
Paper • 2512.24880 • Published • 322
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Paper • 2601.00393 • Published • 133
LTX-2: Efficient Joint Audio-Visual Foundation Model
Paper • 2601.03233 • Published • 176
-
RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models
Paper • 2409.19989 • Published • 18
3D Scene Generation: A Survey
Paper • 2505.05474 • Published • 21
What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?
Paper • 2505.22129 • Published • 16
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment
Paper • 2505.18600 • Published • 49