-
BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation
Paper • 2401.17053 • Published • 33 -
FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction
Paper • 2509.21657 • Published • 4 -
VGGT: Visual Geometry Grounded Transformer
Paper • 2503.11651 • Published • 37 -
GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing
Paper • 2508.02831 • Published • 12
Collections
Discover the best community collections!
Collections including paper arxiv:2402.15391
-
LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes
Paper • 2311.13384 • Published • 53 -
Disentangled 3D Scene Generation with Layout Learning
Paper • 2402.16936 • Published • 11 -
WonderWorld: Interactive 3D Scene Generation from a Single Image
Paper • 2406.09394 • Published • 3 -
VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
Paper • 2504.01956 • Published • 41
-
Customizing Text-to-Image Models with a Single Image Pair
Paper • 2405.01536 • Published • 22 -
Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models
Paper • 2404.03913 • Published -
LCM-Lookahead for Encoder-based Text-to-Image Personalization
Paper • 2404.03620 • Published • 1 -
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Paper • 2404.12333 • Published • 1
-
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper • 2402.14083 • Published • 47 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 628 -
Genie: Generative Interactive Environments
Paper • 2402.15391 • Published • 72 -
Humanoid Locomotion as Next Token Prediction
Paper • 2402.19469 • Published • 29
-
Xtra-Computing/XtraGPT-14B
Text Generation • Updated • 1.26k • 3 -
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development
Paper • 2601.11077 • Published • 67 -
Molecular Contrastive Learning with Chemical Element Knowledge Graph
Paper • 2112.00544 • Published • 1 -
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
Paper • 2404.00884 • Published • 1
-
Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era
Paper • 2305.06131 • Published • 2 -
Perpetual Humanoid Control for Real-time Simulated Avatars
Paper • 2305.06456 • Published • 1 -
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold
Paper • 2305.10973 • Published • 39 -
LDM3D: Latent Diffusion Model for 3D
Paper • 2305.10853 • Published • 13
-
BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation
Paper • 2401.17053 • Published • 33 -
FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction
Paper • 2509.21657 • Published • 4 -
VGGT: Visual Geometry Grounded Transformer
Paper • 2503.11651 • Published • 37 -
GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing
Paper • 2508.02831 • Published • 12
-
Xtra-Computing/XtraGPT-14B
Text Generation • Updated • 1.26k • 3 -
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development
Paper • 2601.11077 • Published • 67 -
Molecular Contrastive Learning with Chemical Element Knowledge Graph
Paper • 2112.00544 • Published • 1 -
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
Paper • 2404.00884 • Published • 1
-
LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes
Paper • 2311.13384 • Published • 53 -
Disentangled 3D Scene Generation with Layout Learning
Paper • 2402.16936 • Published • 11 -
WonderWorld: Interactive 3D Scene Generation from a Single Image
Paper • 2406.09394 • Published • 3 -
VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
Paper • 2504.01956 • Published • 41
-
Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era
Paper • 2305.06131 • Published • 2 -
Perpetual Humanoid Control for Real-time Simulated Avatars
Paper • 2305.06456 • Published • 1 -
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold
Paper • 2305.10973 • Published • 39 -
LDM3D: Latent Diffusion Model for 3D
Paper • 2305.10853 • Published • 13
-
Customizing Text-to-Image Models with a Single Image Pair
Paper • 2405.01536 • Published • 22 -
Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models
Paper • 2404.03913 • Published -
LCM-Lookahead for Encoder-based Text-to-Image Personalization
Paper • 2404.03620 • Published • 1 -
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Paper • 2404.12333 • Published • 1
-
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Paper • 2402.14083 • Published • 47 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 628 -
Genie: Generative Interactive Environments
Paper • 2402.15391 • Published • 72 -
Humanoid Locomotion as Next Token Prediction
Paper • 2402.19469 • Published • 29