-
4K4DGen: Panoramic 4D Generation at 4K Resolution
Paper ⢠2406.13527 ⢠Published ⢠9 -
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
Paper ⢠2406.13393 ⢠Published ⢠5 -
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
Paper ⢠2406.16273 ⢠Published ⢠43 -
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Paper ⢠2406.20076 ⢠Published ⢠10
Collections
Discover the best community collections!
Collections including paper arxiv:2407.06938
-
GECO: Generative Image-to-3D within a SECOnd
Paper ⢠2405.20327 ⢠Published ⢠12 -
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
Paper ⢠2406.03184 ⢠Published ⢠21 -
NPGA: Neural Parametric Gaussian Avatars
Paper ⢠2405.19331 ⢠Published ⢠10 -
Unified Text-to-Image Generation and Retrieval
Paper ⢠2406.05814 ⢠Published ⢠16
-
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Paper ⢠2404.07839 ⢠Published ⢠48 -
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper ⢠2404.03715 ⢠Published ⢠62 -
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
Paper ⢠2404.05674 ⢠Published ⢠15 -
Agentless: Demystifying LLM-based Software Engineering Agents
Paper ⢠2407.01489 ⢠Published ⢠65
-
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
Paper ⢠2403.01807 ⢠Published ⢠9 -
TripoSR: Fast 3D Object Reconstruction from a Single Image
Paper ⢠2403.02151 ⢠Published ⢠16 -
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Paper ⢠2403.01779 ⢠Published ⢠30 -
MagicClay: Sculpting Meshes With Generative Neural Fields
Paper ⢠2403.02460 ⢠Published ⢠8
-
TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion
Paper ⢠2401.09416 ⢠Published ⢠11 -
SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
Paper ⢠2401.10171 ⢠Published ⢠14 -
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
Paper ⢠2311.09217 ⢠Published ⢠22 -
GALA: Generating Animatable Layered Assets from a Single Scan
Paper ⢠2401.12979 ⢠Published ⢠9
-
MotionLLM: Understanding Human Behaviors from Human Motions and Videos
Paper ⢠2405.20340 ⢠Published ⢠20 -
Spectrally Pruned Gaussian Fields with Neural Compensation
Paper ⢠2405.00676 ⢠Published ⢠10 -
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Paper ⢠2404.18212 ⢠Published ⢠30 -
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper ⢠2405.00732 ⢠Published ⢠122
-
2D Gaussian Splatting for Geometrically Accurate Radiance Fields
Paper ⢠2403.17888 ⢠Published ⢠31 -
TC4D: Trajectory-Conditioned Text-to-4D Generation
Paper ⢠2403.17920 ⢠Published ⢠18 -
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Paper ⢠2407.06938 ⢠Published ⢠25 -
TencentARC/InstantMesh
Image-to-3D ⢠Updated ⢠18.9k ⢠332
-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper ⢠2402.17485 ⢠Published ⢠194 -
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Paper ⢠2312.01841 ⢠Published ⢠1 -
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paper ⢠2311.16498 ⢠Published ⢠1 -
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper ⢠2312.02134 ⢠Published ⢠2
-
DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors
Paper ⢠2312.16837 ⢠Published ⢠6 -
Learning the 3D Fauna of the Web
Paper ⢠2401.02400 ⢠Published ⢠11 -
Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model
Paper ⢠2310.15110 ⢠Published ⢠3 -
Zero-1-to-3: Zero-shot One Image to 3D Object
Paper ⢠2303.11328 ⢠Published ⢠4
-
4K4DGen: Panoramic 4D Generation at 4K Resolution
Paper ⢠2406.13527 ⢠Published ⢠9 -
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
Paper ⢠2406.13393 ⢠Published ⢠5 -
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
Paper ⢠2406.16273 ⢠Published ⢠43 -
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Paper ⢠2406.20076 ⢠Published ⢠10
-
GECO: Generative Image-to-3D within a SECOnd
Paper ⢠2405.20327 ⢠Published ⢠12 -
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
Paper ⢠2406.03184 ⢠Published ⢠21 -
NPGA: Neural Parametric Gaussian Avatars
Paper ⢠2405.19331 ⢠Published ⢠10 -
Unified Text-to-Image Generation and Retrieval
Paper ⢠2406.05814 ⢠Published ⢠16
-
MotionLLM: Understanding Human Behaviors from Human Motions and Videos
Paper ⢠2405.20340 ⢠Published ⢠20 -
Spectrally Pruned Gaussian Fields with Neural Compensation
Paper ⢠2405.00676 ⢠Published ⢠10 -
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Paper ⢠2404.18212 ⢠Published ⢠30 -
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper ⢠2405.00732 ⢠Published ⢠122
-
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Paper ⢠2404.07839 ⢠Published ⢠48 -
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper ⢠2404.03715 ⢠Published ⢠62 -
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
Paper ⢠2404.05674 ⢠Published ⢠15 -
Agentless: Demystifying LLM-based Software Engineering Agents
Paper ⢠2407.01489 ⢠Published ⢠65
-
2D Gaussian Splatting for Geometrically Accurate Radiance Fields
Paper ⢠2403.17888 ⢠Published ⢠31 -
TC4D: Trajectory-Conditioned Text-to-4D Generation
Paper ⢠2403.17920 ⢠Published ⢠18 -
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Paper ⢠2407.06938 ⢠Published ⢠25 -
TencentARC/InstantMesh
Image-to-3D ⢠Updated ⢠18.9k ⢠332
-
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
Paper ⢠2403.01807 ⢠Published ⢠9 -
TripoSR: Fast 3D Object Reconstruction from a Single Image
Paper ⢠2403.02151 ⢠Published ⢠16 -
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Paper ⢠2403.01779 ⢠Published ⢠30 -
MagicClay: Sculpting Meshes With Generative Neural Fields
Paper ⢠2403.02460 ⢠Published ⢠8
-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper ⢠2402.17485 ⢠Published ⢠194 -
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Paper ⢠2312.01841 ⢠Published ⢠1 -
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paper ⢠2311.16498 ⢠Published ⢠1 -
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper ⢠2312.02134 ⢠Published ⢠2
-
TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion
Paper ⢠2401.09416 ⢠Published ⢠11 -
SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
Paper ⢠2401.10171 ⢠Published ⢠14 -
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
Paper ⢠2311.09217 ⢠Published ⢠22 -
GALA: Generating Animatable Layered Assets from a Single Scan
Paper ⢠2401.12979 ⢠Published ⢠9
-
DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors
Paper ⢠2312.16837 ⢠Published ⢠6 -
Learning the 3D Fauna of the Web
Paper ⢠2401.02400 ⢠Published ⢠11 -
Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model
Paper ⢠2310.15110 ⢠Published ⢠3 -
Zero-1-to-3: Zero-shot One Image to 3D Object
Paper ⢠2303.11328 ⢠Published ⢠4