-
Colorful Diffuse Intrinsic Image Decomposition in the Wild
Paper ⢠2409.13690 ⢠Published ⢠13 -
Latent Intrinsics Emerge from Training to Relight
Paper ⢠2405.21074 ⢠Published ⢠1 -
Reflecting Reality: Enabling Diffusion Models to Produce Faithful Mirror Reflections
Paper ⢠2409.14677 ⢠Published ⢠15 -
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces
Paper ⢠2501.09756 ⢠Published ⢠20
Collections
Discover the best community collections!
Collections including paper arxiv:2312.03704
-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper ⢠2402.17485 ⢠Published ⢠194 -
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Paper ⢠2312.01841 ⢠Published ⢠1 -
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paper ⢠2311.16498 ⢠Published ⢠1 -
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper ⢠2312.02134 ⢠Published ⢠2
-
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance
Paper ⢠2401.15687 ⢠Published ⢠24 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper ⢠2312.03029 ⢠Published ⢠27 -
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper ⢠2312.13578 ⢠Published ⢠29 -
Splatter Image: Ultra-Fast Single-View 3D Reconstruction
Paper ⢠2312.13150 ⢠Published ⢠15
-
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper ⢠2312.13578 ⢠Published ⢠29 -
Splatter Image: Ultra-Fast Single-View 3D Reconstruction
Paper ⢠2312.13150 ⢠Published ⢠15 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper ⢠2312.03029 ⢠Published ⢠27 -
Relightable Gaussian Codec Avatars
Paper ⢠2312.03704 ⢠Published ⢠32
-
3D-LLM: Injecting the 3D World into Large Language Models
Paper ⢠2307.12981 ⢠Published ⢠40 -
Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study
Paper ⢠2401.17981 ⢠Published ⢠1 -
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM
Paper ⢠2312.02126 ⢠Published ⢠2 -
Relightable Gaussian Codec Avatars
Paper ⢠2312.03704 ⢠Published ⢠32
-
aMUSEd: An Open MUSE Reproduction
Paper ⢠2401.01808 ⢠Published ⢠31 -
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Paper ⢠2401.01885 ⢠Published ⢠28 -
SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
Paper ⢠2401.00604 ⢠Published ⢠6 -
LARP: Language-Agent Role Play for Open-World Games
Paper ⢠2312.17653 ⢠Published ⢠33
-
SEEAvatar: Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance
Paper ⢠2312.08889 ⢠Published ⢠15 -
Towards Practical Capture of High-Fidelity Relightable Avatars
Paper ⢠2309.04247 ⢠Published ⢠10 -
Learning Disentangled Avatars with Hybrid 3D Representations
Paper ⢠2309.06441 ⢠Published ⢠5 -
Text-Guided Generation and Editing of Compositional 3D Avatars
Paper ⢠2309.07125 ⢠Published ⢠6
-
Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping
Paper ⢠2310.12474 ⢠Published ⢠5 -
Drivable 3D Gaussian Avatars
Paper ⢠2311.08581 ⢠Published ⢠47 -
SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering
Paper ⢠2311.12775 ⢠Published ⢠30 -
Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models
Paper ⢠2311.13141 ⢠Published ⢠16
-
Colorful Diffuse Intrinsic Image Decomposition in the Wild
Paper ⢠2409.13690 ⢠Published ⢠13 -
Latent Intrinsics Emerge from Training to Relight
Paper ⢠2405.21074 ⢠Published ⢠1 -
Reflecting Reality: Enabling Diffusion Models to Produce Faithful Mirror Reflections
Paper ⢠2409.14677 ⢠Published ⢠15 -
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces
Paper ⢠2501.09756 ⢠Published ⢠20
-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper ⢠2402.17485 ⢠Published ⢠194 -
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Paper ⢠2312.01841 ⢠Published ⢠1 -
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paper ⢠2311.16498 ⢠Published ⢠1 -
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper ⢠2312.02134 ⢠Published ⢠2
-
3D-LLM: Injecting the 3D World into Large Language Models
Paper ⢠2307.12981 ⢠Published ⢠40 -
Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study
Paper ⢠2401.17981 ⢠Published ⢠1 -
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM
Paper ⢠2312.02126 ⢠Published ⢠2 -
Relightable Gaussian Codec Avatars
Paper ⢠2312.03704 ⢠Published ⢠32
-
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance
Paper ⢠2401.15687 ⢠Published ⢠24 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper ⢠2312.03029 ⢠Published ⢠27 -
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper ⢠2312.13578 ⢠Published ⢠29 -
Splatter Image: Ultra-Fast Single-View 3D Reconstruction
Paper ⢠2312.13150 ⢠Published ⢠15
-
aMUSEd: An Open MUSE Reproduction
Paper ⢠2401.01808 ⢠Published ⢠31 -
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Paper ⢠2401.01885 ⢠Published ⢠28 -
SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
Paper ⢠2401.00604 ⢠Published ⢠6 -
LARP: Language-Agent Role Play for Open-World Games
Paper ⢠2312.17653 ⢠Published ⢠33
-
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper ⢠2312.13578 ⢠Published ⢠29 -
Splatter Image: Ultra-Fast Single-View 3D Reconstruction
Paper ⢠2312.13150 ⢠Published ⢠15 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper ⢠2312.03029 ⢠Published ⢠27 -
Relightable Gaussian Codec Avatars
Paper ⢠2312.03704 ⢠Published ⢠32
-
SEEAvatar: Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance
Paper ⢠2312.08889 ⢠Published ⢠15 -
Towards Practical Capture of High-Fidelity Relightable Avatars
Paper ⢠2309.04247 ⢠Published ⢠10 -
Learning Disentangled Avatars with Hybrid 3D Representations
Paper ⢠2309.06441 ⢠Published ⢠5 -
Text-Guided Generation and Editing of Compositional 3D Avatars
Paper ⢠2309.07125 ⢠Published ⢠6
-
Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping
Paper ⢠2310.12474 ⢠Published ⢠5 -
Drivable 3D Gaussian Avatars
Paper ⢠2311.08581 ⢠Published ⢠47 -
SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering
Paper ⢠2311.12775 ⢠Published ⢠30 -
Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models
Paper ⢠2311.13141 ⢠Published ⢠16