Submitted by akhaliq 9 Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust · 4 authors 352 2
Submitted by akhaliq 7 HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance · 2 authors 210 1
Submitted by akhaliq 5 StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation · 10 authors 508 2
Submitted by akhaliq 5 Real-World Image Variation by Aligning Diffusion Inversion Chain · 4 authors 154 1
Submitted by akhaliq 4 Grammar Prompting for Domain-Specific Language Generation with Large Language Models · 6 authors 78 4
Submitted by akhaliq 4 PaLI-X: On Scaling up a Multilingual Vision and Language Model · 43 authors 95
Submitted by akhaliq 4 Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation · 10 authors 192 1
Submitted by akhaliq 2 AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation · 5 authors
Submitted by akhaliq 2 LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images · 4 authors 31
Submitted by akhaliq 1 KAFA: Rethinking Image Ad Understanding with Knowledge-Augmented Feature Adaptation of Vision-Language Models · 7 authors