Submitted by akhaliq 38 Octopus: Embodied Vision-Language Programmer from Environmental Feedback · 11 authors 297 4
Submitted by akhaliq 33 Lemur: Harmonizing Natural Language and Code for Language Agents · 16 authors 556 3
Submitted by akhaliq 18 Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation · 7 authors 6
Submitted by akhaliq 18 GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors · 8 authors 822 2
Submitted by akhaliq 16 HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion · 9 authors 497 1
Submitted by akhaliq 16 MotionDirector: Motion Customization of Text-to-Video Diffusion Models · 8 authors 1.05k 5