Submitted by akhaliq · 36 · AgentTuning: Enabling Generalized Agent Abilities for LLMs · 7 authors · 1.48k · 1
Submitted by akhaliq · 28 · Safe RLHF: Safe Reinforcement Learning from Human Feedback · 8 authors · 1.6k · 5
Submitted by akhaliq · 26 · Eureka: Human-Level Reward Design via Coding Large Language Models · 9 authors · 3.14k · 3
Submitted by akhaliq · 19 · Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning · 5 authors · 70 · 1
Submitted by akhaliq · 14 · Loop Copilot: Conducting AI Ensembles for Music Generation and Iterative Editing · 5 authors · 12 · 1
Submitted by akhaliq · 13 · An Emulator for Fine-Tuning Large Language Models using Small Language Models · 5 authors · 1
Submitted by akhaliq · 13 · An Image is Worth Multiple Words: Learning Object Level Concepts using Multi-Concept Prompt Learning · 5 authors · 1
Submitted by akhaliq · 5 · Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping · 4 authors · 84 · 1