Learning User Preferences Through Interaction for Long-Term Collaboration Paper • 2601.02702 • Published Jan 6 • 2
PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents Paper • 2505.01592 • Published May 2, 2025
Automatic Evaluation of Generative Models with Instruction Tuning Paper • 2310.20072 • Published Oct 30, 2023
Beyond Sample-Level Feedback: Using Reference-Level Feedback to Guide Data Synthesis Paper • 2502.04511 • Published Feb 6, 2025
Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models Paper • 2310.07611 • Published Oct 11, 2023 • 2