yale-nlp/AgentTrek-1.0-32B_webarena-verified_milestone-bert
0.1B • Updated
Natural Language Processing at Yale
RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation
References Improve LLM Alignment in Non-Verifiable Domains