yale-nlp/mistral-7b-dpo-mistralv2-7b-beta-0.1
Text Generation • 7B • Updated
Natural Language Processing at Yale
RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation
References Improve LLM Alignment in Non-Verifiable Domains