yale-nlp/comal-qwen2-1.5b-iter-ipo-round6
Text Generation • 2B • Updated • 1
Natural Language Processing at Yale
RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation
References Improve LLM Alignment in Non-Verifiable Domains