arxiv:2512.01282

Kardia-R1: Unleashing LLMs to Reason toward Understanding and Empathy for Emotional Support via Rubric-as-Judge Reinforcement Learning

Published on Dec 1, 2025

Upvote

Authors:

Jiahao Yuan ,

Yuansheng Gao ,

Yucheng Zhou ,

Abstract

KardiaBench and Kardia-R1 address limitations in conversational agents by providing a large-scale dataset and a framework that uses interpretable reward signals to improve emotional reasoning and empathy.

AI-generated summary

As web platforms evolve towards greater personalization and emotional complexity, conversational agents must transcend superficial empathy to demonstrate identity-aware emotional reasoning. However, existing systems face two limitations: (1) reliance on situation-centric datasets lacking persistent user identity, which hampers the capture of personalized affective nuances; and (2) dependence on opaque, coarse reward signals that hinder development of verifiable empathetic reasoning. To address these gaps, we introduce KardiaBench, a large-scale user-grounded benchmark comprising 178,080 QA pairs across 22,080 multi-turn conversations anchored to 671 real-world profiles. The dataset is constructed via a model-in-the-loop pipeline with iterative rubric-guided refinement to ensure psychological plausibility and persona consistency. This progressive empathy pipeline that integrates user comprehension, contextual reasoning, and emotion perception into conversations, followed by iterative critique and rubric-based refinement to ensure psychological plausibility, emotional fidelity, and persona consistency. Building on this, we propose Kardia-R1, a framework that trains models for interpretable, stepwise empathetic cognition. Kardia-R1 leverages Rubric-as-Judge Empathetic Reinforcement Learning (Rubric-ERL), a GRPO-based method that uses explainable, human-aligned rubric rewards to tightly couple user understanding, emotional inference, and supportive response generation. Extensive experiments across four LLM backbones demonstrate that Kardia-R1 consistently outperforms othet methods in emotion accuracy, empathy, relevance, persona consistency, and safety. Our dataset and model will be released at https://github.com/JhCircle/Kardia-R1.

View arXiv page View PDF GitHub 54 auto Add to collection

Community

Jhcircle

Paper author 30 days ago

Hi all!

Kardia-R1 model weights are now live on Hugging Face:

🤗 Dataset: https://huggingface.co/datasets/JhCircle/KardiaBench
Model: https://huggingface.co/Jhcircle/Kardia-R1
Github: https://github.com/JhCircle/Kardia-R1

Citation appreciated if this work helps you! 📄

Open to discussions and collaborations to advance emotional intelligence together.

Best,
JhCircle

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Get this paper in your agent:

hf papers read 2512.01282

Don't have the latest CLI?

curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 1

Datasets citing this paper 1

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2512.01282 in a Space README.md to link it from this page.