BonaFide - a yoavgurarieh Collection

updated about 13 hours ago

A benchmark for evaluating faithfulness metrics against ground-truth labels. The collection includes the leaderboard, the two datasets, and the accompanying paper.

BonaFide Leaderboard

A leaderboard for chain-of-thought faithfulness metrics.

yoavgurarieh/BonaFide

Viewer • Updated about 14 hours ago • 3.07k • 107 • 1

yoavgurarieh/BonaFide-Extended

Viewer • Updated about 14 hours ago • 19.5k • 349 • 2

Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth

Paper • 2605.25052 • Published 3 days ago • 6