Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
yoavgurarieh 's Collections
BonaFide

BonaFide

updated about 13 hours ago

A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets.

Upvote
1

  • Running
    Agents
    3

    BonaFide Leaderboard

    📊
    3

    A leaderboard for chain-of-thought faithfulness metrics.


  • yoavgurarieh/BonaFide

    Viewer • Updated about 14 hours ago • 3.07k • 107 • 1

  • yoavgurarieh/BonaFide-Extended

    Viewer • Updated about 14 hours ago • 19.5k • 349 • 2

  • Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth

    Paper • 2605.25052 • Published 3 days ago • 6
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs