Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ceselder 's Collections
Loracle: weight-reading model interpretability
CoT Oracle Paper Ablations And Baselines
loracle
CoT Oracle Training Data
CoT Oracle Evals

Loracle: weight-reading model interpretability

updated 3 days ago

Loracles + direction tokens for AuditBench, IA, OOD evals.

Upvote
-

  • ceselder/loracle-k16-realdpo

    Updated 2 days ago

  • ceselder/loracle-k16-dpoready-sft

    Updated 3 days ago

  • ceselder/loracle-k16-pruned-15k-sft

    Updated 3 days ago

  • ceselder/loracle-k16-pruned-sft

    Updated 3 days ago

  • ceselder/loracle-k16-full-sft

    Updated 3 days ago

  • ceselder/loracle-eval-direction-tokens

    Viewer • Updated 3 days ago • 6 • 88

  • ceselder/loracle-ia-14b-direction-tokens

    Updated 3 days ago • 27

  • ceselder/loracle-dpo-training-data

    Viewer • Updated 3 days ago • 893 • 18

  • ceselder/loracle-eval-rollouts

    Viewer • Updated 3 days ago • 4.68k • 65

  • ceselder/loracle-ia-loraqa-pruned

    Viewer • Updated 3 days ago • 453 • 24

  • ceselder/loracle-k16-cispo

    Updated 3 days ago
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs