ceselder's Collections
Loracle: weight-reading model interpretability
Updated 3 days ago
Loracles + direction tokens for AuditBench, IA, OOD evals.
ceselder/loracle-k16-realdpo • Updated 2 days ago
ceselder/loracle-k16-dpoready-sft • Updated 3 days ago
ceselder/loracle-k16-pruned-15k-sft • Updated 3 days ago
ceselder/loracle-k16-pruned-sft • Updated 3 days ago
ceselder/loracle-k16-full-sft • Updated 3 days ago
ceselder/loracle-eval-direction-tokens • Viewer • Updated 3 days ago • 6 • 88
ceselder/loracle-ia-14b-direction-tokens • Updated 3 days ago • 27
ceselder/loracle-dpo-training-data • Viewer • Updated 3 days ago • 893 • 18
ceselder/loracle-eval-rollouts • Viewer • Updated 3 days ago • 4.68k • 65
ceselder/loracle-ia-loraqa-pruned • Viewer • Updated 3 days ago • 453 • 24
ceselder/loracle-k16-cispo • Updated 3 days ago