Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Antieval
non-profit
Activity Feed
Follow
4
AI & ML interests
None defined yet.
Recent Activity
evalevanto
updated
a dataset
2 days ago
antieval/repro
rb
updated
a dataset
7 days ago
antieval/generator_confound_capped
evalevanto
published
a dataset
8 days ago
antieval/repro
View all activity
Team members
4
antieval
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Articles
evalevanto
updated
a dataset
2 days ago
antieval/repro
Updated
2 days ago
•
144
rb
updated
a dataset
7 days ago
antieval/generator_confound_capped
Updated
14 days ago
•
393
evalevanto
published
a dataset
8 days ago
antieval/repro
Updated
2 days ago
•
144
rb
updated
a dataset
9 days ago
antieval/repro_deploy
Updated
9 days ago
•
221
rb
in
antieval/repro
10 days ago
Add dataclaw deployment dataset (diverse 40 per model with tools)
1
#1 opened 10 days ago by
rb
rb
published
a dataset
10 days ago
antieval/repro_deploy
Updated
9 days ago
•
221
rb
published
a dataset
14 days ago
antieval/generator_confound_capped
Updated
14 days ago
•
393
rb
updated
a dataset
16 days ago
antieval/generator_confound
Updated
16 days ago
•
107
rb
published
a dataset
24 days ago
antieval/generator_confound
Updated
16 days ago
•
107
rb
updated
a dataset
27 days ago
antieval/frontier_sweep_evals
Updated
27 days ago
•
99
rb
published
a dataset
29 days ago
antieval/frontier_sweep_evals
Updated
27 days ago
•
99
rb
updated
a dataset
30 days ago
antieval/swebench-trajectories
Viewer
•
Updated
30 days ago
•
200
•
22
rb
published
a dataset
30 days ago
antieval/swebench-trajectories
Viewer
•
Updated
30 days ago
•
200
•
22
rb
updated
a dataset
30 days ago
antieval/cybench-trajectories
Viewer
•
Updated
30 days ago
•
190
•
13
rb
published
a dataset
30 days ago
antieval/cybench-trajectories
Viewer
•
Updated
30 days ago
•
190
•
13
rb
updated
a dataset
30 days ago
antieval/agentharm-trajectories
Viewer
•
Updated
30 days ago
•
160
•
14
rb
published
a dataset
30 days ago
antieval/agentharm-trajectories
Viewer
•
Updated
30 days ago
•
160
•
14
Load more