Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
smolagents
/
ml-intern
like
317
Running
on
CPU Upgrade
App
Files
Files
Community
33
Fetching metadata from the HF Docker repository...
7b4e2da
ml-intern
/
eval
310 kB
Ctrl+K
Ctrl+K
21 contributors
History:
19 commits
akseljoonas
HF Staff
generated, filled in and verfied 250 eval questions
8541221
5 months ago
scrape_discussions
thinking if we want eval or not
6 months ago
.amp_batch_solve.py.swp
12.3 kB
intermediate commit until i let amp loose
5 months ago
README.md
5.18 kB
eval readme update
5 months ago
__init__.py
89 Bytes
updated eval
5 months ago
check_completeness.py
5.33 kB
generated, filled in and verfied 250 eval questions
5 months ago
claude_batch_solve.py
7.75 kB
generated, filled in and verfied 250 eval questions
5 months ago
create_eval_dataset.py
5.11 kB
dataset creation script
6 months ago
eval_set.ipynb
182 kB
generated, filled in and verfied 250 eval questions
5 months ago
generate_rubrics.py
14.7 kB
generated, filled in and verfied 250 eval questions
5 months ago
generated_tasks_with_difficulty.json
37.5 kB
intermediate commit until i let amp loose
5 months ago
hf_agent_connector.py
2.86 kB
fixing tracing
5 months ago
hf_io.py
7.2 kB
updated eval
5 months ago
leaderboard.py
4.95 kB
rename
5 months ago
models.py
1.51 kB
eval runs
6 months ago
rubric_eval.py
3.82 kB
gpt 5 nano judge
5 months ago
run_eval_with_leaderboard.py
6.79 kB
rename
5 months ago
solvers.py
5.39 kB
fixing tracing
5 months ago
task.py
3.94 kB
gpt 5 nano judge
5 months ago