Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
smolagents
/
ml-intern
like
315
Running
on
CPU Upgrade
App
Files
Files
Community
33
Fetching metadata from the HF Docker repository...
00a57cd
ml-intern
/
eval
Ctrl+K
Ctrl+K
21 contributors
History:
24 commits
akseljoonas
HF Staff
fix: properly close SDK message on error, show tool errorText
c68afb6
about 2 months ago
scrape_discussions
thinking if we want eval or not
6 months ago
README.md
Safe
5.18 kB
eval readme update
5 months ago
__init__.py
Safe
89 Bytes
updated eval
5 months ago
check_completeness.py
Safe
5.33 kB
generated, filled in and verfied 250 eval questions
5 months ago
claude_batch_solve.py
Safe
7.75 kB
generated, filled in and verfied 250 eval questions
5 months ago
create_eval_dataset.py
Safe
5.11 kB
dataset creation script
6 months ago
eval_set.ipynb
Safe
182 kB
generated, filled in and verfied 250 eval questions
5 months ago
generate_rubrics.py
Safe
14.7 kB
generated, filled in and verfied 250 eval questions
5 months ago
generated_tasks_with_difficulty.json
Safe
37.5 kB
intermediate commit until i let amp loose
5 months ago
hf_agent_connector.py
Safe
2.81 kB
fix: properly close SDK message on error, show tool errorText
about 2 months ago
hf_io.py
Safe
7.2 kB
updated eval
5 months ago
leaderboard.py
Safe
4.95 kB
rename
5 months ago
models.py
Safe
1.51 kB
eval runs
6 months ago
rubric_eval.py
Safe
3.82 kB
gpt 5 nano judge
5 months ago
run_eval_with_leaderboard.py
Safe
6.79 kB
rename
5 months ago
solvers.py
Safe
5.15 kB
remove lmnr dependency
about 2 months ago
task.py
Safe
3.94 kB
gpt 5 nano judge
5 months ago