Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
smolagents
/
ml-intern
Running on CPU Upgrade

App Files Files Community
33
Fetching metadata from the HF Docker repository...
ml-intern / eval
310 kB
Ctrl+K
Ctrl+K
  • 21 contributors
History: 19 commits
akseljoonas's picture
akseljoonas HF Staff
generated, filled in and verfied 250 eval questions
8541221 5 months ago
  • scrape_discussions
    thinking if we want eval or not 6 months ago
  • .amp_batch_solve.py.swp
    12.3 kB
    intermediate commit until i let amp loose 5 months ago
  • README.md
    5.18 kB
    eval readme update 5 months ago
  • __init__.py
    89 Bytes
    updated eval 5 months ago
  • check_completeness.py
    5.33 kB
    generated, filled in and verfied 250 eval questions 5 months ago
  • claude_batch_solve.py
    7.75 kB
    generated, filled in and verfied 250 eval questions 5 months ago
  • create_eval_dataset.py
    5.11 kB
    dataset creation script 6 months ago
  • eval_set.ipynb
    182 kB
    generated, filled in and verfied 250 eval questions 5 months ago
  • generate_rubrics.py
    14.7 kB
    generated, filled in and verfied 250 eval questions 5 months ago
  • generated_tasks_with_difficulty.json
    37.5 kB
    intermediate commit until i let amp loose 5 months ago
  • hf_agent_connector.py
    2.86 kB
    fixing tracing 5 months ago
  • hf_io.py
    7.2 kB
    updated eval 5 months ago
  • leaderboard.py
    4.95 kB
    rename 5 months ago
  • models.py
    1.51 kB
    eval runs 6 months ago
  • rubric_eval.py
    3.82 kB
    gpt 5 nano judge 5 months ago
  • run_eval_with_leaderboard.py
    6.79 kB
    rename 5 months ago
  • solvers.py
    5.39 kB
    fixing tracing 5 months ago
  • task.py
    3.94 kB
    gpt 5 nano judge 5 months ago