Update submissions.json

#43

training command

hf jobs uv run
--flavor a100-large
--timeout 3h
--secrets HF_TOKEN
dpo_training.py

eval command:

hf jobs uv run
--flavor a10g-large
--with "lighteval[vllm]@git+https://github.com/huggingface/lighteval"
--secrets HF_TOKEN
lighteval vllm "model_name=tomascufaro/smollm3-dpo-aligned"
"lighteval|hellaswag|0,lighteval|mmlu:abstract_algebra|0,lighteval|winogrande|0"
--push-to-hub --results-org tomascufaro

Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment