Sky-v2.0-11B / benchmarks.json
Sky v2.0 — CREST 11B (adaptive-depth reasoning)
{
  "code_eval": {
    "passed": 25,
    "total": 25,
    "accuracy": 100.0
  },
  "mmlu_pro_200": {
    "correct": 91,
    "total": 200,
    "accuracy": 45.5
  },
  "identity": "Sky v2.0 / 0labs / Atharvsinh Jadav",
  "architecture": "CREST (4 steps, adaptive halting)",
  "base_model": "Qwen/Qwen3.5-4B",
  "total_params": "11.00B",
  "training_time": "2.3 hours",
  "hardware": "AMD MI300X (205GB VRAM)"
}