Ailiance-fr
/

devstral-docker-devops-bf16-lora

@@ -111,3 +111,22 @@ LoRA weights: **apache-2.0** — see License chain table above for derivation ra
 ## Related
 See the full [Ailiance-fr LoRA collection](https://huggingface.co/Ailiance-fr).

 ## Related
 See the full [Ailiance-fr LoRA collection](https://huggingface.co/Ailiance-fr).
+## Bench comparison (2026-05-11)
+### Base model (Devstral-Small-2-24B-MLX-4bit) capability
+| Task | Score | Notes |
+|---|---:|---|
+| GSM8K-CoT flex EM | **0.96** | W3 lm-eval-harness (--limit 100) |
+| ARC-Easy acc / acc_norm | **0.80 / 0.75** | |
+| MMLU-Pro Computer Science | **0.64** | |
+Source: <https://github.com/ailiance/ailiance/tree/main/output/lm-eval-base-2026-05-11>
+### This LoRA (tuned) — bench PENDING
+Will include kicad-sch / iact-bench validators + W3 lm-eval delta. See spec for
+methodology:
+<https://github.com/ailiance/ailiance-bench/blob/main/docs/superpowers/specs/2026-05-11-kicad-sch-gap-design.md>