hexa-forge-code-7b-qwen2.5-lora-r64-v0.4.0-delegate (r40)

⚠️ LABELED EXPERIMENT — NOT GA. This is the v0.4.0 SFT delegation implementation (round 40). It missed every spec §11 acceptance gate and exhibits a Lever-4-RL→SFT conflict that erased the T4 enum capability. The actual v0.4.0 GA is dancinlab/hexa-forge-code-7b-qwen2.5-lora-r64-v0.4.0-rl-t4-v3-t3patch (r39, 94.29% Mk.I). Use that one for production.

Why this exists

To document empirically that vanilla SFT cannot install routing intelligence on a saturated 7B+LoRA specialist without erasing capability.

The forge v0.4.0 design (papers/spec-delegation-v0.4.0.md in dancinlab/hexa-codex) called for a 840-pair delegation SFT block on top of the r39 v3-t3patch specialist. r40 executed that plan exactly, and the result is captured here for the record.

Scores (Mk.I 665 strict on r38-fixed manifest)

family	r39 GA	r40 (this)	Δ
Mk.I overall	94.29%	82.71%	−11.58 ⚠
T1 syntax	97.6%	76.5%	−21.1 ⚠
T2 atlas	87.0%	78.0%	−9.0
T3 @grace	100.0%	98.8%	(held)
T4 enum	100.0%	77.0%	−23.0 ⚠
T5 HX-codes	94.8%	86.5%	−8.3
T6 triples	95.5%	92.4%	−3.1
T7 stdlib	87.9%	89.7%	+1.8
T8 refusal	90.0%	68.8%	−21.2 ⚠
5-NL i18n	96%	60%	−36 ⚠
DLG-mk0 (NEW)	n/a	0.7652	(vs 0.85 gate)

Diagnosis (full writeup in `dancinlab/hexa-codex/lm_foundry/ROADMAP.md` r40)

The 840-pair v18 delegation block was 25% of the dataset. The LoRA gradient shared between the prior r38 GRPO compile-RL and this SFT over-wrote the RL's T4 decision boundary — 12 000 RL rollouts of "emit enum Foo { ... }, not enum Foo<T> { ... }" got displaced by ~10 SFT exemplars that taught the same decision example-by-example. See dancinlab/hexa-codex/lm_foundry/.claude/memory/feedback_lever4_rl_sft_conflict.md ([[lever4-rl-sft-conflict]] memory pointer) for the recipe lesson.

License

MIT (adapter weights). Base model: Qwen/Qwen2.5-Coder-7B.

Downloads last month: 27

Model tree for dancinlab/hexa-forge-code-7b-qwen2.5-lora-r64-v0.4.0-delegate

Base model

Qwen/Qwen2.5-7B

Finetuned

Qwen/Qwen2.5-Coder-7B

Adapter

(53)

this model

dancinlab
/

hexa-forge-code-7b-qwen2.5-lora-r64-v0.4.0-delegate

hexa-forge-code-7b-qwen2.5-lora-r64-v0.4.0-delegate (r40)

Why this exists

Scores (Mk.I 665 strict on r38-fixed manifest)

Diagnosis (full writeup in `dancinlab/hexa-codex/lm_foundry/ROADMAP.md` r40)

License

Model tree for dancinlab/hexa-forge-code-7b-qwen2.5-lora-r64-v0.4.0-delegate

Dataset used to train dancinlab/hexa-forge-code-7b-qwen2.5-lora-r64-v0.4.0-delegate

hexa-forge-code-7b-qwen2.5-lora-r64-v0.4.0-delegate (r40)

Why this exists

Scores (Mk.I 665 strict on r38-fixed manifest)

Diagnosis (full writeup in dancinlab/hexa-codex/lm_foundry/ROADMAP.md r40)

License

Model tree for dancinlab/hexa-forge-code-7b-qwen2.5-lora-r64-v0.4.0-delegate

Dataset used to train dancinlab/hexa-forge-code-7b-qwen2.5-lora-r64-v0.4.0-delegate

Diagnosis (full writeup in `dancinlab/hexa-codex/lm_foundry/ROADMAP.md` r40)