This model was produced by merging the base model Qwen/Qwen3-8B-Base with three specialists, Qwen/Qwen3-8B, OpenDataArena/Qwen3-8B-ODA-Math-460k, and mlabonne/Qwen3-8B-abliterated, using canonical LOT Merging (Sun et al., NeurIPS 2025; arXiv:2505.23859). The closed-form solution of Eq. 9 (Moore-Penrose pseudoinverse) was used for all linear projections in the attention and MLP blocks. Eq. 12 (a per-dimension, feature-norm-weighted average) was used for the RMSNorm scales of input_layernorm and post_attention_layernorm. Embeddings, lm_head, and the final norm fall back to the mean of the task vectors. Calibration source per specialist: instruction=mix, reasoning=mix, uncensored=mix.
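The three merge rules described above can be illustrated with a minimal NumPy sketch. It is not the code used to build this model. It assumes that Eq. 9 is the least-squares solution that minimizes feature drift over calibration inputs, that Eq. 12 weights each dimension by the squared feature norm, and that task vectors are stored as `(d_in, d_out)` matrices. The function names `merge_linear`, `merge_scale`, and `merge_mean` are hypothetical.

```python
import numpy as np


def merge_linear(task_vectors, features):
    """Pseudoinverse merge for a linear projection (assumed form of Eq. 9).

    task_vectors: list of (d_in, d_out) arrays, T_k = W_k - W_base.
    features:     list of (n_k, d_in) calibration inputs X_k to this layer.
    Returns the T minimizing sum_k ||X_k T_k - X_k T||_F^2.
    """
    d_in = task_vectors[0].shape[0]
    gram_sum = np.zeros((d_in, d_in), dtype=np.float64)
    rhs = np.zeros(task_vectors[0].shape, dtype=np.float64)
    for T, X in zip(task_vectors, features):
        gram = X.T @ X            # (d_in, d_in) Gram matrix of the inputs
        gram_sum += gram
        rhs += gram @ T
    # Moore-Penrose pseudoinverse handles a rank-deficient Gram sum.
    return np.linalg.pinv(gram_sum) @ rhs


def merge_scale(task_vectors, features, eps=1e-12):
    """Per-dimension weighted average for norm scales (assumed form of Eq. 12).

    task_vectors: list of (d,) arrays of scale deltas.
    features:     list of (n_k, d) inputs seen by the scale parameter.
    """
    num = np.zeros(task_vectors[0].shape, dtype=np.float64)
    den = np.zeros(task_vectors[0].shape, dtype=np.float64)
    for t, X in zip(task_vectors, features):
        weight = np.sum(X.astype(np.float64) ** 2, axis=0)  # squared norm per dim
        num += weight * t
        den += weight
    return num / (den + eps)      # eps is an illustrative guard against 0/0


def merge_mean(task_vectors):
    """Fallback: plain mean of task vectors (embeddings, lm_head, final norm)."""
    return np.mean(np.stack(task_vectors), axis=0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = [rng.normal(size=(16, 4)) for _ in range(3)]    # toy calibration inputs
    T = [rng.normal(size=(4, 2)) for _ in range(3)]     # toy task vectors
    print(merge_linear(T, X).shape)
    print(merge_scale([t[:, 0] for t in T], X).shape)
    print(merge_mean(T).shape)
```

In each case the merged task vector would then be added back to the corresponding Qwen/Qwen3-8B-Base parameter. Note that PyTorch `nn.Linear` stores weights as `(d_out, d_in)`, so a real implementation would need to transpose relative to this sketch.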

Format: Safetensors. Model size: 8B params. Tensor type: BF16.