Outlier Server V3.2
Collection
V3.2 ternary MoE overlay on Qwen 2.5 server-class models. • 4 items • Updated
Ternary mixture-of-experts overlay on Qwen/Qwen2.5-32B-Instruct. 70B total effective parameters, 32B active per forward pass.
[VERIFIED]from transformers import AutoModelForCausalLM, AutoTokenizer
name = "Outlier-Ai/Outlier-70B-V3.2"
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(
name, trust_remote_code=True, torch_dtype="auto"
)
prompt = tok.apply_chat_template(
[{"role": "user", "content": "What is the capital of France?"}],
tokenize=False, add_generation_prompt=True,
)
inputs = tok(prompt, return_tensors="pt").to(model.device)
print(tok.decode(model.generate(**inputs, max_new_tokens=200)[0]))
For consumer Apple Silicon inference use MLX or GGUF tiers:
| Metric | Value | Provenance |
|---|---|---|
| MMLU | 81.49% | [VERIFIED] — full-sample, n=14,042, stderr 0.00314, source cloud_sprint_day12/results/70b_v3_2_mmlu_full.json, 62 task results, full provenance retained. |
Rule 66 provenance labels:
[VERIFIED] — full source JSON with config.limit=None, n-samples complete, model_args present, reproducible from commit SHA.[INCOMPLETE] — number exists on disk but provenance fields are stripped; cannot be cited publicly.[CLAIM] — historical smoke-test value pending full re-verification on cluster.[PENDING] — benchmark scheduled; results expected by a specific date.Ternary-weight arithmetic ({-1, 0, +1}) reduces a matmul to a stream of additions and subtractions — no multiplications — enabling consumer hardware to run flagship-scale models at usable speeds.
Three provisional patents filed April 2026 (61 claims total) covering ternary MoE weight composition, expert paging, and specialist merging techniques.
[CLAIM] below are historical smoke-test values awaiting verification.[VERIFIED] MMLU 77.80%[VERIFIED] MMLU 81.49%@misc{outlier2026,
author = {Kerr, Matt},
title = {Outlier: Ternary Mixture-of-Experts for Consumer Hardware},
year = {2026},
publisher = {Hugging Face},
howpublished = {\url{https://huggingface.co/Outlier-Ai}}
}