Ornstein3.6-27B-MTP-NSC-ACE-SABER banner

Ornstein3.6-27B-MTP-NSC-ACE-SABER

Support this work on Ko-fi

Methods

This checkpoint was built as a staged release pipeline:

Stage Purpose Key settings
NSC-ACE Neural Steering Committee for Agentic Co-Evolution; steered rollouts are rewarded when independent latent paths converge on the same correct tool-call structure. internal steering committee, tool-call convergence reward
SABER Compliance-first calibration, then KLD/PPL selection among high-compliance candidates. 1046 HarmBench + AdvBench-style compliance prompts; selected row reports 968/1046 by keyword proxy
Ornstein SFT Reasoning/personality refinement after the SABER checkpoint. premium reasoning v1, 1 epoch, 50 steps, rank 32/32, dropout 0.05, cosine schedule

Results

Metric Value
SABER selected compliance proxy 92.54% (968/1046 eval prompts)
SABER selected keyword residual 7.46% (78/1046 eval prompts)
SABER selected HarmBench classifier ASR 0.67% (7/1046 eval prompts)
SABER selected KLD 0.008302
SABER selected PPL ratio 1.103853
SABER selected post PPL 17.5988
SABER selected base PPL 15.9431
MTP status present and verified
mtp_num_hidden_layers 1
mtp.* tensors 15
Corrected tensor count 866

Ornstein3.6-27B-MTP-NSC-ACE-SABER is a Qwen3.6/Qwen3.5-family 27B dense text checkpoint built through an NSC-ACE -> SABER -> Ornstein pipeline. The release objective is tool-use and compliance reliability first, with KLD/PPL drift kept as low as possible after the compliance target is reached.

Release Snapshot

Item Value
Release name Ornstein3.6-27B-MTP-NSC-ACE-SABER
Format full safetensors checkpoint
Base model Qwen/Qwen3.6-27B-MTP
GGUF repo GestaltLabs/Ornstein3.6-27B-MTP-NSC-ACE-SABER-GGUF

MTP Status

MTP support is present and verified. This release includes mtp_num_hidden_layers=1 and MTP/nextn tensors in the GGUF validation path. Use a llama.cpp build with Qwen3.5/Qwen3.6 MTP support and run with --spec-type mtp.

Training Notes

NSC-ACE means Neural Steering Committee for Agentic Co-Evolution. The method generates multiple internally steered rollouts for the same prompt and rewards convergence in tool-call structure across independently steered latent modes. The intent is stronger function selection, argument filling, formatting stability, and fewer repeated tool loops.

SABER was used as the compliance and drift calibration stage. The selected checkpoint optimized for compliance first, then KLD and PPL retention.

Ornstein was merged last using the premium reasoning SFT recipe listed above.

Related Repositories

Intended Use

This is an experimental agentic/tool-calling checkpoint intended for research, local evaluation, and downstream experimentation. Validate behavior for your own task distribution before production use.

Downloads last month
-
Safetensors
Model size
27B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for GestaltLabs/Ornstein3.6-27B-MTP-NSC-ACE-SABER

Quantizations
1 model