GRM-2.6
Collection
GRM-Repository • 3 items • Updated • 1
Same elite reasoning. Less hand-holding.
GRM-2.6-Plus-Primal is a uncensored variant of OrionLLM/GRM-2.6-Plus — 27B reasoning powerhouse built on the Qwen3.6 architecture. Primal takes the already-impressive base and strips away the overcautious refusal behavior that tends to get in the way of real, complex work.
The goal is to create a model that does not judge or label its users, while maintaining the model’s ability to think and serve users.
This Model Support MTP
Using Heretic-ARA by p-e-w.
| Metric | GRM-PRIMAL | Original Model GRM-2.6-Plus |
|---|---|---|
| KL divergence | 0.0678 | Base-Model |
| Refusals | 06/100 | 91/100 |
from vllm import LLM, SamplingParams
sampling_params = SamplingParams(
temperature=1.0,
top_p=0.95,
max_tokens=81920,
)
llm = LLM(model="morikomorizz/GRM-2.6-Plus-Primal")
messages = [
{"role": "user", "content": "Your prompt here"},
]
outputs = llm.chat(messages, sampling_params=sampling_params)
for output in outputs:
print(output.outputs[0].text)