Commit ·
d9d2f65
1
Parent(s): 96ead18
Update README.md
Browse files
README.md
CHANGED
|
@@ -28,35 +28,28 @@ That second number matters a lot. Low KL divergence means the model's "brain" di
|
|
| 28 |
|
| 29 |
---
|
| 30 |
|
| 31 |
-
## Base Model Capabilities (inherited from GRM-2.6-Plus)
|
| 32 |
-
|
| 33 |
-
GRM-2.6-Plus already punches well above its weight class for a 27B model — competitive with much larger systems on reasoning and coding benchmarks. Primal inherits all of that:
|
| 34 |
-
|
| 35 |
-
- Elite step-by-step reasoning across complex domains
|
| 36 |
-
- Strong code generation and agentic/tool-use workflows
|
| 37 |
-
- High performance on math, science, and general knowledge
|
| 38 |
-
- Practical for local deployment on capable consumer hardware
|
| 39 |
-
|
| 40 |
-
---
|
| 41 |
-
|
| 42 |
## Usage
|
| 43 |
|
| 44 |
```python
|
| 45 |
from vllm import LLM, SamplingParams
|
| 46 |
|
|
|
|
| 47 |
sampling_params = SamplingParams(
|
| 48 |
temperature=1.0,
|
| 49 |
top_p=0.95,
|
| 50 |
-
max_tokens=81920
|
| 51 |
)
|
| 52 |
|
|
|
|
| 53 |
llm = LLM(model="morikomorizz/GRM-2.6-Plus-Primal")
|
| 54 |
|
| 55 |
-
|
|
|
|
|
|
|
|
|
|
| 56 |
|
| 57 |
outputs = llm.chat(messages, sampling_params=sampling_params)
|
| 58 |
|
| 59 |
for output in outputs:
|
| 60 |
print(output.outputs[0].text)
|
| 61 |
-
```
|
| 62 |
-
*Based on [OrionLLM/GRM-2.6-Plus](https://huggingface.co/OrionLLM/GRM-2.6-Plus) · Architecture: Qwen3.6-27B · Tensor type: BF16*
|
|
|
|
| 28 |
|
| 29 |
---
|
| 30 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 31 |
## Usage
|
| 32 |
|
| 33 |
```python
|
| 34 |
from vllm import LLM, SamplingParams
|
| 35 |
|
| 36 |
+
# Sampling configuration
|
| 37 |
sampling_params = SamplingParams(
|
| 38 |
temperature=1.0,
|
| 39 |
top_p=0.95,
|
| 40 |
+
max_tokens=81920,
|
| 41 |
)
|
| 42 |
|
| 43 |
+
# Load the model
|
| 44 |
llm = LLM(model="morikomorizz/GRM-2.6-Plus-Primal")
|
| 45 |
|
| 46 |
+
# Run inference
|
| 47 |
+
messages = [
|
| 48 |
+
{"role": "user", "content": "Your prompt here"},
|
| 49 |
+
]
|
| 50 |
|
| 51 |
outputs = llm.chat(messages, sampling_params=sampling_params)
|
| 52 |
|
| 53 |
for output in outputs:
|
| 54 |
print(output.outputs[0].text)
|
| 55 |
+
```
|
|
|