morikomorizz commited on
Commit
d9d2f65
·
1 Parent(s): 96ead18

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -15
README.md CHANGED
@@ -28,35 +28,28 @@ That second number matters a lot. Low KL divergence means the model's "brain" di
28
 
29
  ---
30
 
31
- ## Base Model Capabilities (inherited from GRM-2.6-Plus)
32
-
33
- GRM-2.6-Plus already punches well above its weight class for a 27B model — competitive with much larger systems on reasoning and coding benchmarks. Primal inherits all of that:
34
-
35
- - Elite step-by-step reasoning across complex domains
36
- - Strong code generation and agentic/tool-use workflows
37
- - High performance on math, science, and general knowledge
38
- - Practical for local deployment on capable consumer hardware
39
-
40
- ---
41
-
42
  ## Usage
43
 
44
  ```python
45
  from vllm import LLM, SamplingParams
46
 
 
47
  sampling_params = SamplingParams(
48
  temperature=1.0,
49
  top_p=0.95,
50
- max_tokens=81920
51
  )
52
 
 
53
  llm = LLM(model="morikomorizz/GRM-2.6-Plus-Primal")
54
 
55
- messages = [{"role": "user", "content": "Your prompt here"}]
 
 
 
56
 
57
  outputs = llm.chat(messages, sampling_params=sampling_params)
58
 
59
  for output in outputs:
60
  print(output.outputs[0].text)
61
- ```
62
- *Based on [OrionLLM/GRM-2.6-Plus](https://huggingface.co/OrionLLM/GRM-2.6-Plus) · Architecture: Qwen3.6-27B · Tensor type: BF16*
 
28
 
29
  ---
30
 
 
 
 
 
 
 
 
 
 
 
 
31
  ## Usage
32
 
33
  ```python
34
  from vllm import LLM, SamplingParams
35
 
36
+ # Sampling configuration
37
  sampling_params = SamplingParams(
38
  temperature=1.0,
39
  top_p=0.95,
40
+ max_tokens=81920,
41
  )
42
 
43
+ # Load the model
44
  llm = LLM(model="morikomorizz/GRM-2.6-Plus-Primal")
45
 
46
+ # Run inference
47
+ messages = [
48
+ {"role": "user", "content": "Your prompt here"},
49
+ ]
50
 
51
  outputs = llm.chat(messages, sampling_params=sampling_params)
52
 
53
  for output in outputs:
54
  print(output.outputs[0].text)
55
+ ```