hmahadik commited on
Commit
17c8362
·
verified ·
1 Parent(s): b6ab551

Fix Modelfile FROM paths + clarify Ollama install paths

Browse files
Files changed (1) hide show
  1. README.md +27 -7
README.md CHANGED
@@ -33,18 +33,38 @@ for chat / out-of-scope prompts. Full schema lives in the demo repo
33
 
34
  ## Quick start (Ollama)
35
 
 
 
 
 
 
 
36
  ```bash
37
- # Pull and run the compact (default) model
38
  ollama pull hf.co/BrinqAI/coral-functiongemma-270m:compact-Q4_K_M
39
- ollama run hf.co/BrinqAI/coral-functiongemma-270m:compact-Q4_K_M
40
-
41
- # Native format
42
  ollama pull hf.co/BrinqAI/coral-functiongemma-270m:native-Q4_K_M
43
  ```
44
 
45
- Both files ship with a `Modelfile.<format>` that wires the right stop tokens
46
- (`<end>`, `<end_of_turn>`, `<eos>`) and runtime defaults (`temperature=0`,
47
- `num_ctx=1024`, `num_predict=80`).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
48
 
49
  The model expects prompts built via the FunctionGemma chat template
50
  (developer role + user role, tools list passed via
 
33
 
34
  ## Quick start (Ollama)
35
 
36
+ Two install paths. **Pick the second** unless you know your client sets the
37
+ stop tokens itself — `ollama pull hf.co/...` ignores the shipped Modelfile,
38
+ so the compact format will run past `<end>` until it hits `num_predict`.
39
+
40
+ ### Option A — direct HF pull (defaults only)
41
+
42
  ```bash
 
43
  ollama pull hf.co/BrinqAI/coral-functiongemma-270m:compact-Q4_K_M
 
 
 
44
  ollama pull hf.co/BrinqAI/coral-functiongemma-270m:native-Q4_K_M
45
  ```
46
 
47
+ Stop tokens (`<end>`, `<end_of_turn>`, `<eos>`) and runtime params
48
+ (`temperature=0`, `num_ctx=1024`, `num_predict=80`) are **not** applied
49
+ Ollama generates a default Modelfile from the GGUF. Use only if your client
50
+ injects stop tokens at request time (the demo `inference/backend.py` does
51
+ this via `options.stop`).
52
+
53
+ ### Option B — local `ollama create` (recommended)
54
+
55
+ ```bash
56
+ # Download GGUF + Modelfile into the same dir
57
+ huggingface-cli download BrinqAI/coral-functiongemma-270m \
58
+ coral-functiongemma-v4c-compact-Q4_K_M.gguf Modelfile.compact \
59
+ --local-dir ./coral-fg
60
+
61
+ cd coral-fg
62
+ ollama create coral-functiongemma:compact -f Modelfile.compact
63
+ ollama run coral-functiongemma:compact
64
+ ```
65
+
66
+ Same flow for native: swap `compact` → `native` in both filenames and tag.
67
+ This path bakes the stop tokens and decode params into the registered model.
68
 
69
  The model expects prompts built via the FunctionGemma chat template
70
  (developer role + user role, tools list passed via