kashif HF Staff commited on
Commit
217b2a7
·
verified ·
1 Parent(s): db11d5c

card: add draft-model download instructions

Browse files
Files changed (1) hide show
  1. README.md +8 -0
README.md CHANGED
@@ -53,6 +53,14 @@ cd llama.cpp && cmake -B build && cmake --build build -j
53
 
54
  ### Speculative decoding with Carbon-500M draft (~1.35x speedup)
55
 
 
 
 
 
 
 
 
 
56
  ```bash
57
  ./build/bin/llama-speculative \
58
  -m carbon-3b-bf16.gguf \
 
53
 
54
  ### Speculative decoding with Carbon-500M draft (~1.35x speedup)
55
 
56
+ Carbon-500M shares the HybridDNA vocab, so it works as a drop-in draft model. Grab it first:
57
+
58
+ ```bash
59
+ hf download HuggingFaceBio/Carbon-500M-GGUF carbon-500m-bf16.gguf --local-dir .
60
+ ```
61
+
62
+ Then run with `--model-draft`:
63
+
64
  ```bash
65
  ./build/bin/llama-speculative \
66
  -m carbon-3b-bf16.gguf \