kashif (HF Staff) committed
Commit 37becc5 · verified · 1 Parent(s): 6f43ce0

card: add draft-model download instructions

Files changed (1)
  1. README.md +7 -1
README.md CHANGED
@@ -45,7 +45,13 @@ cd llama.cpp && cmake -B build && cmake --build build -j
 
 ### Speculative decoding with Carbon-500M draft (~2x speedup)
 
-The 500M shares the HybridDNA vocab, so it's a near-ideal draft. Measured ~2.1x speedup at temp=0 with 87% accept rate on DNA prompts:
+The 500M shares the HybridDNA vocab, so it's a near-ideal draft. Measured ~2.1x speedup at temp=0 with 87% accept rate on DNA prompts. Grab the draft GGUF first:
+
+```bash
+hf download HuggingFaceBio/Carbon-500M-GGUF carbon-500m-bf16.gguf --local-dir .
+```
+
+Then run with `--model-draft`:
 
 ```bash
 ./build/bin/llama-speculative \