athena129 commited on
Commit
c300739
·
verified ·
1 Parent(s): aa0deee

Drop NVIDIA SKU mentions, keep technical portability claim

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -111,7 +111,7 @@ The numbers below are first-principles estimates from the bf16 weight footprint
111
  | bf16 weight file on disk | ~8.0 GB | ~16 GB |
112
  | Inference VRAM, weights only (bf16) | ~8 GB | ~16 GB |
113
  | Inference VRAM, weights + 4 K KV cache (bf16) | ~9–10 GB | ~17–18 GB |
114
- | Single-GPU class (bf16, headroom for batch ≥ 1) | Fits on 12 GB+ consumer GPU (e.g., RTX 3060 12 GB, RTX 4070 12 GB, T4 16 GB) | Typically requires 24 GB+ (e.g., RTX 4090, A10, A100 40 GB) |
115
  | AMD Instinct MI300X 192 GB (validated) | Fits trivially with very large batch / long context | Fits trivially |
116
 
117
  Notes:
 
111
  | bf16 weight file on disk | ~8.0 GB | ~16 GB |
112
  | Inference VRAM, weights only (bf16) | ~8 GB | ~16 GB |
113
  | Inference VRAM, weights + 4 K KV cache (bf16) | ~9–10 GB | ~17–18 GB |
114
+ | Single-GPU class (bf16, headroom for batch ≥ 1) | Fits on any 12 GB+ consumer card | Typically requires a 24 GB+ datacenter card |
115
  | AMD Instinct MI300X 192 GB (validated) | Fits trivially with very large batch / long context | Fits trivially |
116
 
117
  Notes: