alexrs commited on
Commit
af22adb
·
verified ·
1 Parent(s): 2ec088d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -86,7 +86,7 @@ The following quantizations are available with example minimum GPU requirements
86
  | [FP8 (8-bit)](https://huggingface.co/CohereLabs/command-a-plus-05-2026-fp8) | 2 x B200 | 4 x H100 |
87
  | [W4A4 (4-bit)](https://huggingface.co/CohereLabs/command-a-plus-05-2026-w4a4) | 1 x B200 | 2 x H100 |
88
 
89
- All three quantizations show negligible differences in benchmark quality and performance.
90
 
91
  For more details, please check out our [blog post](http://cohere.com/blog/command-a-plus).
92
 
 
86
  | [FP8 (8-bit)](https://huggingface.co/CohereLabs/command-a-plus-05-2026-fp8) | 2 x B200 | 4 x H100 |
87
  | [W4A4 (4-bit)](https://huggingface.co/CohereLabs/command-a-plus-05-2026-w4a4) | 1 x B200 | 2 x H100 |
88
 
89
+ All three quantizations show negligible differences in benchmark quality and performance. **Our recommended quantization for most uses is [W4A4](https://huggingface.co/CohereLabs/command-a-plus-05-2026-w4a4) which boasts superior speed and latency characteristics alongside a smaller hardware footprint.**
90
 
91
  For more details, please check out our [blog post](http://cohere.com/blog/command-a-plus).
92