Update vision status: untestable due to server crash
README.md
CHANGED
@@ -37,7 +37,7 @@ Standard community GPTQ under-calibrates rare experts due to routing imbalance.
 
 ## Known Limitations
 
-- **Vision:
+- **Vision: UNTESTABLE** — Vision encoder layers (`embed_vision.*`) were quantized to INT4, which likely degrades vision quality. Server crashes on first request (pre-existing RDNA4 triton issue with this model's SWA configuration, not vision-specific). **Text-only inference recommended.** A future version should add vision layers to `modules_to_not_convert`.
 
 ## Usage with SGLang
 
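
The follow-up suggested in the changed line (listing the vision layers under `modules_to_not_convert`) could look roughly like the sketch below. It is an illustration only, not the quantization script used for this model: the helper `vision_modules_to_skip`, the sample parameter names, and the shape of the `quant_config` dict are all hypothetical. Only the `embed_vision.` prefix and the `modules_to_not_convert` key come from the README text.

```python
def vision_modules_to_skip(param_names, prefix="embed_vision."):
    """Collect the unique module names under `prefix` from parameter names.

    A parameter name such as "embed_vision.proj.weight" is reduced to its
    owning module "embed_vision.proj" by dropping the last dotted component.
    """
    modules = set()
    for name in param_names:
        if name.startswith(prefix):
            module, _, _ = name.rpartition(".")
            modules.add(module)
    return sorted(modules)


# Hypothetical parameter names; a real checkpoint will have different ones.
names = [
    "embed_vision.proj.weight",
    "embed_vision.proj.bias",
    "model.layers.0.mlp.experts.0.up_proj.weight",
]

# Assumed config layout: only the key name is taken from the README.
quant_config = {
    "bits": 4,
    "modules_to_not_convert": vision_modules_to_skip(names),
}
```

Whether the quantizer and the serving stack match these entries by exact module name or by prefix depends on the tooling, so the resulting list should be checked against the actual checkpoint before re-quantizing.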