GRM-2.6-Opus / README.md
DedeProGames's picture
Update README.md
ef0c1da verified
---
title: GRM-2.6-Opus
emoji: 🔥
colorFrom: gray
colorTo: purple
sdk: gradio
sdk_version: 6.11.0
app_file: app.py
pinned: false
---
Text-only ZeroGPU Space for `GRM-2.6-Opus`.
Notes:
- Built for ZeroGPU with `@spaces.GPU`
- Uses 4-bit NF4 quantization to reduce memory pressure
- Keeps the UI text-only because the Qwen model card explicitly recommends text-only deployment to save memory and free more KV cache
- Exposes Qwen3.6 thinking controls through `enable_thinking` and `preserve_thinking`
- Uses shorter default generation lengths than the model card recommendations to behave better in shared ZeroGPU queues