Spaces:
Running on Zero
Running on Zero
File size: 623 Bytes
be95cba ef0c1da e8e5451 be95cba e8e5451 be95cba ef0c1da e8e5451 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 | ---
title: GRM-2.6-Opus
emoji: 🔥
colorFrom: gray
colorTo: purple
sdk: gradio
sdk_version: 6.11.0
app_file: app.py
pinned: false
---
Text-only ZeroGPU Space for `GRM-2.6-Opus`.
Notes:
- Built for ZeroGPU with `@spaces.GPU`
- Uses 4-bit NF4 quantization to reduce memory pressure
- Keeps the UI text-only because the Qwen model card explicitly recommends text-only deployment to save memory and free more KV cache
- Exposes Qwen3.6 thinking controls through `enable_thinking` and `preserve_thinking`
- Uses shorter default generation lengths than the model card recommendations to behave better in shared ZeroGPU queues
|