Image-Text-to-Text
Transformers
GGUF
text-generation-inference
unsloth
qwen3_5
reasoning
chain-of-thought
lora
sft
agent
tool-use
function-calling
coder
conversational
Instructions to use Jackrong/Qwopus3.5-9B-Coder-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Jackrong/Qwopus3.5-9B-Coder-GGUF with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-text-to-text", model="Jackrong/Qwopus3.5-9B-Coder-GGUF") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] pipe(text=messages)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("Jackrong/Qwopus3.5-9B-Coder-GGUF", dtype="auto") - llama-cpp-python
How to use Jackrong/Qwopus3.5-9B-Coder-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="Jackrong/Qwopus3.5-9B-Coder-GGUF", filename="Qwopus3.5-9B-coder-Exp-BF16.gguf", )
llm.create_chat_completion( messages = [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] ) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use Jackrong/Qwopus3.5-9B-Coder-GGUF with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M # Run inference directly in the terminal: llama-cli -hf Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M # Run inference directly in the terminal: llama-cli -hf Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M
Use Docker
docker model run hf.co/Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M
- LM Studio
- Jan
- vLLM
How to use Jackrong/Qwopus3.5-9B-Coder-GGUF with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "Jackrong/Qwopus3.5-9B-Coder-GGUF" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Jackrong/Qwopus3.5-9B-Coder-GGUF", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker
docker model run hf.co/Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M
- SGLang
How to use Jackrong/Qwopus3.5-9B-Coder-GGUF with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "Jackrong/Qwopus3.5-9B-Coder-GGUF" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Jackrong/Qwopus3.5-9B-Coder-GGUF", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "Jackrong/Qwopus3.5-9B-Coder-GGUF" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Jackrong/Qwopus3.5-9B-Coder-GGUF", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' - Ollama
How to use Jackrong/Qwopus3.5-9B-Coder-GGUF with Ollama:
ollama run hf.co/Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M
- Unsloth Studio new
How to use Jackrong/Qwopus3.5-9B-Coder-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Jackrong/Qwopus3.5-9B-Coder-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Jackrong/Qwopus3.5-9B-Coder-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for Jackrong/Qwopus3.5-9B-Coder-GGUF to start chatting
- Pi new
How to use Jackrong/Qwopus3.5-9B-Coder-GGUF with Pi:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use Jackrong/Qwopus3.5-9B-Coder-GGUF with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M
Run Hermes
hermes
- Docker Model Runner
How to use Jackrong/Qwopus3.5-9B-Coder-GGUF with Docker Model Runner:
docker model run hf.co/Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M
- Lemonade
How to use Jackrong/Qwopus3.5-9B-Coder-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull Jackrong/Qwopus3.5-9B-Coder-GGUF:Q4_K_M
Run and chat with the model
lemonade run user.Qwopus3.5-9B-Coder-GGUF-Q4_K_M
List all available models
lemonade list
File size: 32,965 Bytes
615bcd7 555f1ef 615bcd7 36c9200 615bcd7 883d7cd 615bcd7 883d7cd 615bcd7 f05db49 615bcd7 883d7cd 615bcd7 a4e5602 3b8ca56 a4e5602 615bcd7 05f9193 615bcd7 473573d 615bcd7 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 | ---
base_model:
- Jackrong/Qwopus3.5-9B-v3.5
tags:
- text-generation-inference
- transformers
- unsloth
- qwen3_5
- reasoning
- chain-of-thought
- lora
- sft
- agent
- tool-use
- function-calling
- coder
license: apache-2.0
language:
- en
- zh
- es
- ru
- ja
pipeline_tag: image-text-to-text
datasets:
- lambda/hermes-agent-reasoning-traces
- Jackrong/Claude-opus-4.7-TraceInversion-5000x
- Jackrong/Claude-opus-4.6-TraceInversion-9000x
---
# π Qwopus3.5-9B-coder
## π Model Fine-Tuning and Logical Alignment (Qwopus3.5-9B-coder)
As the base model of this model, **Qwopus3.5-9B-v3.5** is already a model with powerful capabilities. On this foundation, **Qwopus3.5-9B-coder** is specially optimized and fine-tuned for high-performance **π€ Agentic Coding, complex Tool Calling, and logical reasoning.**
> π‘ **Why the 9B Dense Model?**
> We believe that the 9B dense architecture represents the perfect **"sweet spot"** for large language models. It runs seamlessly at 8-bit precision on entry-level 16GB RAM devicesβsuch as standard laptops and the Mac miniβmaking it exceptionally lightweight yet highly versatile. Without requiring expensive hardware, it allows you to achieve excellent performance paired with impressive inference speeds. Simply put, **Qwen3.5-9B is currently the best open-source model in its class.**

> [!TIP]
>**Vision & Tool Calling Support**: This model supports visual capabilities and tool calling. To enable vision, please place the `mmproj.gguf` file from the [GGUF repository](https://huggingface.co/Jackrong/Qwopus3.5-9B-coder-GGUF) into the same directory as the main `.gguf` file.
---
### π Training Strategy
The fine-tuning process of this model deeply integrates **Trace Inversion** data augmentation technology with high-quality **Agent Traces**. This systematic approach not only strengthens the model's ability to solve complex programming tasks, but also greatly improves its logical coherence and accuracy when using various tools.
This model is designed specifically for the following goals:
- π§© More structured and stronger logical reasoning capabilities, reducing repetitive thinking
- π» More powerful capabilities in code writing, debugging, and repository-level task processing
- π More stable and accurate Tool Calling capabilities for terminal commands, file operations, and browsers
- π Better cross-data source distillation alignment
> [!WARNING]
> - **Community Release Notice**: Qwopus3.5-9B-coder is released purely as an experimental community version, aiming to explore the combination of Agent capabilities and deep reasoning, and is only for research and exploration use.
> - **Warning**: Because this model is vertically fine-tuned for programming agents and deep reasoning, and has not undergone comprehensive general performance evaluation, its capabilities in general domains or specific non-programming tasks may suffer from Capability Decay. Users are advised to be aware of its limitations in other scenarios while exploring its core capabilities.
---
## π Baseline Performance Comparison
To verify the execution efficiency and logical robustness of **Qwopus3.5-9B-coder** in actual agent scenarios, we adopted the open-source testing framework [benchlocal](https://github.com/stevibe/benchlocal).
### Test Configuration
- **Hardware Environment**: Apple Silicon (Mac)
- **Inference Backend**: LM Studio / MLX / GGUF
- **Testing Platform**: [benchlocal](https://github.com/stevibe/benchlocal) - An evaluation suite focusing on local model agent capabilities.
- π You can see the actual inference speeds of different model formats on the same device.
### π§ͺ Benchmark Results
<div style="display: inline-block; padding: 6px 16px; background: #e0f2fe; color: #0369a1; border: 1px solid #bae6fd; border-radius: 8px; font-weight: 700; font-size: 16px; margin-bottom: 12px;">1. Complex Agent Performance - HermesAgent-20</div>
The following is the comparative performance under the HermesAgent-20 task set:
<table style="width: 100%; border-collapse: collapse; font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Helvetica, Arial, sans-serif;">
<thead>
<tr>
<td colspan="4" style="padding: 8px 12px; font-weight: 600; color: #7c3aed; border-bottom: 1px solid rgba(124, 58, 237, 0.2); background: rgba(124, 58, 237, 0.05);">HermesAgent-20 Performance Metrics</td>
</tr>
<tr style="background: rgba(128, 128, 128, 0.02);">
<th style="padding: 7px 7px; padding-left: 20px; text-align: left; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Model</th>
<th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Test Set</th>
<th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Comprehensive Score</th>
<th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Core Dimensions (M/O/S/S/B)</th>
</tr>
</thead>
<tbody>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><b><a href="https://huggingface.co/Jackrong/Qwopus3.5-9B-coder-GGUF" style="color: #7c3aed; text-decoration: none;">Qwopus3.5-9B-coder</a></b></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">HermesAgent-20</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); color: #7c3aed; font-weight: bold;">85</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">84 / 93 / 88 / 75 / 84</td>
</tr>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/Qwen/Qwen3.5-9B" style="color: #666; text-decoration: none;">Qwen/Qwen3.5-9B</a></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">HermesAgent-20</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">71</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">75 / 58 / 100 / 53 / 69</td>
</tr>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/armand0e/Qwen3.5-9B-Agent" style="color: #666; text-decoration: none;">armand0e/Qwen3.5-9B-Agent</a></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">HermesAgent-20</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">68</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">71 / 83 / 43 / 61 / 80</td>
</tr>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/DJLougen/Harmonic-Hermes-9B" style="color: #666; text-decoration: none;">DJLougen/Harmonic-Hermes-9B</a></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">HermesAgent-20</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">47</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">60 / 45 / 23 / 69 / 38</td>
</tr>
</tbody>
</table>
<div style="display: inline-block; padding: 6px 16px; background: #e0f2fe; color: #0369a1; border: 1px solid #bae6fd; border-radius: 8px; font-weight: 700; font-size: 16px; margin-bottom: 12px;">2. Tool Call Stability - ToolCall-15</div>
This is a ToolCall-15 test set targeting the stability of tool calls, aiming to test the stability of the model in tool calling:
<table style="width: 100%; border-collapse: collapse; font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Helvetica, Arial, sans-serif;">
<thead>
<tr>
<td colspan="4" style="padding: 8px 12px; font-weight: 600; color: #7c3aed; border-bottom: 1px solid rgba(124, 58, 237, 0.2); background: rgba(124, 58, 237, 0.05);">ToolCall-15 Stability Metrics</td>
</tr>
<tr style="background: rgba(128, 128, 128, 0.02);">
<th style="padding: 7px 7px; padding-left: 20px; text-align: left; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Model</th>
<th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Test Set</th>
<th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Comprehensive Score</th>
<th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Dimension Scores (A/B/C/D/E)</th>
</tr>
</thead>
<tbody>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><b><a href="https://huggingface.co/Jackrong/Qwopus3.5-9B-coder-GGUF" style="color: #7c3aed; text-decoration: none;">Qwopus3.5-9B-coder</a></b></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">ToolCall-15</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); color: #7c3aed; font-weight: bold;">100</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">100 / 100 / 100 / 100 / 100</td>
</tr>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/Qwen/Qwen3.5-9B" style="color: #666; text-decoration: none;">Qwen/Qwen3.5-9B</a></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">ToolCall-15</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); color: #7c3aed; font-weight: bold;">100</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">100 / 100 / 100 / 100 / 100</td>
</tr>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/armand0e/Qwen3.5-9B-Agent" style="color: #666; text-decoration: none;">armand0e/Qwen3.5-9B-Agent</a></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">ToolCall-15</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">93</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">100 / 100 / 100 / 67 / 100</td>
</tr>
</tbody>
</table>
<div style="display: inline-block; padding: 6px 16px; background: #e0f2fe; color: #0369a1; border: 1px solid #bae6fd; border-radius: 8px; font-weight: 700; font-size: 16px; margin-bottom: 12px;">3. Code Debugging & Bug Fixing - BugFind-15</div>
BugFind-15 is a test set containing 15 scenarios from shallow to deep, aiming to evaluate the real debugging capabilities of the model in discovering and fixing syntax, logical errors, and "trap" code in multiple programming languages through deterministic environment runtime verification.
<table style="width: 100%; border-collapse: collapse; font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Helvetica, Arial, sans-serif;">
<thead>
<tr>
<td colspan="4" style="padding: 8px 12px; font-weight: 600; color: #7c3aed; border-bottom: 1px solid rgba(124, 58, 237, 0.2); background: rgba(124, 58, 237, 0.05);">BugFind-15 Performance Metrics</td>
</tr>
<tr style="background: rgba(128, 128, 128, 0.02);">
<th style="padding: 7px 7px; padding-left: 20px; text-align: left; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Model</th>
<th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Test Set</th>
<th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Comprehensive Score</th>
<th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Dimension Scores (A/B/C/D/E)</th>
</tr>
</thead>
<tbody>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><b><a href="https://huggingface.co/Jackrong/Qwopus3.5-9B-coder-GGUF" style="color: #7c3aed; text-decoration: none;">Qwopus3.5-9B-coder</a></b></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">BugFind-15</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); color: #7c3aed; font-weight: bold;">79</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">67 / 87 / 100 / 77 / 43</td>
</tr>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/Jackrong/MLX-Qwen3.5-9B-DeepSeek-V4-Flash-8bit" style="color: #666; text-decoration: none;">Jackrong/MLX-Qwen3.5-9B-DeepSeek-V4-Flash</a></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">BugFind-15</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">75</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">67 / 100 / 67 / 57 / 80</td>
</tr>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/armand0e/Qwen3.5-9B-Agent" style="color: #666; text-decoration: none;">armand0e/Qwen3.5-9B-Agent</a></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">BugFind-15</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">58</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">29 / 87 / 73 / 20 / 67</td>
</tr>
</tbody>
</table>
### πͺ SWE-bench Verified Performance (Repository-level Coding Capability)
The following shows the comparative performance on **SWE-bench Verified**, which evaluates language models on resolving software engineering issues in real-world open-source repositories:
<table style="width: 100%; border-collapse: collapse; font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Helvetica, Arial, sans-serif;">
<thead>
<tr>
<td colspan="3" style="padding: 8px 12px; font-weight: 600; color: #7c3aed; border-bottom: 1px solid rgba(124, 58, 237, 0.2); background: rgba(124, 58, 237, 0.05);">SWE-bench Verified Performance Metrics</td>
</tr>
<tr style="background: rgba(128, 128, 128, 0.02);">
<th style="padding: 7px 7px; padding-left: 20px; text-align: left; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Model</th>
<th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Test Set</th>
<th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Comprehensive Score (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><span style="color: #666;">Claude 4.5 Opus</span></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">SWE-bench Verified</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">80.9</td>
</tr>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/Qwen/Qwen3.5-27B" style="color: #666; text-decoration: none;">Qwen/Qwen3.5-27B</a></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">SWE-bench Verified</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">75.0</td>
</tr>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/Qwen/Qwen3.6-35B-A3B" style="color: #666; text-decoration: none;">Qwen/Qwen3.6-35B-A3B</a></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">SWE-bench Verified</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">73.4</td>
</tr>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><b><a href="https://huggingface.co/Jackrong/Qwopus3.5-9B-coder-GGUF" style="color: #7c3aed; text-decoration: none;">Qwopus3.5-9B-coder</a></b></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">SWE-bench Verified</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); color: #7c3aed; font-weight: bold;">53.89</td>
</tr>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/google/gemma-4-31B-it" style="color: #666; text-decoration: none;">google/gemma-4-31B-it</a></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">SWE-bench Verified</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">52.0</td>
</tr>
<tr>
<td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><span style="color: #666;">google/gemma-4-26B-A4B</span></td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">SWE-bench Verified</td>
<td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">45.0 - 48.0</td>
</tr>
</tbody>
</table>
> [!IMPORTANT]
> - βοΈ All tests were conducted with a temperature of 1 as officially recommended by qwen3.5. All errors and model issues were attempted to be regenerated twice after a test failure. If both attempts fail, it is considered a failure.
> - π All screenshots of the test interfaces have been uploaded to the image folder in the repository. Click the link below to view and verify:
> - π [View Test Screenshots](https://huggingface.co/Jackrong/Qwopus3.5-9B-coder/tree/main/test_images)
> - β€οΈ **Kyle Hessling** for his generous hardware and equipment support. You can follow him for more updates on X / Twitter: [@KyleHessling1](https://x.com/KyleHessling1).
---
### π§ͺ Core Dataset Usage: Trace Inversion and High-Quality Agent Traces
In order to break through the "reasoning bubble" limitation of the model in actual programming and tool usage, and to endow it with real Agent behavioral capabilities, this model introduced core augmented datasets during training:
#### 1. Reasoning Synthetic Data Combining Trace Inversion
**Currently, based on public information, commercial models such as OpenAI's GPT series and Anthropic's Claude series have very clearly hidden the true internal reasoning chains of their models. For these models, what we can ultimately see in the API or front-end interface can often only be considered a highly compressed "Reasoning Bubble".**
To break through this limitation, we adopted the **Trace Inversion** technology. This technology utilizes an external "surrogate model" to reconstruct a complete and logically coherent deep reasoning chain based on the "question + final answer + compressed reasoning summary" published by commercial models. The "reasoning bubble", which originally consisted of only a few sentences and logical leaps, is expanded into a high-quality deep learning trace with complete derivation, calculation, and logical verification, providing step-by-step logical learning signals for the model.

#### 2. GLM-5.1 Agent Real Trace Data: lambda/hermes-agent-reasoning-traces
To significantly enhance the model's execution and coding capabilities in real environments, this model additionally introduced the **`lambda/hermes-agent-reasoning-traces`** dataset.

- **Data Source and Scale**: This data subset contains approximately 10,000 high-quality multi-turn Tool Calling Trajectories generated based on the ZhipuAI GLM-5.1 and kimi-4.6 models.
- **Real Agent Behavior**: Unlike traditional synthetic data, these samples represent real Agent conversations. Each sample not only contains the step-by-step reasoning process in the `<think>` tags, but also includes actual tool execution results (rather than fabricated outputs out of thin air).
- **Extensive Domain Coverage**:
- **Terminal & Coding**: Script writing, code debugging, environment configuration, and data processing.
- **Repository Tasks**: Involving real code repository work, such as bug fixes, refactoring, and code review.
- **Browser Automation**: Web navigation, scraping, and form filling.
- **Agent Tools**: Memory persistence, task delegation, skill management, etc.
By learning these Agent trajectories that contain real feedback and thoughtful processes, Qwopus3.5-9B-coder can exhibit thinking and operational modes closer to human experts when facing complex programming and system operations tasks.
---
## πΊοΈ Training Pipeline Overview
The training of this model integrates a phased learning pipeline of **Trace Inversion** data augmentation technology and **high-quality Agent Trajectories data**. Its core logic lies in restoring the highly compressed "reasoning bubble" of commercial models into a deep path for learning, and combining it with real agent operational traces to comprehensively improve the model's logical reasoning and code execution capabilities.
```text
[ πΊοΈ Trace Inversion: Full Process of Data Inversion and "Attack" Distillation ]
A. Surrogate Model Training
Open Source Model (GLM-5.1 / DS-V4) βββΊ Complete Reasoning Chain βββΊ [ Qwen3-235B Compression ] βββΊ Reasoning Bubbles
β β
ββββββββββββΊ [ Training ] βββββββββββ
(Base: Qwen3-4B-Instruct)
(Result: Trace-Inverter-4B)
B. Inversion Phase: "Attacking" Claude-4.7-Max
_______________________________________________________
| |
| Claude-4.7-Max API βββΊ Compressed Bubbles + Final Answer |
|_______________________________________________________|
β
βΌ
[ π§ Trace-Inverter-4B (Logical Reconstructor) ] βββββΊ Synthetic CoT
β
βΌ
[ π§© Data Splicing ] βββββββββββ (Original Prompt + Response)
(Embed the inverted chain of thought into <think> tags, and splice with the original Q&A pair for restoration)
β
βΌ
(Result: claude-opus-4.6/4.7 Inversion Set)
C. Final SFT Pipeline
___________________________________________
| |
| Base Model (Qwopus3.5-9B-v3.5) |
|___________________________________________|
β
βΌ
[ π¦ Stage 1: Format Establishment and Logic Injection ] ββββββββΊ [ π οΈ Stage 2: Agent Trajectories and Programming Reinforcement ]
(Integrate inverted reasoning data, stabilize thinking format) (Introduce GLM-5.1 Agent Trajectories, reinforce interaction and execution)
β β
β βΌ
β __________________________________________________
β | π Hermes Agent Trace Sample Structure Breakdown (GLM-5.1) |
β | 1. [π οΈ System] -> JSON Tool Definition |
β | 2. [π€ Human] -> Initial Task Instruction |
β | ββββββββββββββββββββββββββββββββββββββββββββββββ |
β | β π Multi-turn Loop: β |
β | β 3. [π§ GPT] -> <think> Logical Reasoning/Reflection β |
β | β 4. [π€ GPT] -> Tool Call Execution Action β |
β | β 5. [βοΈ Tool] -> Real Feedback β |
β | ββββββββββββββββββββββββββββββββββββββββββββββββ |
β |__________________________________________________|
β β
ββββββββββββββββββ¬βββββββββββββββββ
βΌ
___________________________________
| |
| π Final Model: Qwopus3.5-9B-coder |
|___________________________________|
```
> [!NOTE]
> Because agent trajectory datasets are complex and diverse. The datasets have undergone rigorous cleaning and formatting.
## π― Three-Stage Curriculum Learning
**Qwopus3.5-9B-coder** adopts a phased reasoning data mixture strategy similar to Curriculum Learning, gradually increasing the difficulty and complexity of training signals:
1. **Early Stage (Format Establishment):** Focuses on short-to-medium length reasoning samples with stable formats. The primary goal of this stage is to establish a reliable, structured new reasoning format while avoiding overwhelming the model with extreme complexity.
2. **Middle Stage (Complexity Scaling & Multi-Teacher Distillation):** Gradually increases the proportion of complex reasoning samples from multiple teacher models.
- The distillation data is sourced from more powerful models whose style distribution closely matches the base model, ensuring that the capability gap is not too wide, thereby achieving efficient learning.
3. **Late Stage (Long-Context Reinforcement & Drift Prevention):** Reinforces reasoning capabilities in long contexts. Crucially, this stage retains **short-sample replay** to ensure the model maintains its short-context instruction-following capability and minimizes capability drift.
---
## π Context Length and Long-Context Usage
During fine-tuning, this model was trained with a maximum sequence length of **32K tokens**. The training data mixture was also constructed around samples up to **32K tokens**, so the "Context Length Distribution" shown in this model card reflects the fine-tuning data distribution rather than a hard architectural limit.
The model still inherits the native long-context capability of the Qwen3.6 base model. Therefore, longer context windows such as **128K** or **256K** may be available in compatible inference runtimes, depending on the backend and configuration.
For practical long-context inference beyond 32K, especially when using **llama.cpp / GGUF**, it is recommended to enable **RoPE/YaRN scaling** instead of only increasing `n_ctx` / `--ctx-size`. Directly setting a larger context window without RoPE scaling may work in some cases, but it can be less stable and may not achieve the expected long-context performance.
This is consistent with Qwen community guidance for long-context GGUF usage: **128K context generally requires YaRN/RoPE scaling**, and it is not necessarily enabled by default in llama.cpp. For example, Qwen maintainers have noted that "128K context length needs YaRN" and that it should be explicitly enabled when supported by the runtime.
Reference: https://huggingface.co/Qwen/Qwen2.5-72B-Instruct-GGUF/discussions/2
Community feedback also suggests that RoPE/YaRN scaling can improve long-context stability for this model family. One user reported that, on **HermesAgent-20**, `Qwopus3.6-35B-A3B-v1` performed better when extending from **32K to 128K via RoPE scaling** than when directly setting a **128K context window** without scaling, with scores of **83 vs. 72** in their setup. This result may vary depending on the backend, quantization type, KV cache settings, hardware, and benchmark configuration, but it is consistent with the recommendation to use RoPE/YaRN scaling for contexts beyond 32K.
Example llama.cpp configuration for extending from 32K to 128K:
```bash
./llama-server \
-m model.gguf \
--ctx-size 131072 \
--rope-scaling yarn \
--rope-scale 4 \
--yarn-orig-ctx 32768
```
For 256K context, users may need to adjust the scaling factor accordingly and validate the result in their own workload:
```bash
./llama-server \
-m model.gguf \
--ctx-size 262144 \
--rope-scaling yarn \
--rope-scale 8 \
--yarn-orig-ctx 32768
```
Please note that long-context behavior may vary depending on the inference backend, quantization type, KV cache settings, available memory, and task type. For best results, users should benchmark their own target workload when using contexts beyond 32K.
---
## π€ Collaboration & Training Details
This model is the result of continuous exploration in Agentic AI and reasoning capabilities.
**Training Infrastructure & Configuration:**
- π₯οΈ **Hardware:** Local compute devices / Cloud GPUs (e.g. GB10 / H100 / RTX 5090 / A100)
- βοΈ **Framework:** Unsloth for efficient fine-tuning
---
## β οΈ IMPORTANT
> [!CAUTION]
> **Compatibility and Deployment Notice**
> - **Tool Calling Format**: When using this model for tool calling, please ensure that you use a Prompt format and System Prompt that match the training data to activate its Agent capabilities.
> - **Reasoning Output Extraction**: The model's thinking process is typically wrapped in `<think>` and `</think>` tags. When deploying to front-end applications, these tags may need to be parsed and hidden.
---
## π Resources & Guides
π **[GitHub Repository: Jackrong-llm-finetuning-guide](https://github.com/R6410418/Jackrong-llm-finetuning-guide.git)**
Visit the repository to dive into our fine-tuning codebase and guides.
---
## π Acknowledgements
Special thanks to:
- The Qwen team for the strong Qwen3.6 MoE base model.
- Unsloth for efficient fine-tuning frameworks.
- Open-source datasets and community contributors.
- **Kyle Hessling** for his generous hardware and equipment support. You can follow him for more updates on X / Twitter: [@KyleHessling1](https://x.com/KyleHessling1).
---
## π Citation
```bibtex
@misc{jackrong_qwopus35_9b_coder,
title = {Qwopus3.5-9B-coder},
author = {Jackrong},
year = {2026},
publisher = {Hugging Face}
}
``` |