Image-Text-to-Text
Transformers
GGUF
text-generation-inference
unsloth
qwen3_5
reasoning
chain-of-thought
lora
sft
agent
tool-use
function-calling
coder
conversational
Jackrong commited on
Commit
a4e5602
·
verified ·
1 Parent(s): 36c9200

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +47 -0
README.md CHANGED
@@ -191,6 +191,53 @@ BugFind-15 is a test set containing 15 scenarios from shallow to deep, aiming to
191
  </tbody>
192
  </table>
193
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
194
 
195
 
196
  > [!IMPORTANT]
 
191
  </tbody>
192
  </table>
193
 
194
+ ### 🪐 SWE-bench Verified Performance (Repository-level Coding Capability)
195
+ The following shows the comparative performance on **SWE-bench Verified**, which evaluates language models on resolving software engineering issues in real-world open-source repositories:
196
+
197
+ <table style="width: 100%; border-collapse: collapse; font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Helvetica, Arial, sans-serif;">
198
+ <thead>
199
+ <tr>
200
+ <td colspan="3" style="padding: 8px 12px; font-weight: 600; color: #7c3aed; border-bottom: 1px solid rgba(124, 58, 237, 0.2); background: rgba(124, 58, 237, 0.05);">SWE-bench Verified Performance Metrics</td>
201
+ </tr>
202
+ <tr style="background: rgba(128, 128, 128, 0.02);">
203
+ <th style="padding: 7px 7px; padding-left: 20px; text-align: left; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Model</th>
204
+ <th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Test Set</th>
205
+ <th style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); font-size: 13px; color: #666;">Comprehensive Score (%)</th>
206
+ </tr>
207
+ </thead>
208
+ <tbody>
209
+ <tr>
210
+ <td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><span style="color: #666;">Claude 4.5 Opus</span></td>
211
+ <td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">SWE-bench Verified</td>
212
+ <td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">80.9</td>
213
+ </tr>
214
+ <tr>
215
+ <td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/Qwen/Qwen3.5-27B" style="color: #666; text-decoration: none;">Qwen/Qwen3.5-27B</a></td>
216
+ <td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">SWE-bench Verified</td>
217
+ <td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">75.0</td>
218
+ </tr>
219
+ <tr>
220
+ <td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/Qwen/Qwen3.6-35B-A3B" style="color: #666; text-decoration: none;">Qwen/Qwen3.6-35B-A3B</a></td>
221
+ <td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">SWE-bench Verified</td>
222
+ <td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">73.4</td>
223
+ </tr>
224
+ <tr>
225
+ <td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><b><a href="https://huggingface.co/Jackrong/Qwopus3.5-9B-coder-GGUF" style="color: #7c3aed; text-decoration: none;">Qwopus3.5-9B-coder</a></b></td>
226
+ <td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">SWE-bench Verified</td>
227
+ <td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15); color: #7c3aed; font-weight: bold;">53.33</td>
228
+ </tr>
229
+ <tr>
230
+ <td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><a href="https://huggingface.co/google/gemma-4-31B-it" style="color: #666; text-decoration: none;">google/gemma-4-31B-it</a></td>
231
+ <td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">SWE-bench Verified</td>
232
+ <td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">52.0</td>
233
+ </tr>
234
+ <tr>
235
+ <td style="padding: 7px 7px; padding-left: 20px; border-bottom: 1px solid rgba(128, 128, 128, 0.15);"><span style="color: #666;">google/gemma-4-26B-A4B</span></td>
236
+ <td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">SWE-bench Verified</td>
237
+ <td style="padding: 7px 7px; text-align: center; border-bottom: 1px solid rgba(128, 128, 128, 0.15);">45.0 - 48.0</td>
238
+ </tr>
239
+ </tbody>
240
+ </table>
241
 
242
 
243
  > [!IMPORTANT]