Upload ugi-leaderboard-data.csv
Browse files- ugi-leaderboard-data.csv +11 -0
ugi-leaderboard-data.csv
CHANGED
|
@@ -1206,3 +1206,14 @@ tencent/Hy3-preview (reasoning=disabled),https://huggingface.co/tencent/Hy3-prev
|
|
| 1206 |
tencent/Hy3-preview (reasoning=enabled),https://huggingface.co/tencent/Hy3-preview,4/22/2026,5/2/2026,,21.0,295.0,295.0,FALSE,FALSE,TRUE,51.12,40.09,53.88,7.1,4.4,5.3,1.2,1.0,1.5,50.11,72.49,39.66,38.17,53.88,0.5686,0.158,0.6354,0.2308,0.3156,-24.2%,68.4%,46.0%,44.1%,65.9%,42.9%,60.4%,41.2%,31.5%,37.3%,26.0%,51.2%,52.1%,29.0%,70.4%,60.6%,66.7%,Liberalism,True,0,0,,25.8,0.67,12.1,5.9,0.327,18.0,74.0,0.883,0.443,0.308,1.373,0.24,0.275,26.1,7627.0,70.6,23.43,2.4,3.4
|
| 1207 |
Qwen/Qwen3.6-27B (<think> prefill),https://huggingface.co/Qwen/Qwen3.6-27B,4/22/2026,5/2/2026,chatml w/ <think> prefill,27.0,27.0,27.0,FALSE,FALSE,TRUE,42.47,27.15,26.98,4.7,1.2,2.9,2.8,4.0,1.5,33.16,43.52,17.24,38.72,26.98,0.3473,0.302,0.5662,0.3619,0.3585,-20.0%,64.8%,45.9%,44.2%,62.2%,43.3%,67.1%,48.1%,35.0%,39.2%,31.5%,48.5%,49.8%,34.2%,61.7%,62.3%,62.5%,Liberalism,True,12387,6,Qwen3_5ForConditionalGeneration,30.6,0.73,12.0,6.1,0.305,24.0,75.0,0.869,0.406,0.288,1.36,0.171,0.336,41.1,4370.0,77.8,21.83,2.0,5.3
|
| 1208 |
Qwen/Qwen3.6-27B (no think),https://huggingface.co/Qwen/Qwen3.6-27B,4/22/2026,5/3/2026,chatml w/ no think,27.0,27.0,27.0,FALSE,FALSE,TRUE,38.83,19.32,17.73,1.8,1.0,2.7,2.2,3.0,1.5,30.67,38.4,12.76,40.84,17.73,0.401,0.1551,0.5169,0.6611,0.308,-24.4%,70.1%,47.4%,42.6%,63.5%,39.6%,64.8%,46.7%,30.0%,34.0%,25.6%,48.8%,51.9%,27.3%,57.5%,61.9%,71.0%,Liberalism,False,0,0,Qwen3_5ForConditionalGeneration,19.4,0.74,12.8,5.8,0.347,9.0,71.0,0.843,0.433,0.324,1.39,0.22,0.268,36.7,7739.0,83.1,19.03,2.1,3.2
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1206 |
tencent/Hy3-preview (reasoning=enabled),https://huggingface.co/tencent/Hy3-preview,4/22/2026,5/2/2026,,21.0,295.0,295.0,FALSE,FALSE,TRUE,51.12,40.09,53.88,7.1,4.4,5.3,1.2,1.0,1.5,50.11,72.49,39.66,38.17,53.88,0.5686,0.158,0.6354,0.2308,0.3156,-24.2%,68.4%,46.0%,44.1%,65.9%,42.9%,60.4%,41.2%,31.5%,37.3%,26.0%,51.2%,52.1%,29.0%,70.4%,60.6%,66.7%,Liberalism,True,0,0,,25.8,0.67,12.1,5.9,0.327,18.0,74.0,0.883,0.443,0.308,1.373,0.24,0.275,26.1,7627.0,70.6,23.43,2.4,3.4
|
| 1207 |
Qwen/Qwen3.6-27B (<think> prefill),https://huggingface.co/Qwen/Qwen3.6-27B,4/22/2026,5/2/2026,chatml w/ <think> prefill,27.0,27.0,27.0,FALSE,FALSE,TRUE,42.47,27.15,26.98,4.7,1.2,2.9,2.8,4.0,1.5,33.16,43.52,17.24,38.72,26.98,0.3473,0.302,0.5662,0.3619,0.3585,-20.0%,64.8%,45.9%,44.2%,62.2%,43.3%,67.1%,48.1%,35.0%,39.2%,31.5%,48.5%,49.8%,34.2%,61.7%,62.3%,62.5%,Liberalism,True,12387,6,Qwen3_5ForConditionalGeneration,30.6,0.73,12.0,6.1,0.305,24.0,75.0,0.869,0.406,0.288,1.36,0.171,0.336,41.1,4370.0,77.8,21.83,2.0,5.3
|
| 1208 |
Qwen/Qwen3.6-27B (no think),https://huggingface.co/Qwen/Qwen3.6-27B,4/22/2026,5/3/2026,chatml w/ no think,27.0,27.0,27.0,FALSE,FALSE,TRUE,38.83,19.32,17.73,1.8,1.0,2.7,2.2,3.0,1.5,30.67,38.4,12.76,40.84,17.73,0.401,0.1551,0.5169,0.6611,0.308,-24.4%,70.1%,47.4%,42.6%,63.5%,39.6%,64.8%,46.7%,30.0%,34.0%,25.6%,48.8%,51.9%,27.3%,57.5%,61.9%,71.0%,Liberalism,False,0,0,Qwen3_5ForConditionalGeneration,19.4,0.74,12.8,5.8,0.347,9.0,71.0,0.843,0.433,0.324,1.39,0.22,0.268,36.7,7739.0,83.1,19.03,2.1,3.2
|
| 1209 |
+
SicariusSicariiStuff/Assistant_Pepe_32B (/no_think),https://huggingface.co/SicariusSicariiStuff/Assistant_Pepe_32B,5/3/2026,5/3/2026,chatml w/ /no_think,32.0,32.0,32.0,True,False,False,28.36,41.63,24.94,4.1,1.4,2.6,7.5,7.0,8.0,20.98,30.75,9.66,22.55,24.94,0.1085,0.1884,0.2102,0.3432,0.277,2.2%,55.5%,48.6%,42.8%,59.3%,44.2%,60.4%,50.4%,44.6%,45.4%,43.5%,39.2%,42.5%,46.9%,61.0%,49.4%,67.5%,Liberalism,False,0,6,Qwen3ForCausalLM,38.9,1.13,12.7,3.3,0.335,79.0,91.0,0.831,0.423,0.448,1.383,0.344,0.224,86.0,6618.0,132.2,22.03,9.9,7.5
|
| 1210 |
+
SicariusSicariiStuff/Assistant_Pepe_32B,https://huggingface.co/SicariusSicariiStuff/Assistant_Pepe_32B,5/3/2026,5/3/2026,chatml,32.0,32.0,32.0,True,False,False,29.41,44.73,27.1,3.5,1.4,3.7,8.0,8.0,8.0,19.57,19.09,7.93,31.67,27.1,0.196,0.1987,0.3411,0.6107,0.2371,-3.8%,56.2%,49.5%,46.3%,58.3%,40.4%,58.8%,47.7%,44.6%,44.0%,42.7%,49.4%,46.2%,43.3%,61.2%,50.6%,63.1%,Liberalism,False,0,5,Qwen3ForCausalLM,39.2,1.18,12.8,3.2,0.324,70.0,90.0,0.826,0.417,0.457,1.393,0.439,0.164,60.8,6334.0,105.9,19.5,9.9,8.2
|
| 1211 |
+
llmfan46/Mistral-Small-3.2-24B-Instruct-2506-ultra-uncensored-heretic,https://huggingface.co/llmfan46/Mistral-Small-3.2-24B-Instruct-2506-ultra-uncensored-heretic,3/19/2026,5/3/2026,mistral V7-Tekken,24.0,24.0,24.0,True,False,False,NA,4.38,4.06,0.6,0.6,0.0,0.5,1.0,0.0,5.09,3.53,0.0,11.73,4.06,0.2492,0.0423,0.2893,0.0058,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,False,0,0,Mistral3ForConditionalGeneration,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,52.1,20015.0,114.7,36.28,NA,NA
|
| 1212 |
+
llmfan46/Qwen3.5-35B-A3B-uncensored-heretic (no thinking),https://huggingface.co/llmfan46/Qwen3.5-35B-A3B-uncensored-heretic,2/27/2026,5/3/2026,chatml w/ no thinking,3.0,35.0,35.0,True,False,False,39.52,42.13,14.44,2.4,1.4,0.7,9.8,10.0,9.5,27.72,30.35,15.17,37.63,14.44,0.4402,0.1785,0.4847,0.511,0.2672,-16.0%,60.1%,50.1%,46.9%,57.4%,45.6%,58.3%,54.2%,40.6%,46.0%,32.9%,52.1%,49.6%,39.2%,46.7%,56.0%,69.4%,Liberalism,False,0,1,Qwen3_5MoeForConditionalGeneration,39.1,0.74,13.1,6.3,0.331,16.0,90.0,0.823,0.446,0.317,1.48,0.222,0.247,33.8,6917.0,86.8,20.4,4.2,3.1
|
| 1213 |
+
llmfan46/Qwen3.5-35B-A3B-uncensored-heretic (<think> prefill),https://huggingface.co/llmfan46/Qwen3.5-35B-A3B-uncensored-heretic,2/27/2026,5/3/2026,chatml w/ <think> prefill,3.0,35.0,35.0,True,False,False,43.18,49.65,24.48,4.1,1.4,2.4,10.0,10.0,10.0,35.61,52.19,23.45,31.2,24.48,0.1713,0.0963,0.6384,0.3796,0.2746,-23.3%,65.8%,47.4%,46.2%,60.6%,43.3%,64.4%,49.8%,33.1%,43.8%,25.8%,51.7%,53.3%,33.8%,51.7%,58.3%,71.9%,Liberalism,True,14817,11,Qwen3_5MoeForConditionalGeneration,20.0,0.71,12.6,6.4,0.274,16.0,75.0,0.846,0.415,0.295,1.433,0.366,0.261,66.0,11120.0,70.3,21.65,4.2,3.2
|
| 1214 |
+
Nubinu/Qwen3.5-4B-MiniFantasy (no thinking),https://huggingface.co/Nubinu/Qwen3.5-4B-MiniFantasy,3/25/2026,5/3/2026,chatml w/ no thinking,4.0,4.0,4.0,True,False,False,30.24,40.05,16.32,2.4,1.7,1.0,8.8,9.0,8.5,14.22,21.38,1.72,19.54,16.32,0.1259,0.1833,0.173,0.3463,0.1487,-9.6%,56.6%,48.7%,50.1%,57.3%,47.7%,60.8%,54.6%,47.5%,42.3%,40.4%,46.7%,54.2%,49.6%,41.5%,54.4%,76.0%,Liberalism,False,0,0,Qwen3_5ForConditionalGeneration,42.6,0.79,13.4,6.6,0.299,21.0,74.0,0.828,0.46,0.309,1.76,0.22,0.121,79.0,6768.0,143.2,22.0,6.5,4.3
|
| 1215 |
+
Nubinu/Qwen3.5-4B-MiniFantasy (<think> prefill),https://huggingface.co/Nubinu/Qwen3.5-4B-MiniFantasy,3/25/2026,5/3/2026,chatml w/ <think> prefill,4.0,4.0,4.0,True,False,False,25.14,41.04,15.3,2.4,1.5,0.9,9.2,10.0,8.5,17.05,25.16,5.17,20.83,15.3,0.1493,0.1589,0.2411,0.3829,0.1093,-19.7%,58.9%,49.5%,50.9%,57.8%,45.6%,58.8%,52.9%,45.0%,45.0%,33.3%,54.8%,53.8%,44.2%,45.6%,58.5%,69.2%,Liberalism,True,16647,5,Qwen3_5ForConditionalGeneration,27.4,0.72,12.0,6.5,0.271,18.0,70.0,0.825,0.436,0.298,1.747,0.567,0.094,71.6,7595.0,124.6,21.62,6.0,4.0
|
| 1216 |
+
aifeifei798/Gemma-4-Queen-31B-it (<|channel>thought prefill),https://huggingface.co/aifeifei798/Gemma-4-Queen-31B-it,4/10/2026,5/3/2026,gemma-4 w/ <|channel>thought prefill,31.0,31.0,31.0,True,False,False,42.6,34.89,43.58,8.2,1.9,4.3,1.8,2.0,1.5,38.29,42.33,34.14,38.4,43.58,0.4665,0.4331,0.2816,0.48,0.2587,-19.7%,70.3%,47.0%,40.3%,66.2%,41.7%,61.0%,43.8%,28.1%,33.3%,27.7%,47.9%,44.2%,29.0%,64.0%,61.2%,73.5%,Liberalism,True,3591,2,Gemma4ForConditionalGeneration,26.7,0.68,12.2,6.8,0.342,5.0,31.0,0.848,0.442,0.313,1.42,0.488,0.237,32.1,2991.0,116.2,20.68,2.9,2.9
|
| 1217 |
+
aifeifei798/Gemma-4-Queen-31B-it,https://huggingface.co/aifeifei798/Gemma-4-Queen-31B-it,4/10/2026,5/3/2026,gemma-4,31.0,31.0,31.0,True,False,False,40.34,20.89,18.83,0.0,1.9,3.4,2.5,2.0,3.0,36.32,40.01,29.66,39.29,18.83,0.472,0.3475,0.4195,0.3944,0.331,-20.3%,69.5%,43.4%,40.6%,64.0%,38.1%,68.5%,36.9%,30.8%,33.3%,27.3%,49.2%,43.5%,29.0%,56.9%,61.5%,73.8%,Liberalism,False,0,0,Gemma4ForConditionalGeneration,27.3,0.69,11.9,6.6,0.329,5.0,19.0,0.85,0.443,0.312,1.327,0.466,0.309,31.7,3805.0,94.7,21.5,3.4,2.2
|
| 1218 |
+
aifeifei798/Qwen3.5-Queen-27B (<think> prefill),https://huggingface.co/aifeifei798/Qwen3.5-Queen-27B,3/19/2026,5/3/2026,chatml w/ <think> prefill,27.0,27.0,27.0,True,False,False,39.7,57.73,36.59,5.3,2.0,4.4,10.0,10.0,10.0,33.58,54.03,17.93,28.78,36.59,0.2407,0.2038,0.3865,0.3371,0.2709,-24.7%,65.1%,48.8%,46.1%,63.3%,46.7%,60.4%,53.5%,32.1%,44.8%,27.7%,54.6%,51.2%,32.5%,59.6%,61.9%,68.3%,Liberalism,True,13115,6,Qwen3_5ForConditionalGeneration,25.0,0.75,11.8,5.9,0.303,15.0,79.0,0.846,0.411,0.313,1.467,0.377,0.279,53.3,6200.0,99.2,22.1,2.2,1.9
|
| 1219 |
+
aifeifei798/Qwen3.5-Queen-27B (no thinking),https://huggingface.co/aifeifei798/Qwen3.5-Queen-27B,3/19/2026,5/3/2026,chatml w/ no thinking,27.0,27.0,27.0,True,False,False,39.79,45.34,24.26,4.1,2.0,1.6,8.8,9.0,8.5,28.93,30.7,21.72,34.37,24.26,0.3671,0.2294,0.4125,0.4495,0.26,-19.9%,66.2%,49.8%,43.0%,60.3%,40.6%,59.0%,49.0%,32.1%,41.9%,27.5%,50.2%,49.2%,29.6%,54.0%,57.7%,69.2%,Liberalism,False,0,2,Qwen3_5ForConditionalGeneration,40.4,0.78,12.9,6.1,0.345,19.0,95.0,0.824,0.445,0.328,1.47,0.487,0.275,39.4,5608.0,95.6,20.97,2.9,2.6
|