Upload ugi-leaderboard-data.csv
Browse files- ugi-leaderboard-data.csv +17 -0
ugi-leaderboard-data.csv
CHANGED
|
@@ -1149,3 +1149,20 @@ MuXodious/gpt-4o-distil-Llama-3.3-70B-Instruct-PaperWitch-heresy,https://hugging
|
|
| 1149 |
KaraKaraWitch/gpt4o-distil-paperwitch-abliteration-L33-70b,https://huggingface.co/KaraKaraWitch/gpt4o-distil-paperwitch-abliteration-L33-70b,2/26/2026,3/21/2026,llama-3,70.0,70.0,70.0,TRUE,FALSE,FALSE,36.62,50.99,36.48,3.5,3.6,3.8,8.0,8.0,8.0,30.68,37.03,34.14,20.88,36.48,0.3047,0.1774,0.1106,0.2049,0.2464,-10.0%,60.8%,48.5%,43.6%,63.1%,41.2%,62.7%,49.6%,45.8%,34.4%,37.5%,49.2%,37.5%,44.2%,64.0%,59.2%,66.2%,Liberalism,False,0,0,LlamaForCausalLM,22.2,0.83,13.8,6.9,0.366,15.0,31.0,0.885,0.486,0.265,1.58,0.245,0.265,45.3,6951.0,170.3,23.83,2.9,3.1
|
| 1150 |
schonsense/70B_llama33_stock_unslop,https://huggingface.co/schonsense/70B_llama33_stock_unslop,2/28/2026,3/21/2026,llama-3,70.0,70.0,70.0,TRUE,TRUE,FALSE,26.19,36.79,33.93,3.5,3.2,3.5,4.2,4.0,4.5,30.31,39.71,29.66,21.56,33.93,0.3374,0.1723,0.1438,0.1929,0.2318,-17.2%,66.2%,48.2%,43.9%,61.5%,39.0%,64.2%,47.7%,36.7%,31.5%,33.3%,45.6%,47.3%,38.8%,60.0%,62.7%,61.9%,Liberalism,False,0,2,LlamaForCausalLM,41.0,0.87,11.0,6.9,0.363,9.0,46.0,0.907,0.542,0.317,1.55,0.345,0.24,42.0,7118.0,154.0,24.03,2.7,3.8
|
| 1151 |
MiniMaxAI/MiniMax-M2.7,,3/18/2026,3/29/2026,,,,,FALSE,FALSE,TRUE,37.53,16.94,14.16,0.0,1.2,2.8,2.2,2.0,2.5,32.22,46.95,13.45,36.25,14.16,0.4556,0.1289,0.4307,0.4355,0.3617,-15.5%,63.2%,46.4%,45.4%,59.2%,46.5%,61.2%,46.9%,37.1%,39.4%,34.0%,48.5%,51.7%,36.0%,53.8%,57.1%,66.9%,Liberalism,True,0,0,,48.3,0.86,11.3,5.0,0.363,36.0,74.0,0.84,0.429,0.346,1.3,0.282,0.316,32.8,8936.0,93.3,21.1,0.6,3.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1149 |
KaraKaraWitch/gpt4o-distil-paperwitch-abliteration-L33-70b,https://huggingface.co/KaraKaraWitch/gpt4o-distil-paperwitch-abliteration-L33-70b,2/26/2026,3/21/2026,llama-3,70.0,70.0,70.0,TRUE,FALSE,FALSE,36.62,50.99,36.48,3.5,3.6,3.8,8.0,8.0,8.0,30.68,37.03,34.14,20.88,36.48,0.3047,0.1774,0.1106,0.2049,0.2464,-10.0%,60.8%,48.5%,43.6%,63.1%,41.2%,62.7%,49.6%,45.8%,34.4%,37.5%,49.2%,37.5%,44.2%,64.0%,59.2%,66.2%,Liberalism,False,0,0,LlamaForCausalLM,22.2,0.83,13.8,6.9,0.366,15.0,31.0,0.885,0.486,0.265,1.58,0.245,0.265,45.3,6951.0,170.3,23.83,2.9,3.1
|
| 1150 |
schonsense/70B_llama33_stock_unslop,https://huggingface.co/schonsense/70B_llama33_stock_unslop,2/28/2026,3/21/2026,llama-3,70.0,70.0,70.0,TRUE,TRUE,FALSE,26.19,36.79,33.93,3.5,3.2,3.5,4.2,4.0,4.5,30.31,39.71,29.66,21.56,33.93,0.3374,0.1723,0.1438,0.1929,0.2318,-17.2%,66.2%,48.2%,43.9%,61.5%,39.0%,64.2%,47.7%,36.7%,31.5%,33.3%,45.6%,47.3%,38.8%,60.0%,62.7%,61.9%,Liberalism,False,0,2,LlamaForCausalLM,41.0,0.87,11.0,6.9,0.363,9.0,46.0,0.907,0.542,0.317,1.55,0.345,0.24,42.0,7118.0,154.0,24.03,2.7,3.8
|
| 1151 |
MiniMaxAI/MiniMax-M2.7,,3/18/2026,3/29/2026,,,,,FALSE,FALSE,TRUE,37.53,16.94,14.16,0.0,1.2,2.8,2.2,2.0,2.5,32.22,46.95,13.45,36.25,14.16,0.4556,0.1289,0.4307,0.4355,0.3617,-15.5%,63.2%,46.4%,45.4%,59.2%,46.5%,61.2%,46.9%,37.1%,39.4%,34.0%,48.5%,51.7%,36.0%,53.8%,57.1%,66.9%,Liberalism,True,0,0,,48.3,0.86,11.3,5.0,0.363,36.0,74.0,0.84,0.429,0.346,1.3,0.282,0.316,32.8,8936.0,93.3,21.1,0.6,3.0
|
| 1152 |
+
SicariusSicariiStuff/Assistant_Pepe_70B,https://huggingface.co/SicariusSicariiStuff/Assistant_Pepe_70B,4/1/2026,4/5/2026,llama-3,70.0,70.0,70.0,TRUE,FALSE,FALSE,34.32,59.52,41.78,3.5,3.6,5.5,9.5,10.0,9.0,35.25,42.64,35.86,27.24,41.78,0.3181,0.1981,0.2257,0.378,0.2423,2.3%,52.2%,51.3%,39.0%,58.4%,45.2%,56.9%,56.0%,50.4%,46.2%,46.9%,39.4%,33.8%,43.8%,54.6%,56.9%,63.8%,Centrism,False,0,1,LlamaForCausalLM,40.5,1.04,11.9,5.3,0.351,51.0,91.0,0.879,0.487,0.343,1.55,0.233,0.238,43.9,6349.0,128.2,21.67,9.2,7.5
|
| 1153 |
+
ArliAI/Qwen3.5-35B-A3B-Derestricted (no thinking),https://huggingface.co/ArliAI/Qwen3.5-35B-A3B-Derestricted,3/18/2026,4/5/2026,chatml w/ no thinking,3.0,35.0,35.0,TRUE,FALSE,FALSE,36.67,43.02,18.28,4.1,1.5,0.3,9.2,9.0,9.5,23.68,24.17,10.69,36.17,18.28,0.4522,0.1575,0.455,0.4963,0.2476,-11.5%,59.5%,47.8%,47.4%,54.4%,42.3%,61.2%,46.9%,40.4%,44.4%,36.7%,47.1%,50.4%,44.6%,44.6%,55.2%,63.5%,Liberalism,False,0,3,Qwen3_5MoeForConditionalGeneration,42.7,0.76,12.8,6.2,0.324,18.0,95.0,0.826,0.44,0.329,1.527,0.205,0.23,33.0,7648.0,90.3,20.53,3.3,3.9
|
| 1154 |
+
ArliAI/Qwen3.5-35B-A3B-Derestricted (<think> prefill),https://huggingface.co/ArliAI/Qwen3.5-35B-A3B-Derestricted,3/18/2026,4/5/2026,chatml w/ <think> prefill,3.0,35.0,35.0,TRUE,FALSE,FALSE,39.2,50.84,26.26,4.7,1.8,2.0,10.0,10.0,10.0,28.11,41.97,13.45,28.92,26.26,0.2776,0.1507,0.5778,0.2132,0.2265,-15.6%,63.0%,44.6%,43.3%,61.1%,44.4%,68.1%,46.2%,37.7%,37.9%,35.4%,45.8%,50.2%,33.8%,52.1%,60.8%,70.4%,Liberalism,True,14534,7,Qwen3_5MoeForConditionalGeneration,25.6,0.72,12.3,6.4,0.284,17.0,90.0,0.851,0.403,0.299,1.607,0.234,0.233,48.4,7916.0,76.6,23.7,3.2,4.0
|
| 1155 |
+
ArliAI/Qwen3.5-27B-Derestricted (<think> prefill),https://huggingface.co/ArliAI/Qwen3.5-27B-Derestricted,3/8/2026,4/5/2026,chatml w/ <think> prefill,27.0,27.0,27.0,TRUE,FALSE,FALSE,41.03,48.55,22.83,4.1,1.4,1.9,10.0,10.0,10.0,28.66,49.02,10.0,26.95,22.83,0.2934,0.0769,0.5087,0.2018,0.2668,-15.8%,62.8%,49.5%,45.7%,59.1%,44.4%,57.7%,50.6%,36.0%,42.9%,32.5%,54.0%,45.8%,37.3%,57.5%,56.2%,63.5%,Liberalism,True,12593,9,Qwen3_5ForConditionalGeneration,26.4,0.72,12.5,5.7,0.309,14.0,81.0,0.849,0.405,0.297,1.453,0.738,0.302,46.6,13105.0,84.0,23.88,2.2,1.6
|
| 1156 |
+
ArliAI/Qwen3.5-27B-Derestricted (no thinking),https://huggingface.co/ArliAI/Qwen3.5-27B-Derestricted,3/8/2026,4/5/2026,chatml w/ no thinking,27.0,27.0,27.0,TRUE,FALSE,FALSE,40.51,48.68,23.01,3.5,1.5,2.3,10.0,10.0,10.0,29.23,36.8,18.97,31.92,23.01,0.4223,0.1885,0.3736,0.302,0.3095,-14.5%,62.8%,49.2%,46.2%,58.5%,43.3%,57.1%,47.9%,37.9%,40.6%,32.9%,55.6%,43.3%,39.6%,52.7%,55.2%,67.7%,Liberalism,False,0,3,Qwen3_5ForConditionalGeneration,40.3,0.76,12.9,5.8,0.339,20.0,98.0,0.828,0.442,0.308,1.39,0.62,0.336,35.1,6616.0,101.0,22.5,2.2,1.6
|
| 1157 |
+
Crownelius/Crow-9B-HERETIC-4.6 (no thinking),https://huggingface.co/Crownelius/Crow-9B-HERETIC-4.6,3/3/2026,4/5/2026,chatml w/ no thinking,9.0,9.0,9.0,TRUE,FALSE,FALSE,31.83,35.69,14.79,1.2,1.8,1.4,7.8,8.0,7.5,19.5,28.25,7.93,22.31,14.79,0.1902,0.1503,0.1681,0.3978,0.2089,-9.2%,61.7%,48.1%,45.3%,59.5%,41.5%,62.5%,48.3%,40.6%,34.4%,40.0%,45.4%,48.1%,42.3%,56.9%,57.1%,64.6%,Liberalism,False,0,0,Qwen3_5ForConditionalGeneration,18.4,0.65,13.9,4.7,0.334,16.0,54.0,0.829,0.422,0.336,1.637,0.232,0.208,61.9,7932.0,144.9,21.47,4.5,3.4
|
| 1158 |
+
Crownelius/Crow-9B-HERETIC-4.6 (<think> prefill),https://huggingface.co/Crownelius/Crow-9B-HERETIC-4.6,3/3/2026,4/5/2026,chatml w/ <think> prefill,9.0,9.0,9.0,TRUE,FALSE,FALSE,29.13,27.78,12.91,1.2,1.7,0.9,5.8,5.0,6.5,15.95,33.2,1.72,12.91,12.91,0.1489,0.1207,0.182,0.004,0.19,-13.2%,60.8%,49.7%,44.6%,62.7%,39.4%,60.8%,49.4%,33.1%,44.4%,40.0%,48.1%,46.0%,39.6%,59.6%,62.7%,65.8%,Liberalism,True,12321,8,Qwen3_5ForConditionalGeneration,25.8,0.7,12.2,5.9,0.299,13.0,64.0,0.858,0.42,0.303,1.614,0.218,0.157,71.7,9396.0,140.3,37.82,5.0,5.8
|
| 1159 |
+
trohrbaugh/Qwen3.5-9B-heretic-v2 (no thinking),https://huggingface.co/trohrbaugh/Qwen3.5-9B-heretic-v2,3/2/2026,4/5/2026,chatml w/ no thinking,9.0,9.0,9.0,TRUE,FALSE,FALSE,32.52,37.57,10.1,1.2,0.9,1.0,9.2,9.0,9.5,16.05,17.23,7.93,23.0,10.1,0.2306,0.1752,0.2564,0.3236,0.164,-7.3%,59.9%,49.3%,47.2%,56.7%,48.1%,55.2%,51.2%,38.8%,41.2%,40.4%,46.7%,47.3%,47.7%,46.0%,54.4%,69.8%,Liberalism,False,0,0,Qwen3_5ForConditionalGeneration,38.7,0.73,13.4,7.1,0.316,23.0,96.0,0.833,0.442,0.303,1.73,0.554,0.203,54.8,7021.0,121.2,22.25,4.0,3.4
|
| 1160 |
+
trohrbaugh/Qwen3.5-9B-heretic-v2 (<think> prefill),https://huggingface.co/trohrbaugh/Qwen3.5-9B-heretic-v2,3/2/2026,4/5/2026,chatml w/ <think> prefill,9.0,9.0,9.0,TRUE,FALSE,FALSE,35.71,40.53,13.29,1.8,0.8,1.7,9.5,9.0,10.0,24.2,40.25,5.17,27.16,13.29,0.12,0.1611,0.4288,0.3603,0.2878,-16.7%,60.2%,47.6%,45.0%,61.6%,49.0%,62.3%,54.0%,40.6%,48.1%,30.6%,48.8%,44.6%,41.7%,55.2%,59.0%,70.6%,Liberalism,True,16439,13,Qwen3_5ForConditionalGeneration,22.0,0.66,12.7,7.4,0.29,20.0,79.0,0.846,0.412,0.297,1.348,0.86,0.27,81.2,7510.0,93.5,21.85,3.7,3.1
|
| 1161 |
+
DavidAU/Qwen3.5-9B-Claude-4.6-HighIQ-THINKING-HERETIC-UNCENSORED (no thinking),https://huggingface.co/DavidAU/Qwen3.5-9B-Claude-4.6-HighIQ-THINKING-HERETIC-UNCENSORED,3/4/2026,4/5/2026,chatml w/ no thinking,9.0,9.0,9.0,TRUE,FALSE,FALSE,29.57,18.19,13.54,1.8,1.4,1.0,2.8,4.0,1.5,15.29,24.3,1.72,19.85,13.54,0.2039,0.1031,0.0586,0.3587,0.2684,-2.5%,57.3%,48.7%,45.9%,52.5%,46.7%,55.4%,48.3%,42.5%,39.6%,45.6%,46.7%,53.8%,40.0%,47.3%,49.2%,61.2%,Centrism,False,0,0,Qwen3_5ForConditionalGeneration,43.5,0.87,12.6,5.9,0.324,17.0,48.0,0.849,0.457,0.301,1.447,0.378,0.26,59.3,10571.0,214.5,21.87,2.7,3.6
|
| 1162 |
+
DavidAU/Qwen3.5-9B-Claude-4.6-HighIQ-THINKING-HERETIC-UNCENSORED (<think> prefill),https://huggingface.co/DavidAU/Qwen3.5-9B-Claude-4.6-HighIQ-THINKING-HERETIC-UNCENSORED,3/4/2026,4/5/2026,chatml w/ <think> prefill,9.0,9.0,9.0,TRUE,FALSE,FALSE,32.09,30.32,12.98,1.2,1.4,1.3,6.5,7.0,6.0,22.93,35.34,7.93,25.52,12.98,0.0789,0.1999,0.2924,0.4285,0.2761,-14.4%,66.4%,45.3%,39.2%,63.8%,45.0%,65.0%,45.8%,32.9%,35.4%,32.5%,46.9%,44.2%,26.5%,66.5%,58.5%,66.5%,Liberalism,True,8959,0,Qwen3_5ForConditionalGeneration,26.3,0.71,11.6,6.2,0.298,23.0,44.0,0.865,0.409,0.3,1.421,0.564,0.284,102.5,6303.0,114.1,21.17,3.3,4.2
|
| 1163 |
+
huihui-ai/Huihui-Qwen3.5-9B-abliterated (no thinking),https://huggingface.co/huihui-ai/Huihui-Qwen3.5-9B-abliterated,3/9/2026,4/5/2026,chatml w/ no thinking,9.0,9.0,9.0,TRUE,FALSE,FALSE,29.12,37.63,11.44,1.8,0.9,0.9,9.0,9.0,9.0,14.88,19.53,5.17,19.95,11.44,0.2297,0.1909,0.1446,0.2331,0.199,-6.0%,60.0%,49.8%,46.7%,52.2%,44.0%,61.2%,54.6%,36.5%,44.4%,39.2%,41.9%,42.1%,56.0%,35.6%,56.2%,64.6%,Centrism,False,0,0,Qwen3_5ForConditionalGeneration,37.8,0.73,13.2,7.5,0.303,20.0,90.0,0.833,0.456,0.296,1.522,0.684,0.199,54.9,6548.0,153.7,23.4,5.2,3.4
|
| 1164 |
+
huihui-ai/Huihui-Qwen3.5-9B-abliterated (<think> prefill),https://huggingface.co/huihui-ai/Huihui-Qwen3.5-9B-abliterated,3/9/2026,4/5/2026,chatml w/ <think> prefill,9.0,9.0,9.0,TRUE,FALSE,FALSE,NA,43.16,17.23,1.2,1.5,2.4,9.5,9.0,10.0,19.8,27.07,5.17,27.14,17.23,0.1834,0.2118,0.3917,0.2936,0.2767,-7.2%,54.3%,46.9%,39.9%,57.0%,49.0%,61.0%,50.6%,50.4%,52.1%,34.6%,38.1%,42.5%,39.0%,42.9%,55.0%,73.1%,Centrism,True,16286,33,Qwen3_5ForConditionalGeneration,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,1.376,0.902,0.273,63.3,6005.0,98.4,22.6,NA,NA
|
| 1165 |
+
Kewk/Heretical-Qwen3.5-9B (no thinking),https://huggingface.co/Kewk/Heretical-Qwen3.5-9B,3/3/2026,4/5/2026,chatml w/ no thinking,9.0,9.0,9.0,TRUE,FALSE,FALSE,8.67,30.69,7.29,1.2,0.6,0.5,7.8,6.0,9.5,6.57,9.17,1.72,8.81,7.29,0.1705,0.1089,0.0422,0.0368,0.082,3.4%,52.3%,53.5%,48.1%,45.0%,43.8%,57.9%,62.3%,52.1%,47.5%,43.5%,45.6%,53.8%,45.0%,46.7%,38.8%,49.6%,Centrism,False,0,3,Qwen3_5ForConditionalGeneration,31.6,0.73,10.5,7.0,0.327,18.0,85.0,0.855,0.474,0.329,1.993,0.478,0.078,66.2,10151.0,240.7,29.47,3.7,2.6
|
| 1166 |
+
Kewk/Heretical-Qwen3.5-9B (<think> prefill),https://huggingface.co/Kewk/Heretical-Qwen3.5-9B,3/3/2026,4/5/2026,chatml w/ <think> prefill,9.0,9.0,9.0,TRUE,FALSE,FALSE,NA,29.39,4.09,0.6,0.5,0.1,8.0,8.0,8.0,6.33,8.02,0.0,10.98,4.09,0.068,0.1873,0.2289,0.0293,0.0353,-7.2%,55.5%,54.4%,52.8%,51.0%,37.9%,49.4%,50.4%,52.7%,40.0%,40.8%,61.0%,46.9%,50.6%,39.8%,47.7%,65.4%,Centrism,True,15436,49,Qwen3_5ForConditionalGeneration,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,1.993,0.663,-0.041,111.1,6650.0,127.4,30.25,NA,NA
|
| 1167 |
+
Qwen/Qwen3.6-Plus (reasoning=disabled),,4/2/2026,4/5/2026,,,,,FALSE,FALSE,TRUE,46.1,22.94,25.65,1.2,2.7,3.5,1.8,2.0,1.5,42.3,51.73,34.14,41.02,25.65,0.4149,0.2709,0.6465,0.4268,0.2921,-18.2%,64.2%,47.0%,44.4%,59.8%,40.6%,63.1%,44.8%,36.7%,35.4%,35.2%,47.1%,51.7%,34.6%,50.8%,63.1%,65.4%,Liberalism,False,0,0,,20.0,0.7,12.6,5.6,0.351,11.0,90.0,0.842,0.434,0.349,1.537,0.267,0.344,35.6,4838.0,69.5,21.18,2.7,3.4
|
| 1168 |
+
Qwen/Qwen3.6-Plus (reasoning=enabled),,4/2/2026,4/5/2026,,,,,FALSE,FALSE,TRUE,58.23,43.51,50.26,7.1,3.7,5.0,3.0,3.0,3.0,51.02,75.37,40.34,37.35,50.26,0.2771,0.2012,0.5381,0.4692,0.3819,-22.2%,66.6%,47.0%,45.8%,63.3%,43.1%,64.0%,48.1%,31.7%,36.2%,32.3%,47.3%,57.7%,32.5%,59.0%,63.1%,67.7%,Liberalism,True,0,0,,21.4,0.76,13.1,5.2,0.325,8.0,42.0,0.846,0.41,0.307,1.35,0.09,0.353,48.5,6268.0,80.8,20.78,2.0,2.2
|