DontPlanToEnd committed on
Commit 26c91e8 · verified · 1 Parent(s): 9b05be5

Upload ugi-leaderboard-data.csv

Files changed (1)
  1. ugi-leaderboard-data.csv +8 -4
ugi-leaderboard-data.csv CHANGED
@@ -1166,7 +1166,11 @@ Kewk/Heretical-Qwen3.5-9B (no thinking),https://huggingface.co/Kewk/Heretical-Qw
1166
  Kewk/Heretical-Qwen3.5-9B (<think> prefill),https://huggingface.co/Kewk/Heretical-Qwen3.5-9B,3/3/2026,4/5/2026,chatml w/ <think> prefill,9.0,9.0,9.0,TRUE,FALSE,FALSE,NA,29.39,4.09,0.6,0.5,0.1,8.0,8.0,8.0,6.33,8.02,0.0,10.98,4.09,0.068,0.1873,0.2289,0.0293,0.0353,-7.2%,55.5%,54.4%,52.8%,51.0%,37.9%,49.4%,50.4%,52.7%,40.0%,40.8%,61.0%,46.9%,50.6%,39.8%,47.7%,65.4%,Centrism,True,15436,49,Qwen3_5ForConditionalGeneration,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,1.993,0.663,-0.041,111.1,6650.0,127.4,30.25,NA,NA
1167
  Qwen/Qwen3.6-Plus (reasoning=disabled),,4/2/2026,4/5/2026,,,,,FALSE,FALSE,TRUE,46.1,22.94,25.65,1.2,2.7,3.5,1.8,2.0,1.5,42.3,51.73,34.14,41.02,25.65,0.4149,0.2709,0.6465,0.4268,0.2921,-18.2%,64.2%,47.0%,44.4%,59.8%,40.6%,63.1%,44.8%,36.7%,35.4%,35.2%,47.1%,51.7%,34.6%,50.8%,63.1%,65.4%,Liberalism,False,0,0,,20.0,0.7,12.6,5.6,0.351,11.0,90.0,0.842,0.434,0.349,1.537,0.267,0.344,35.6,4838.0,69.5,21.18,2.7,3.4
1168
  Qwen/Qwen3.6-Plus (reasoning=enabled),,4/2/2026,4/5/2026,,,,,FALSE,FALSE,TRUE,58.23,43.51,50.26,7.1,3.7,5.0,3.0,3.0,3.0,51.02,75.37,40.34,37.35,50.26,0.2771,0.2012,0.5381,0.4692,0.3819,-22.2%,66.6%,47.0%,45.8%,63.3%,43.1%,64.0%,48.1%,31.7%,36.2%,32.3%,47.3%,57.7%,32.5%,59.0%,63.1%,67.7%,Liberalism,True,0,0,,21.4,0.76,13.1,5.2,0.325,8.0,42.0,0.846,0.41,0.307,1.35,0.09,0.353,48.5,6268.0,80.8,20.78,2.0,2.2
1169
- google/gemma-4-31B-it (<|channel>thought prefill),https://huggingface.co/google/gemma-4-31B-it,4/2/2026,4/12/2026,gemma-4 w/ <|channel>thought prefill,31.0,31.0,31.0,False,False,True,43.07,23.05,28.32,1.2,2.3,4.9,1.2,1.0,1.5,38.91,45.19,33.1,38.42,28.32,0.4379,0.3866,0.336,0.5092,0.2514,-20.8%,69.8%,46.4%,43.1%,64.3%,45.0%,62.1%,46.2%,29.8%,32.3%,28.5%,48.3%,47.9%,33.1%,60.0%,61.7%,71.2%,Liberalism,True,3684,0,Gemma4ForConditionalGeneration,29.9,0.65,12.0,6.8,0.36,6.0,31.0,0.855,0.445,0.327,1.45,0.488,0.243,34.0,3401.0,106.7,20.42,2.6,3.4
1170
- google/gemma-4-31B-it,https://huggingface.co/google/gemma-4-31B-it,4/2/2026,4/12/2026,gemma-4,31.0,31.0,31.0,False,False,True,38.57,21.54,19.81,0.0,1.9,3.7,2.5,2.0,3.0,34.36,35.81,31.38,35.88,19.81,0.472,0.1135,0.407,0.5037,0.2978,-19.4%,69.7%,45.0%,39.7%,66.3%,37.3%,66.7%,39.0%,30.4%,32.3%,28.1%,46.9%,44.6%,27.7%,62.1%,61.5%,75.4%,Liberalism,False,0,0,Gemma4ForConditionalGeneration,30.3,0.65,11.7,6.8,0.344,5.0,20.0,0.857,0.446,0.329,1.34,0.498,0.256,31.7,9838.0,96.4,20.47,3.7,2.5
1171
- coder3101/gemma-4-31B-it-heretic,https://huggingface.co/coder3101/gemma-4-31B-it-heretic,4/3/2026,4/12/2026,gemma-4,31.0,31.0,31.0,True,False,False,39.59,56.38,34.57,4.7,2.3,3.9,10.0,10.0,10.0,35.64,37.69,28.62,40.6,34.57,0.4167,0.3562,0.5345,0.3478,0.3746,-20.4%,65.3%,46.3%,44.3%,59.9%,40.2%,64.4%,43.5%,34.0%,42.9%,27.3%,52.7%,43.1%,37.1%,47.9%,60.0%,71.9%,Liberalism,False,0,1,Gemma4ForConditionalGeneration,30.2,0.65,11.8,7.0,0.337,6.0,19.0,0.861,0.449,0.324,1.223,0.604,0.315,35.5,3709.0,81.2,21.98,3.0,0.9
1172
- coder3101/gemma-4-31B-it-heretic (<|channel>thought prefill),https://huggingface.co/coder3101/gemma-4-31B-it-heretic,4/3/2026,4/12/2026,gemma-4 w/ <|channel>thought prefill,31.0,31.0,31.0,True,False,False,40.49,65.69,48.54,7.6,2.5,5.5,10.0,10.0,10.0,36.53,42.82,29.66,37.13,48.54,0.493,0.092,0.4331,0.5313,0.307,-20.0%,65.0%,48.8%,44.4%,61.2%,42.1%,58.3%,46.9%,33.5%,45.0%,26.5%,50.0%,44.2%,39.2%,50.2%,59.4%,74.2%,Liberalism,True,3009,6,Gemma4ForConditionalGeneration,29.6,0.64,11.8,7.0,0.351,7.0,19.0,0.861,0.445,0.326,1.327,0.648,0.279,30.4,11504.0,93.0,20.22,2.3,1.2
1166
  Kewk/Heretical-Qwen3.5-9B (<think> prefill),https://huggingface.co/Kewk/Heretical-Qwen3.5-9B,3/3/2026,4/5/2026,chatml w/ <think> prefill,9.0,9.0,9.0,TRUE,FALSE,FALSE,NA,29.39,4.09,0.6,0.5,0.1,8.0,8.0,8.0,6.33,8.02,0.0,10.98,4.09,0.068,0.1873,0.2289,0.0293,0.0353,-7.2%,55.5%,54.4%,52.8%,51.0%,37.9%,49.4%,50.4%,52.7%,40.0%,40.8%,61.0%,46.9%,50.6%,39.8%,47.7%,65.4%,Centrism,True,15436,49,Qwen3_5ForConditionalGeneration,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,1.993,0.663,-0.041,111.1,6650.0,127.4,30.25,NA,NA
1167
  Qwen/Qwen3.6-Plus (reasoning=disabled),,4/2/2026,4/5/2026,,,,,FALSE,FALSE,TRUE,46.1,22.94,25.65,1.2,2.7,3.5,1.8,2.0,1.5,42.3,51.73,34.14,41.02,25.65,0.4149,0.2709,0.6465,0.4268,0.2921,-18.2%,64.2%,47.0%,44.4%,59.8%,40.6%,63.1%,44.8%,36.7%,35.4%,35.2%,47.1%,51.7%,34.6%,50.8%,63.1%,65.4%,Liberalism,False,0,0,,20.0,0.7,12.6,5.6,0.351,11.0,90.0,0.842,0.434,0.349,1.537,0.267,0.344,35.6,4838.0,69.5,21.18,2.7,3.4
1168
  Qwen/Qwen3.6-Plus (reasoning=enabled),,4/2/2026,4/5/2026,,,,,FALSE,FALSE,TRUE,58.23,43.51,50.26,7.1,3.7,5.0,3.0,3.0,3.0,51.02,75.37,40.34,37.35,50.26,0.2771,0.2012,0.5381,0.4692,0.3819,-22.2%,66.6%,47.0%,45.8%,63.3%,43.1%,64.0%,48.1%,31.7%,36.2%,32.3%,47.3%,57.7%,32.5%,59.0%,63.1%,67.7%,Liberalism,True,0,0,,21.4,0.76,13.1,5.2,0.325,8.0,42.0,0.846,0.41,0.307,1.35,0.09,0.353,48.5,6268.0,80.8,20.78,2.0,2.2
1169
+ google/gemma-4-31B-it (<|channel>thought prefill),https://huggingface.co/google/gemma-4-31B-it,4/2/2026,4/12/2026,gemma-4 w/ <|channel>thought prefill,31.0,31.0,31.0,FALSE,FALSE,TRUE,43.07,23.05,28.32,1.2,2.3,4.9,1.2,1.0,1.5,38.91,45.19,33.1,38.42,28.32,0.4379,0.3866,0.336,0.5092,0.2514,-20.8%,69.8%,46.4%,43.1%,64.3%,45.0%,62.1%,46.2%,29.8%,32.3%,28.5%,48.3%,47.9%,33.1%,60.0%,61.7%,71.2%,Liberalism,True,3684,0,Gemma4ForConditionalGeneration,29.9,0.65,12.0,6.8,0.36,6.0,31.0,0.855,0.445,0.327,1.45,0.488,0.243,34.0,3401.0,106.7,20.42,2.6,3.4
1170
+ google/gemma-4-31B-it,https://huggingface.co/google/gemma-4-31B-it,4/2/2026,4/12/2026,gemma-4,31.0,31.0,31.0,FALSE,FALSE,TRUE,38.57,21.54,19.81,0.0,1.9,3.7,2.5,2.0,3.0,34.36,35.81,31.38,35.88,19.81,0.472,0.1135,0.407,0.5037,0.2978,-19.4%,69.7%,45.0%,39.7%,66.3%,37.3%,66.7%,39.0%,30.4%,32.3%,28.1%,46.9%,44.6%,27.7%,62.1%,61.5%,75.4%,Liberalism,False,0,0,Gemma4ForConditionalGeneration,30.3,0.65,11.7,6.8,0.344,5.0,20.0,0.857,0.446,0.329,1.34,0.498,0.256,31.7,9838.0,96.4,20.47,3.7,2.5
1171
+ coder3101/gemma-4-31B-it-heretic,https://huggingface.co/coder3101/gemma-4-31B-it-heretic,4/3/2026,4/12/2026,gemma-4,31.0,31.0,31.0,TRUE,FALSE,FALSE,39.59,56.38,34.57,4.7,2.3,3.9,10.0,10.0,10.0,35.64,37.69,28.62,40.6,34.57,0.4167,0.3562,0.5345,0.3478,0.3746,-20.4%,65.3%,46.3%,44.3%,59.9%,40.2%,64.4%,43.5%,34.0%,42.9%,27.3%,52.7%,43.1%,37.1%,47.9%,60.0%,71.9%,Liberalism,False,0,1,Gemma4ForConditionalGeneration,30.2,0.65,11.8,7.0,0.337,6.0,19.0,0.861,0.449,0.324,1.223,0.604,0.315,35.5,3709.0,81.2,21.98,3.0,0.9
1172
+ coder3101/gemma-4-31B-it-heretic (<|channel>thought prefill),https://huggingface.co/coder3101/gemma-4-31B-it-heretic,4/3/2026,4/12/2026,gemma-4 w/ <|channel>thought prefill,31.0,31.0,31.0,TRUE,FALSE,FALSE,40.49,65.69,48.54,7.6,2.5,5.5,10.0,10.0,10.0,36.53,42.82,29.66,37.13,48.54,0.493,0.092,0.4331,0.5313,0.307,-20.0%,65.0%,48.8%,44.4%,61.2%,42.1%,58.3%,46.9%,33.5%,45.0%,26.5%,50.0%,44.2%,39.2%,50.2%,59.4%,74.2%,Liberalism,True,3009,6,Gemma4ForConditionalGeneration,29.6,0.64,11.8,7.0,0.351,7.0,19.0,0.861,0.445,0.326,1.327,0.648,0.279,30.4,11504.0,93.0,20.22,2.3,1.2
1173
+ google/gemma-4-26B-A4B-it,https://huggingface.co/google/gemma-4-26B-A4B-it,4/2/2026,4/12/2026,gemma-4,4.0,26.0,26.0,FALSE,FALSE,TRUE,41.62,20.77,22.41,2.9,2.2,1.8,1.8,2.0,1.5,34.44,36.54,28.97,37.81,22.41,0.3538,0.2281,0.4582,0.4854,0.365,-18.2%,65.7%,48.4%,42.0%,59.8%,39.8%,60.6%,45.6%,32.3%,40.2%,30.4%,43.1%,46.0%,36.9%,48.1%,63.1%,68.1%,Liberalism,False,0,0,Gemma4ForConditionalGeneration,31.7,0.67,12.6,6.5,0.343,8.0,65.0,0.821,0.439,0.347,1.287,0.305,0.315,40.5,5636.0,89.9,20.63,3.9,5.0
1174
+ google/gemma-4-26B-A4B-it (<|channel>thought prefill),https://huggingface.co/google/gemma-4-26B-A4B-it,4/2/2026,4/12/2026,gemma-4 w/ <|channel>thought prefill,4.0,26.0,26.0,FALSE,FALSE,TRUE,43.8,25.21,29.06,5.3,2.2,1.9,1.8,2.0,1.5,34.31,46.27,26.9,29.77,29.06,0.3223,0.1,0.3432,0.413,0.31,-15.2%,65.1%,48.1%,42.7%,63.9%,40.6%,62.5%,47.5%,30.4%,40.2%,34.0%,42.1%,50.6%,35.4%,57.7%,60.4%,73.5%,Liberalism,True,5795,7,Gemma4ForConditionalGeneration,29.0,0.67,12.9,6.7,0.352,8.0,66.0,0.835,0.433,0.332,1.367,0.345,0.28,43.5,10820.0,105.5,21.32,3.4,5.1
1175
+ zai-org/GLM-5.1 (reasoning=disabled),https://huggingface.co/zai-org/GLM-5.1,4/4/2026,4/12/2026,,40.0,744.0,744.0,FALSE,FALSE,TRUE,54.9,39.99,53.73,4.7,5.5,5.8,1.2,1.0,1.5,49.51,51.43,51.38,45.71,53.73,0.3153,0.4727,0.5404,0.5923,0.3647,-18.6%,62.2%,46.6%,48.0%,61.7%,44.4%,55.8%,40.0%,40.8%,40.0%,32.5%,51.2%,51.2%,41.5%,62.7%,59.2%,63.3%,Liberalism,False,0,0,,34.9,0.74,12.6,5.7,0.374,16.0,84.0,0.877,0.443,0.336,1.363,0.113,0.335,44.2,2689.0,80.5,19.67,1.7,4.6
1176
+ zai-org/GLM-5.1 (reasoning=enabled),https://huggingface.co/zai-org/GLM-5.1,4/4/2026,4/15/2026,,40.0,744.0,744.0,FALSE,FALSE,TRUE,61.74,52.88,76.82,8.2,6.9,8.2,0.5,1.0,0.0,57.47,71.87,53.79,46.76,76.82,0.613,0.373,0.5099,0.4891,0.3532,-16.7%,63.8%,47.1%,45.3%,61.2%,47.7%,58.3%,47.3%,40.2%,36.0%,32.5%,49.0%,48.1%,39.0%,62.3%,59.0%,62.3%,Liberalism,True,0,0,,47.5,0.84,12.0,4.9,0.382,9.0,60.0,0.851,0.43,0.351,1.361,0.153,0.321,23.8,3534.0,83.9,20.6,1.7,3.0