DontPlanToEnd commited on
Commit
c1a9920
·
verified ·
1 Parent(s): d6a37bb

Upload ugi-leaderboard-data.csv

Browse files
Files changed (1) hide show
  1. ugi-leaderboard-data.csv +6 -2
ugi-leaderboard-data.csv CHANGED
@@ -395,8 +395,8 @@ MarinaraSpaghetti/NemoRemix-12B,https://huggingface.co/MarinaraSpaghetti/NemoRem
395
  zai-org/GLM-4.6 (reasoning=enabled),https://huggingface.co/zai-org/GLM-4.6,9/29/2025,10/2/2025,,32.0,355.0,355.0,FALSE,FALSE,TRUE,53.31,45.76,4.7,5.3,6.1,2.0,4.0,0.0,44.94,63.65,43.1,28.08,53.81,0.2388,0.4197,0.377,0.1282,0.2401,-17.2%,66.0%,48.1%,43.1%,60.6%,39.6%,55.6%,39.4%,34.6%,37.5%,29.8%,49.4%,46.7%,33.3%,61.0%,57.5%,63.3%,Liberalism,True,0,0,,21.5,0.6,13.8,5.2,0.377,13.0,66.0,0.864,0.42,0.35,1.64,0.335,0.298,53.6,3103.0,100.5,25.37,1.8,4.1
396
  zai-org/GLM-4.5 (reasoning=enabled),https://huggingface.co/zai-org/GLM-4.5,7/28/2025,10/2/2025,,32.0,355.0,355.0,FALSE,FALSE,TRUE,57.34,42.07,7.1,5.0,4.9,0.0,0.0,0.0,46.56,61.56,39.66,38.45,55.21,0.6325,0.1442,0.3097,0.4963,0.3399,-22.5%,67.2%,46.1%,47.6%,61.2%,45.0%,61.2%,44.6%,35.0%,32.3%,31.0%,52.7%,52.7%,37.3%,59.2%,62.1%,62.5%,Liberalism,True,0,0,,55.6,0.84,12.9,6.2,0.36,21.0,14.0,0.921,0.472,0.289,1.307,0.367,0.293,22.9,8193.0,111.0,20.53,1.7,1.3
397
  zai-org/GLM-4.5 (reasoning=disabled),https://huggingface.co/zai-org/GLM-4.5,7/28/2025,10/2/2025,,32.0,355.0,355.0,FALSE,FALSE,TRUE,54.18,60.23,7.6,7.1,4.6,4.8,3.0,6.5,38.72,39.58,37.93,38.67,64.2,0.5009,0.1632,0.4765,0.439,0.3537,-13.8%,65.2%,46.9%,42.8%,60.4%,43.3%,62.3%,46.2%,35.2%,33.8%,35.4%,49.0%,46.0%,33.5%,55.6%,58.5%,67.1%,Liberalism,False,0,0,,44.4,0.78,13.5,5.7,0.352,13.0,36.0,0.904,0.452,0.307,1.29,0.356,0.304,29.9,7433.0,87.7,21.07,2.2,2.2
398
- zai-org/GLM-4.5-Air (reasoning=enabled),https://huggingface.co/zai-org/GLM-4.5-Air,7/28/2025,10/2/2025,,32.0,110.0,110.0,FALSE,FALSE,TRUE,48.21,34.22,4.7,4.3,2.9,1.8,2.0,1.5,30.82,47.81,24.14,20.51,39.45,0.2422,0.1068,0.2562,0.1949,0.2254,-20.6%,65.8%,45.6%,47.2%,56.9%,49.2%,58.8%,44.6%,31.0%,37.1%,34.6%,51.2%,50.8%,39.4%,46.0%,64.4%,60.4%,Liberalism,True,0,0,,57.1,0.84,13.2,6.0,0.364,16.0,26.0,0.921,0.474,0.295,1.557,0.257,0.212,53.1,10303.0,121.3,24.0,1.1,2.1
399
- zai-org/GLM-4.5-Air (reasoning=disabled),https://huggingface.co/zai-org/GLM-4.5-Air,7/28/2025,10/2/2025,,32.0,110.0,110.0,FALSE,FALSE,TRUE,45.55,44.02,2.4,3.0,5.6,6.8,5.0,8.5,29.88,37.23,22.41,30.01,36.68,0.3746,0.2262,0.309,0.295,0.2956,-21.0%,64.7%,46.2%,46.6%,62.6%,46.5%,61.5%,46.5%,34.4%,37.9%,33.5%,52.9%,49.6%,37.3%,59.8%,63.8%,64.2%,Liberalism,False,0,0,,51.0,0.81,13.0,5.7,0.36,12.0,46.0,0.897,0.464,0.318,1.513,0.003,0.259,38.8,5676.0,111.2,22.58,2.7,2.3
400
  deepseek-ai/DeepSeek-V3.2-Exp (reasoning=enabled),https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp,9/29/2025,10/2/2025,,37.0,671.0,671.0,FALSE,FALSE,TRUE,54.0,54.75,8.2,4.4,5.2,4.8,3.0,6.5,46.34,61.0,48.62,29.42,57.01,0.3599,0.1904,0.5167,0.1224,0.2814,-20.3%,67.4%,49.3%,44.7%,63.7%,48.1%,57.7%,53.8%,29.8%,35.6%,32.3%,47.1%,49.8%,37.1%,59.8%,64.4%,66.9%,Liberalism,True,0,0,,27.1,0.61,13.9,5.7,0.346,25.0,90.0,0.869,0.423,0.331,1.42,0.298,0.253,40.0,6561.0,83.1,25.52,2.2,3.6
401
  deepseek-ai/DeepSeek-V3.2-Exp (reasoning=disabled),https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp,9/29/2025,10/2/2025,,37.0,671.0,671.0,FALSE,FALSE,TRUE,51.04,53.91,6.5,4.3,5.3,6.0,4.0,8.0,47.5,53.17,53.45,35.88,52.01,0.4356,0.3217,0.3677,0.4079,0.2612,-21.1%,65.6%,45.7%,42.5%,59.5%,48.5%,59.6%,45.2%,32.9%,40.2%,30.0%,42.9%,49.6%,35.0%,47.5%,65.4%,65.6%,Liberalism,False,0,0,,23.6,0.59,14.5,5.7,0.346,27.0,98.0,0.857,0.42,0.34,1.427,0.275,0.21,34.2,4110.0,101.8,21.37,2.4,3.3
402
  deepseek-ai/DeepSeek-V3.1-Terminus (reasoning=enabled),https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Terminus,9/22/2025,10/2/2025,,37.0,671.0,671.0,FALSE,FALSE,TRUE,54.0,53.59,7.1,5.3,5.8,3.5,3.0,4.0,49.47,61.52,57.59,29.3,59.4,0.3333,0.1975,0.3799,0.2558,0.2986,-23.1%,69.7%,49.9%,45.3%,64.2%,45.0%,57.5%,52.1%,30.2%,34.0%,26.7%,46.0%,53.3%,36.5%,60.2%,61.7%,70.6%,Liberalism,True,0,0,,24.7,0.63,14.1,5.6,0.371,32.0,100.0,0.869,0.423,0.342,1.383,0.342,0.269,42.4,6366.0,100.1,23.08,1.8,2.3
@@ -530,3 +530,7 @@ deepcogito/cogito-v2-preview-llama-109B-MoE (llama-4 w/ <think> prefill),https:/
530
  meta-llama/Llama-4-Scout-17B-16E-Instruct,https://huggingface.co/meta-llama/Llama-4-Scout-17B-16E-Instruct,4/5/2025,10/26/2025,llama-4,17.0,109.0,109.0,False,False,True,26.67,27.95,2.9,1.0,2.5,5.2,5.0,5.5,20.85,33.84,10.0,18.71,20.28,0.3185,0.1193,0.1699,0.072,0.256,-18.8%,70.2%,48.3%,42.9%,60.6%,42.9%,59.2%,46.9%,24.2%,34.2%,31.0%,52.7%,45.2%,30.8%,59.2%,60.2%,62.3%,Liberalism,False,0,4,Llama4ForConditionalGeneration,38.3,1.07,12.7,7.7,0.362,19.0,36.0,0.901,0.525,0.329,1.437,0.369,0.225,43.9,9481.0,144.2,27.23,1.5,2.8
531
  TareksGraveyard/L3.3-TRP-BASE-80-70B,https://huggingface.co/TareksGraveyard/L3.3-TRP-BASE-80-70B,3/6/2025,10/26/2025,llama-3,70.0,70.0,70.0,True,True,False,NA,44.08,2.9,2.3,4.3,8.5,9.0,8.0,24.62,24.44,28.62,20.78,31.3,0.2483,0.1568,0.0694,0.2895,0.2752,-4.9%,57.5%,44.6%,45.3%,59.8%,45.8%,65.4%,44.2%,44.2%,37.7%,45.8%,43.8%,50.6%,41.0%,57.7%,56.0%,65.4%,Liberalism,False,0,51,LlamaForCausalLM,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,1.447,0.351,0.269,52.2,7673.0,201.9,22.65,NA,NA
532
  huihui-ai/Huihui-gpt-oss-120b-BF16-abliterated (reasoning=medium),https://huggingface.co/huihui-ai/Huihui-gpt-oss-120b-BF16-abliterated,8/16/2025,10/26/2025,gpt-oss,120.0,120.0,120.0,True,False,False,NA,22.32,1.2,1.4,1.3,5.2,7.0,3.5,16.72,31.16,6.9,12.1,12.89,0.0981,0.1358,0.1279,0.0169,0.2263,-1.2%,45.7%,45.9%,56.8%,49.4%,44.6%,68.5%,51.0%,48.5%,61.5%,52.9%,59.4%,57.5%,53.8%,46.5%,46.2%,55.4%,Centrism,True,1836,15,GptOssForCausalLM,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,1.577,0.396,0.251,91.0,8583.0,161.2,32.2,NA,NA
 
 
 
 
 
395
  zai-org/GLM-4.6 (reasoning=enabled),https://huggingface.co/zai-org/GLM-4.6,9/29/2025,10/2/2025,,32.0,355.0,355.0,FALSE,FALSE,TRUE,53.31,45.76,4.7,5.3,6.1,2.0,4.0,0.0,44.94,63.65,43.1,28.08,53.81,0.2388,0.4197,0.377,0.1282,0.2401,-17.2%,66.0%,48.1%,43.1%,60.6%,39.6%,55.6%,39.4%,34.6%,37.5%,29.8%,49.4%,46.7%,33.3%,61.0%,57.5%,63.3%,Liberalism,True,0,0,,21.5,0.6,13.8,5.2,0.377,13.0,66.0,0.864,0.42,0.35,1.64,0.335,0.298,53.6,3103.0,100.5,25.37,1.8,4.1
396
  zai-org/GLM-4.5 (reasoning=enabled),https://huggingface.co/zai-org/GLM-4.5,7/28/2025,10/2/2025,,32.0,355.0,355.0,FALSE,FALSE,TRUE,57.34,42.07,7.1,5.0,4.9,0.0,0.0,0.0,46.56,61.56,39.66,38.45,55.21,0.6325,0.1442,0.3097,0.4963,0.3399,-22.5%,67.2%,46.1%,47.6%,61.2%,45.0%,61.2%,44.6%,35.0%,32.3%,31.0%,52.7%,52.7%,37.3%,59.2%,62.1%,62.5%,Liberalism,True,0,0,,55.6,0.84,12.9,6.2,0.36,21.0,14.0,0.921,0.472,0.289,1.307,0.367,0.293,22.9,8193.0,111.0,20.53,1.7,1.3
397
  zai-org/GLM-4.5 (reasoning=disabled),https://huggingface.co/zai-org/GLM-4.5,7/28/2025,10/2/2025,,32.0,355.0,355.0,FALSE,FALSE,TRUE,54.18,60.23,7.6,7.1,4.6,4.8,3.0,6.5,38.72,39.58,37.93,38.67,64.2,0.5009,0.1632,0.4765,0.439,0.3537,-13.8%,65.2%,46.9%,42.8%,60.4%,43.3%,62.3%,46.2%,35.2%,33.8%,35.4%,49.0%,46.0%,33.5%,55.6%,58.5%,67.1%,Liberalism,False,0,0,,44.4,0.78,13.5,5.7,0.352,13.0,36.0,0.904,0.452,0.307,1.29,0.356,0.304,29.9,7433.0,87.7,21.07,2.2,2.2
398
+ zai-org/GLM-4.5-Air (reasoning=enabled),https://huggingface.co/zai-org/GLM-4.5-Air,7/28/2025,10/2/2025,,12.0,106.0,106.0,FALSE,FALSE,TRUE,48.21,34.22,4.7,4.3,2.9,1.8,2.0,1.5,30.82,47.81,24.14,20.51,39.45,0.2422,0.1068,0.2562,0.1949,0.2254,-20.6%,65.8%,45.6%,47.2%,56.9%,49.2%,58.8%,44.6%,31.0%,37.1%,34.6%,51.2%,50.8%,39.4%,46.0%,64.4%,60.4%,Liberalism,True,0,0,,57.1,0.84,13.2,6.0,0.364,16.0,26.0,0.921,0.474,0.295,1.557,0.257,0.212,53.1,10303.0,121.3,24.0,1.1,2.1
399
+ zai-org/GLM-4.5-Air (reasoning=disabled),https://huggingface.co/zai-org/GLM-4.5-Air,7/28/2025,10/2/2025,,12.0,106.0,106.0,FALSE,FALSE,TRUE,45.55,44.02,2.4,3.0,5.6,6.8,5.0,8.5,29.88,37.23,22.41,30.01,36.68,0.3746,0.2262,0.309,0.295,0.2956,-21.0%,64.7%,46.2%,46.6%,62.6%,46.5%,61.5%,46.5%,34.4%,37.9%,33.5%,52.9%,49.6%,37.3%,59.8%,63.8%,64.2%,Liberalism,False,0,0,,51.0,0.81,13.0,5.7,0.36,12.0,46.0,0.897,0.464,0.318,1.513,0.003,0.259,38.8,5676.0,111.2,22.58,2.7,2.3
400
  deepseek-ai/DeepSeek-V3.2-Exp (reasoning=enabled),https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp,9/29/2025,10/2/2025,,37.0,671.0,671.0,FALSE,FALSE,TRUE,54.0,54.75,8.2,4.4,5.2,4.8,3.0,6.5,46.34,61.0,48.62,29.42,57.01,0.3599,0.1904,0.5167,0.1224,0.2814,-20.3%,67.4%,49.3%,44.7%,63.7%,48.1%,57.7%,53.8%,29.8%,35.6%,32.3%,47.1%,49.8%,37.1%,59.8%,64.4%,66.9%,Liberalism,True,0,0,,27.1,0.61,13.9,5.7,0.346,25.0,90.0,0.869,0.423,0.331,1.42,0.298,0.253,40.0,6561.0,83.1,25.52,2.2,3.6
401
  deepseek-ai/DeepSeek-V3.2-Exp (reasoning=disabled),https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp,9/29/2025,10/2/2025,,37.0,671.0,671.0,FALSE,FALSE,TRUE,51.04,53.91,6.5,4.3,5.3,6.0,4.0,8.0,47.5,53.17,53.45,35.88,52.01,0.4356,0.3217,0.3677,0.4079,0.2612,-21.1%,65.6%,45.7%,42.5%,59.5%,48.5%,59.6%,45.2%,32.9%,40.2%,30.0%,42.9%,49.6%,35.0%,47.5%,65.4%,65.6%,Liberalism,False,0,0,,23.6,0.59,14.5,5.7,0.346,27.0,98.0,0.857,0.42,0.34,1.427,0.275,0.21,34.2,4110.0,101.8,21.37,2.4,3.3
402
  deepseek-ai/DeepSeek-V3.1-Terminus (reasoning=enabled),https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Terminus,9/22/2025,10/2/2025,,37.0,671.0,671.0,FALSE,FALSE,TRUE,54.0,53.59,7.1,5.3,5.8,3.5,3.0,4.0,49.47,61.52,57.59,29.3,59.4,0.3333,0.1975,0.3799,0.2558,0.2986,-23.1%,69.7%,49.9%,45.3%,64.2%,45.0%,57.5%,52.1%,30.2%,34.0%,26.7%,46.0%,53.3%,36.5%,60.2%,61.7%,70.6%,Liberalism,True,0,0,,24.7,0.63,14.1,5.6,0.371,32.0,100.0,0.869,0.423,0.342,1.383,0.342,0.269,42.4,6366.0,100.1,23.08,1.8,2.3
 
530
  meta-llama/Llama-4-Scout-17B-16E-Instruct,https://huggingface.co/meta-llama/Llama-4-Scout-17B-16E-Instruct,4/5/2025,10/26/2025,llama-4,17.0,109.0,109.0,False,False,True,26.67,27.95,2.9,1.0,2.5,5.2,5.0,5.5,20.85,33.84,10.0,18.71,20.28,0.3185,0.1193,0.1699,0.072,0.256,-18.8%,70.2%,48.3%,42.9%,60.6%,42.9%,59.2%,46.9%,24.2%,34.2%,31.0%,52.7%,45.2%,30.8%,59.2%,60.2%,62.3%,Liberalism,False,0,4,Llama4ForConditionalGeneration,38.3,1.07,12.7,7.7,0.362,19.0,36.0,0.901,0.525,0.329,1.437,0.369,0.225,43.9,9481.0,144.2,27.23,1.5,2.8
531
  TareksGraveyard/L3.3-TRP-BASE-80-70B,https://huggingface.co/TareksGraveyard/L3.3-TRP-BASE-80-70B,3/6/2025,10/26/2025,llama-3,70.0,70.0,70.0,True,True,False,NA,44.08,2.9,2.3,4.3,8.5,9.0,8.0,24.62,24.44,28.62,20.78,31.3,0.2483,0.1568,0.0694,0.2895,0.2752,-4.9%,57.5%,44.6%,45.3%,59.8%,45.8%,65.4%,44.2%,44.2%,37.7%,45.8%,43.8%,50.6%,41.0%,57.7%,56.0%,65.4%,Liberalism,False,0,51,LlamaForCausalLM,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,1.447,0.351,0.269,52.2,7673.0,201.9,22.65,NA,NA
532
  huihui-ai/Huihui-gpt-oss-120b-BF16-abliterated (reasoning=medium),https://huggingface.co/huihui-ai/Huihui-gpt-oss-120b-BF16-abliterated,8/16/2025,10/26/2025,gpt-oss,120.0,120.0,120.0,True,False,False,NA,22.32,1.2,1.4,1.3,5.2,7.0,3.5,16.72,31.16,6.9,12.1,12.89,0.0981,0.1358,0.1279,0.0169,0.2263,-1.2%,45.7%,45.9%,56.8%,49.4%,44.6%,68.5%,51.0%,48.5%,61.5%,52.9%,59.4%,57.5%,53.8%,46.5%,46.2%,55.4%,Centrism,True,1836,15,GptOssForCausalLM,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,1.577,0.396,0.251,91.0,8583.0,161.2,32.2,NA,NA
533
+ TheDrummer/GLM-Steam-106B-A12B-v1 (glm-4.5 (no-think)),https://huggingface.co/TheDrummer/GLM-Steam-106B-A12B-v1,8/26/2025,10/26/2025,glm-4.5 (no-think),12.0,106.0,106.0,True,False,False,12.51,31.75,5.3,2.9,2.4,2.5,4.0,1.0,29.88,47.02,18.97,23.65,33.86,0.3175,0.1493,0.3251,0.1872,0.2036,-21.6%,69.2%,47.8%,48.3%,63.3%,42.5%,59.2%,45.0%,29.8%,29.6%,33.1%,47.5%,56.5%,40.8%,58.3%,63.5%,67.9%,Liberalism,True,1818,10,Glm4MoeForCausalLM,22.6,0.63,16.4,7.2,0.288,193.0,96.0,0.858,0.415,0.33,1.663,0.026,0.146,44.0,7973.0,108.4,24.13,3.1,5.6
534
+ TheDrummer/GLM-Steam-106B-A12B-v1 (glm-4.5),https://huggingface.co/TheDrummer/GLM-Steam-106B-A12B-v1,8/26/2025,10/27/2025,glm-4.5,12.0,106.0,106.0,True,False,False,18.84,29.22,2.9,2.9,2.9,3.0,5.0,1.0,30.53,47.51,21.72,22.37,28.98,0.3932,0.1958,0.1145,0.2175,0.1973,-27.4%,69.5%,41.5%,48.1%,59.4%,53.1%,62.5%,40.2%,31.2%,34.6%,25.6%,55.8%,51.0%,37.5%,54.4%,63.3%,60.6%,Liberalism,True,2867,0,Glm4MoeForCausalLM,23.5,0.67,17.9,8.5,0.252,97.0,100.0,0.857,0.393,0.254,1.683,0.202,0.194,37.3,6413.0,168.1,23.63,3.3,4.9
535
+ zerofata/GLM-4.5-Iceblink-106B-A12B (glm-4.5 (no-think)),https://huggingface.co/zerofata/GLM-4.5-Iceblink-106B-A12B,8/27/2025,10/27/2025,glm-4.5 (no-think),12.0,106.0,106.0,True,False,False,35.94,24.02,3.5,2.0,2.6,1.8,2.0,1.5,31.56,42.7,24.14,27.84,26.06,0.6272,0.1374,0.1891,0.1891,0.2493,-15.6%,60.1%,49.4%,48.3%,61.6%,42.7%,55.0%,45.8%,38.5%,43.5%,37.7%,49.2%,51.5%,44.4%,63.5%,60.8%,60.4%,Liberalism,True,441,7,Glm4MoeForCausalLM,47.8,0.79,14.1,6.8,0.305,128.0,81.0,0.891,0.438,0.301,1.5,0.226,0.223,23.1,8509.0,138.1,24.1,2.3,4.3
536
+ zerofata/GLM-4.5-Iceblink-106B-A12B (glm-4.5),https://huggingface.co/zerofata/GLM-4.5-Iceblink-106B-A12B,8/27/2025,10/27/2025,glm-4.5,12.0,106.0,106.0,True,False,False,38.17,28.68,2.9,3.1,3.8,1.5,3.0,0.0,29.91,39.38,23.45,26.92,32.96,0.3529,0.1511,0.3446,0.2297,0.2677,-27.8%,68.1%,46.2%,49.7%,58.7%,47.1%,58.3%,44.0%,32.9%,35.6%,27.1%,52.7%,57.3%,39.0%,54.0%,63.8%,58.3%,Liberalism,True,3440,7,Glm4MoeForCausalLM,42.5,0.74,14.6,7.5,0.318,64.0,94.0,0.897,0.438,0.3,1.524,0.187,0.265,40.6,7902.0,105.3,23.45,2.3,4.0