Spaces:
Running
Running
Create my ai of all
#88 opened about 23 hours ago
by
fadishk1122
New Assessments needed to tackle real world problems for Multi models
#87 opened about 1 month ago
by
balajim27
add a button to download the results in the tables
π 4
1
#86 opened 5 months ago
by
HUYSOSAT
Files not updated since August
π 1
2
#85 opened 6 months ago
by
Rachel0619
request model evaluation please?
1
#84 opened 7 months ago
by
legolasyiu
Files have not updated since September 3
π 1
1
#83 opened 7 months ago
by
MrLittleTexas
Files haven't been updated since Aug 4
βπ 6
10
#80 opened 8 months ago
by
maaxxxcal
Chatbot Arena Leaderboard Runtime Error
π 2
1
#78 opened 9 months ago
by
minpyaemoe
Add filters for 'Unlimited Free Access' and 'No Geo-Restrictions.'
π 1
#74 opened 12 months ago
by
wqqedfh
Let people vote on existing responses?
#73 opened about 1 year ago
by
endolith
Latest raw mt-bench results available
#72 opened about 1 year ago
by
lucweber
Cameroun
1
#69 opened over 1 year ago
by
EtCeterAi
Add Ovis-1.6 to Chatbot arena ?
#68 opened over 1 year ago
by
xxyyy123
I tried to plot AGI on the same Elo scale by comparing to "both bad" and "tie" votes
#67 opened over 1 year ago
by
endolith
Please add InternLM2.5-20B-Chat and InternLM2.5-7B-Chat to Leaderboard
#61 opened over 1 year ago
by
vansin
Upload leaderboard_table_20240716.csv
#50 opened over 1 year ago
by
connorchenn
Chatbot Arena: Classify requests/votes - ELO per category
#40 opened almost 2 years ago
by
NeuralByte
How am I supposed to search models by name when there's live scroll?
π 2
#38 opened almost 2 years ago
by
seedmanc
Number of parameters of the model and release date
1
#32 opened almost 2 years ago
by
oovm
Is the leaderboard space deprecated then?
π€― 2
#31 opened almost 2 years ago
by
zhiminy
Is the notebook version-controlled anywhere?
1
#30 opened about 2 years ago
by
endolith
Support benchmark for Long Context Recall abilities
#29 opened about 2 years ago
by
Nekochu
Is it fair to have web browsing allowed
πβ 5
1
#24 opened about 2 years ago
by
gearunclear
Dataset Update
βπ 6
1
#23 opened about 2 years ago
by
matthiaslau
Request: add two new models
π€ 2
2
#21 opened about 2 years ago
by
rombodawg
Removing LLM version clutter from the leaderboard ?
π 1
2
#20 opened about 2 years ago
by
zarglu
Re-evaluate GPT-4 ! Add a ELO-graph over time to the leaderboard
π 3
8
#19 opened about 2 years ago
by
cmp-nct
[enhancement] unaligned ranking column between leaderboards
#17 opened over 2 years ago
by
zhiminy
Is there any way to download the leaderboard as csv or json format?
π 1
7
#13 opened over 2 years ago
by
zhiminy
How does GPT-4 Turbo do so well?
π 2
10
#10 opened over 2 years ago
by
endolith
Human level representation?
π 2
5
#8 opened over 2 years ago
by
ehalit
Add quantized local models?
β€οΈ 1
#7 opened over 2 years ago
by
endolith
Synthetic evaluation hypothesis
1
#6 opened over 2 years ago
by
DmitriSS
You should add nous capybara 34b
π 1
#5 opened over 2 years ago
by
distantquant