Commit History
hide preview models from leaderboard 8e7b89a
Small fix to header 5a3d732
Added asterisk for closed models 472c111
update paper link 4a49ee3
Update leaderboard/md.py bfbc587 verified
Add new released models to frozen v1 scores f0c8dc0
ignore open_instruct_dev models 33298d1
rename column names a83b29d
switch results repo 3a984c4
fix links dd9384e
update dataset name and citation 640136e
Update app.py 866d755 verified
clean looks 1089af2
download button and style 75fed94
Merge branch 'main' of https://huggingface.co/spaces/allenai/reward-bench-v2 9a9d913
updates 51d7804
fix average and column names bc5408b
fix average and column names 62dcae0
fixes 74240b0
works ish c259566
init v1 port f460af4
hard reset repo 96e55d5
sorry this git history is v messy 88c98d4
attempting to widen model column 6a914d5
attempting to widen model column 92d9a0a
attempting to widen model column df8bd5a
attempting to widen model column f6ea81c
attempting to widen model column 94cbe00
updated domain names 16de828
updated domain names 72159f4
domains 5f95304
domains 6a82c0e
add domains bdc6e92
reweight domains a283682
updated domains fbec7a6
revert 3fec560
widen display for longer names 35668fc
widen display for longer names 8312b6e
widen display for longer names 18e30c5
widen display for longer names f09a224
remove ties from score 5b8ed68
shape of weights 30457c5
added 'ties' subset 8dee269
fix link 49a3cce
links go to v2 19f0320
rename split to 'test' 5883014
fix 64bb23a
root committed on
fix ea8a0be
root committed on
fix ad6389f
root committed on