Running on CPU Upgrade Agents 76 AIR-Bench Leaderboard π₯ 76 Explore and compare QA and long doc benchmarks
Running on CPU Upgrade Agents 124 Open Chinese LLM Leaderboard π 124 Explore LLM benchmark scores and submit your model
Running on CPU Upgrade 13.9k Open LLM Leaderboard π 13.9k Track, rank and evaluate open LLMs and chatbots