Running Agents 1.5k Big Code Models Leaderboard π 1.5k Explore and submit code model evaluations on a leaderboard
Running on CPU Upgrade 14k Open LLM Leaderboard π 14k Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade 193 LLM Hallucination Leaderboard π 193 View and filter LLM hallucination leaderboard