Running on CPU Upgrade Agents 60 Open CoT Leaderboard ๐ฅ 60 Track, rank and evaluate open LLMs' CoT quality
Runtime error Agents 9 Open CoT Dashboard ๐ 9 Visualize model performance with interactive plots and tables