Running Agents 1.5k Big Code Models Leaderboard 📈 1.5k Explore and submit code model evaluations on a leaderboard
Sleeping Agents 2 LLMRiddles-ChatGLM-EN 🚀 2 Play an interactive game with a language model by asking specific questions