Open Agent Leaderboard An open benchmark for comparing full agent systems across diverse real-world tasks. Reports both quality and cost. Running Open Agent Leaderboard 🤖 Explore AI agent performance and cost rankings Running The Open Agent Leaderboard 📊 Define comprehensive reports for AI agent evaluations open-agent-leaderboard/results Benchmark • Updated 16 days ago • 90 • 26 • 1 open-agent-leaderboard/agent-cards Updated 16 days ago • 14
Open Agent Leaderboard An open benchmark for comparing full agent systems across diverse real-world tasks. Reports both quality and cost. Running Open Agent Leaderboard 🤖 Explore AI agent performance and cost rankings Running The Open Agent Leaderboard 📊 Define comprehensive reports for AI agent evaluations open-agent-leaderboard/results Benchmark • Updated 16 days ago • 90 • 26 • 1 open-agent-leaderboard/agent-cards Updated 16 days ago • 14