Running on CPU Upgrade • Agents • 599 • GAIA Leaderboard • Submit your model answers to the GAIA benchmark and view the leaderboard
Running on CPU Upgrade • 193 • LLM Hallucination Leaderboard • View and filter the LLM hallucination leaderboard
Jofthomas/hermes-function-calling-thinking-V1 Viewer • Updated Feb 16, 2025 • 3.57k • 569 • 74
Running • 3.79k • The Ultra-Scale Playbook • The ultimate guide to training LLMs on large GPU clusters
Running • Agents • 77 • AI Energy Score Leaderboard • Explore AI energy efficiency across various tasks
meta-llama/Llama-3.3-70B-Instruct Text Generation • 71B • Updated Dec 21, 2024 • 468k • 2.7k
Running • Agents • 110 • Judge Arena • View and compare open-source AI model rankings with ELO scores
Running • Featured • 588 • Image Arena Leaderboard • Image Generation and Image Editing Arena & Leaderboard
meta-llama/Llama-3.1-8B-Instruct Text Generation • 8B • Updated Sep 25, 2024 • 9.55M • 5.7k
meta-llama/Llama-3.1-405B-Instruct Text Generation • 406B • Updated Sep 25, 2024 • 227k • 595
meta-llama/Llama-3.1-70B-Instruct Text Generation • 71B • Updated Dec 15, 2024 • 873k • 907