Find AI research papers with a simple query
Forecast evaluation benchmark
TabArena
Leaderboard for the Mechanistic Interpretability Benchmark
Browse and compare visual document retrieval model scores
GIFT-Eval: A Benchmark for General Time Series Forecasting
Chat with an AI assistant and see its thought process