ClawArena: Benchmarking AI Agents in Evolving Information Environments Paper • 2604.04202 • Published 8 days ago • 36
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published Mar 12 • 65
Can LLM Agents Be CFOs? A Benchmark for Resource Allocation in Dynamic Enterprise Environments Paper • 2603.23638 • Published 19 days ago • 11
TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning Paper • 2603.12529 • Published about 1 month ago • 19
qualifire-oss/mcp-tool-use-quality-ranger-0.6b-GGUF Text Generation • 0.6B • Updated Sep 15, 2025 • 27 • 8
TeichAI/gemma-4-31B-it-Claude-Opus-Distill Image-Text-to-Text • 31B • Updated about 21 hours ago • 6.31k • 31
On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models Paper • 2603.27481 • Published 15 days ago • 35