MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome • Paper • 2603.28407 • Published 16 days ago • 68
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification • Paper • 2603.15726 • Published 30 days ago • 185
LeanDojo • Collection • Machine learning for theorem proving in Lean: https://leandojo.org/ • 10 items • Updated Jul 23, 2024 • 2