WebArbiter - Datasets (Collection): Benchmark, training data, and search trajectories for WebArbiter. ICLR 2026. • 3 items • Updated 6 days ago
WebArbiter - Models (Collection): WebArbiter process reward models for web agents. Reasoning distillation + RL. ICLR 2026. • 4 items • Updated 6 days ago
WebArbiter (Collection): Reasoning Process Reward Model for Web Agents. Models, data, and WebPRMBench. ICLR 2026. • 8 items • Updated 6 days ago