Exploration and Exploitation Errors Are Measurable for Language Model Agents Paper • 2604.13151 • Published 3 days ago • 22
Exploration and Exploitation Errors Are Measurable for Language Model Agents Paper • 2604.13151 • Published 3 days ago • 22
VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data Paper • 2502.06737 • Published Feb 10, 2025
Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks Paper • 1810.00825 • Published Oct 1, 2018 • 1
TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents Paper • 2602.19633 • Published Feb 23 • 8
Generalized Neural Sorting Networks with Error-Free Differentiable Swap Functions Paper • 2310.07174 • Published Oct 11, 2023 • 1
TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents Paper • 2602.19633 • Published Feb 23 • 8
Generalized Neural Sorting Networks with Error-Free Differentiable Swap Functions Paper • 2310.07174 • Published Oct 11, 2023 • 1