University of Wisconsin - Madison

university

Verified

https://www.wisc.edu

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

jungtaekkim authored a paper 2 days ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

jongwon-jeong authored a paper 3 days ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

jpark677 submitted a paper 3 days ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

View all activity

Papers

Exploration and Exploitation Errors Are Measurable for Language Model Agents

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

View all Papers

authored a paper 2 days ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Paper • 2604.13151 • Published 5 days ago • 24

authored a paper 3 days ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Paper • 2604.13151 • Published 5 days ago • 24

submitted a paper to Daily Papers 3 days ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Paper • 2604.13151 • Published 5 days ago • 24

submitted a paper to Daily Papers 23 days ago

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

Paper • 2603.24755 • Published 24 days ago • 30

updated a model 25 days ago

uw-madison/mnemodyn

Updated 25 days ago • 1

submitted a paper to Daily Papers 26 days ago

Understanding Behavior Cloning with Action Quantization

Paper • 2603.20538 • Published 29 days ago • 2

published a model 26 days ago

uw-madison/mnemodyn

Updated 25 days ago • 1

submitted a paper to Daily Papers 26 days ago

Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States

Paper • 2603.19987 • Published 30 days ago • 9

submitted a paper to Daily Papers about 1 month ago

RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning

Paper • 2603.09160 • Published Mar 10 • 15

authored 4 papers about 2 months ago

How to Move Your Dragon: Text-to-Motion Synthesis for Large-Vocabulary Objects

Paper • 2503.04257 • Published Mar 6, 2025 • 2

T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models

Paper • 2504.04718 • Published Apr 7, 2025 • 43

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23, 2025 • 81

How to Correctly Report LLM-as-a-Judge Evaluations

Paper • 2511.21140 • Published Nov 26, 2025 • 2

authored 2 papers about 2 months ago

VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data

Paper • 2502.06737 • Published Feb 10, 2025

Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks

Paper • 1810.00825 • Published Oct 1, 2018 • 1

authored a paper about 2 months ago

CHAMMI-75: pre-training multi-channel models with heterogeneous microscopy images

Paper • 2512.20833 • Published Dec 23, 2025

authored a paper about 2 months ago

TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents

Paper • 2602.19633 • Published Feb 23 • 8

submitted a paper to Daily Papers about 2 months ago

TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents

Paper • 2602.19633 • Published Feb 23 • 8

authored a paper about 2 months ago

TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents

Paper • 2602.19633 • Published Feb 23 • 8

submitted a paper to Daily Papers about 2 months ago

SkillOrchestra: Learning to Route Agents via Skill Transfer

Paper • 2602.19672 • Published Feb 23 • 57