What will happen if we train a Q function for digital agents?
HAO BAI
JackBAI
AI & ML interests
Representation learning, language models.
Recent Activity
upvoted a paper about 2 months ago
GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL liked a dataset about 2 months ago
microsoft/webgym_tasks updated a dataset 2 months ago
JackBAI/jack-latest-vllm-stack