Zujie Liang's picture

5

Zujie Liang

jokieleung

·

https://jokieleung.github.io/

AI & ML interests

LLM/VLM Agents, reasoning

Recent Activity

upvoted a paper about 1 month ago

MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs

upvoted a paper 3 months ago

Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning

upvoted a paper 6 months ago

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

View all activity

Organizations

upvoted a paper about 1 month ago

MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs

Paper • 2602.12705 • Published Feb 13 • 68

upvoted a paper 3 months ago

Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning

Paper • 2512.24265 • Published Dec 30, 2025 • 4

upvoted a paper 6 months ago

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3, 2025 • 99

upvoted 2 papers 7 months ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 137

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11, 2025 • 47