xushaoyang's picture

xushaoyang

beiweixiaoxu

·

https://shaoyangxu.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

upvoted a paper 3 days ago

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

updated a model 19 days ago

beiweixiaoxu/For_YuanGe

View all activity

Organizations

upvoted a paper 2 days ago

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published 6 days ago • 135

upvoted a paper 3 days ago

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Paper • 2604.12627 • Published 5 days ago • 96

upvoted 2 papers 2 months ago

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published Feb 4 • 79

Thinking with Comics: Enhancing Multimodal Reasoning through Structured Visual Storytelling

Paper • 2602.02453 • Published Feb 2 • 36

upvoted a paper 6 months ago

PEAR: Phase Entropy Aware Reward for Efficient Reasoning

Paper • 2510.08026 • Published Oct 9, 2025 • 9