1 5 4

xushaoyang

beiweixiaoxu

https://shaoyangxu.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

upvoted a paper 1 day ago

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

updated a model 18 days ago

beiweixiaoxu/For_YuanGe

View all activity

Organizations

upvoted 2 papers 1 day ago

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published 4 days ago • 132

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Paper • 2604.12627 • Published 3 days ago • 94

updated a model 18 days ago

beiweixiaoxu/For_YuanGe

15B • Updated 18 days ago • 13

updated a model 19 days ago

beiweixiaoxu/For_YuanGe_25

15B • Updated 19 days ago • 13

published a model 19 days ago

beiweixiaoxu/For_YuanGe_25

15B • Updated 19 days ago • 13

updated a model 19 days ago

beiweixiaoxu/For_YuanGe_10

15B • Updated 19 days ago • 13

published a model 19 days ago

beiweixiaoxu/For_YuanGe_10

15B • Updated 19 days ago • 13

published a model 21 days ago

beiweixiaoxu/For_YuanGe

15B • Updated 18 days ago • 13

liked a dataset 28 days ago

iNLP-Lab/Moltbook-MoltNet

Viewer • Updated about 1 month ago • 5.42M • 63 • 3

upvoted 2 papers 2 months ago

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published Feb 4 • 79

Thinking with Comics: Enhancing Multimodal Reasoning through Structured Visual Storytelling

Paper • 2602.02453 • Published Feb 2 • 36

submitted a paper to Daily Papers 3 months ago

Language of Thought Shapes Output Diversity in Large Language Models

Paper • 2601.11227 • Published Jan 16 • 9

authored 3 papers 3 months ago

upvoted a paper 6 months ago

PEAR: Phase Entropy Aware Reward for Efficient Reasoning

Paper • 2510.08026 • Published Oct 9, 2025 • 9

updated a model 10 months ago

beiweixiaoxu/CultureSPA

8B • Updated Jun 27, 2025 • 66

published a model 10 months ago

beiweixiaoxu/CultureSPA

8B • Updated Jun 27, 2025 • 66

liked 2 models over 1 year ago

TJUNLP/FuxiTranyu-8B

Text Generation • 8B • Updated Aug 19, 2024 • 77 • 6

TJUNLP/FuxiTranyu-8B-DPO

Text Generation • 8B • Updated Oct 24, 2024 • 3 • 1

xushaoyang

AI & ML interests

Recent Activity

Organizations

beiweixiaoxu's activity