Yuan Wang's picture

Yuan Wang

traveler2333

·

https://github.com/traveler2333

travler2333

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 6 months ago

Data-Efficient RLVR via Off-Policy Influence Guidance

Paper • 2510.26491 • Published Oct 30, 2025 • 11

upvoted a paper 10 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 254