Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Junkang Wu's picture
2 6

Junkang Wu

junkang0909
guoqingyu2004's profile picture 00ffcc's profile picture Rosykunai's profile picture
·
https://junkangwu.github.io/

AI & ML interests

LLM alignment

Recent Activity

upvoted a paper 20 days ago
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation
upvoted a paper 7 months ago
EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning
authored a paper 7 months ago
Aligning Multimodal LLM with Human Preference: A Survey
View all activity

Organizations

None yet

commented a paper 7 months ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 120 •
2
commented a paper about 1 year ago

RePO: ReLU-based Preference Optimization

Paper • 2503.07426 • Published Mar 10, 2025 • 2 •
2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs