Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
YY's picture
4 4

YY

yy0514
HarryMayne's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a collection about 2 months ago
Search-R1-v0.3
upvoted a paper 6 months ago
Agentic Reinforcement Learning for Search is Unsafe
commentedon a paper 6 months ago
Agentic Reinforcement Learning for Search is Unsafe
View all activity

Organizations

None yet

upvoted a collection about 2 months ago

Search-R1-v0.3

Collection
RL with outcome reward + format reward. https://arxiv.org/abs/2505.15117 • 12 items • Updated Aug 12, 2025 • 4
upvoted a paper 6 months ago

Agentic Reinforcement Learning for Search is Unsafe

Paper • 2510.17431 • Published Oct 20, 2025 • 5
upvoted a paper 12 months ago

Clinical knowledge in LLMs does not translate to human interactions

Paper • 2504.18919 • Published Apr 26, 2025 • 26
upvoted a paper over 1 year ago

Ablation is Not Enough to Emulate DPO: How Neuron Dynamics Drive Toxicity Reduction

Paper • 2411.06424 • Published Nov 10, 2024 • 5
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs