Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
2
4
zhangwentao
zhangwt97
Follow
Dangeroux's profile picture
charlescowan's profile picture
2 followers
·
4 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 12 hours ago
Missing Old Logits in Asynchronous Agentic RL: Semantic Mismatch and Repair Methods for Off-Policy Correction
liked
a model
about 1 year ago
ibm-granite/granite-timeseries-ttm-r1
published
a model
about 1 year ago
zhangwt97/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
View all activity
Organizations
zhangwt97
's models
2
Sort: Recently updated
zhangwt97/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
Feb 25, 2025
zhangwt97/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
Feb 25, 2025
•
10