Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
3
Minjae Oh
Riasok
Follow
Riasok
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States
upvoted
a
paper
1 day ago
KL for a KL: On-Policy Distillation with Control Variate Baseline
upvoted
a
paper
21 days ago
In-N-Out: A Parameter-Level API Graph Dataset for Tool Agents
View all activity
Organizations
None yet
Riasok
's models
33
Sort: Recently updated
Riasok/qwen-GSM8K-fpw-0.5-5e-6
Updated
Jul 1, 2025
•
1
Riasok/qwen-GSM8K-dponll-1e-6
Updated
Jul 1, 2025
•
2
Riasok/qwen-GSM8K-dpo-1e-6
Updated
Jul 1, 2025
•
1
Previous
1
2
Next