Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
Tanmay Jain
tanmayyyj
Follow
0 followers
·
1 following
tanmayyyyj
tanmayyyj
AI & ML interests
Computer Vision, NLP, Reinforcement Learning
Recent Activity
updated
a Space
about 2 months ago
tanmayyyj/reward-fn-inference
published
a Space
about 2 months ago
tanmayyyj/reward-fn-inference
updated
a model
about 2 months ago
tanmayyyj/ministral-8b-reward-fn-dpo
View all activity
Organizations
spaces
2
Sort: Recently updated
pinned
Paused
Reward Fn Inference
🤖
pinned
Paused
Reward Fn Trainer
🏋
Fine‑tune a language model with DPO using your dataset
models
13
Sort: Recently updated
tanmayyyj/ministral-8b-reward-fn-dpo
Updated
Feb 28
tanmayyyj/ministral-8b-reward-fn-sft
Updated
Feb 28
tanmayyyj/Qwen-3-math-reasoning
Updated
Jun 2, 2025
tanmayyyj/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Jul 4, 2023
•
16
tanmayyyj/Taxi-v3
Updated
Jun 30, 2023
tanmayyyj/Taxi_v3
Reinforcement Learning
•
Updated
Jun 30, 2023
tanmayyyj/q-FrozenLake-v2-4x4-Non_Slippery
Reinforcement Learning
•
Updated
Jun 30, 2023
•
1
tanmayyyj/ppo-PyramidsRND
Reinforcement Learning
•
Updated
Jun 20, 2023
tanmayyyj/ppo-SnowballTarget
Reinforcement Learning
•
Updated
Jun 20, 2023
•
1
tanmayyyj/Cartpole-v1
Reinforcement Learning
•
Updated
Jun 19, 2023
View 13 models
datasets
2
Sort: Recently updated
tanmayyyj/reward-fn-dpo
Viewer
•
Updated
Feb 28
•
161
•
14
tanmayyyj/reward-fn-sft
Viewer
•
Updated
Feb 28
•
33
•
13