Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
Tanmay Jain
tanmayyyj
Follow
0 followers
·
1 following
tanmayyyyj
tanmayyyj
AI & ML interests
Computer Vision, NLP, Reinforcement Learning
Recent Activity
updated
a Space
about 2 months ago
tanmayyyj/reward-fn-inference
published
a Space
about 2 months ago
tanmayyyj/reward-fn-inference
updated
a model
about 2 months ago
tanmayyyj/ministral-8b-reward-fn-dpo
View all activity
Organizations
tanmayyyj
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a Space
about 2 months ago
Paused
Reward Fn Inference
🤖
published
a Space
about 2 months ago
Paused
Reward Fn Inference
🤖
updated
a model
about 2 months ago
tanmayyyj/ministral-8b-reward-fn-dpo
Updated
Feb 28
published
a model
about 2 months ago
tanmayyyj/ministral-8b-reward-fn-dpo
Updated
Feb 28
updated
a model
about 2 months ago
tanmayyyj/ministral-8b-reward-fn-sft
Updated
Feb 28
published
a model
about 2 months ago
tanmayyyj/ministral-8b-reward-fn-sft
Updated
Feb 28
updated
a Space
about 2 months ago
Paused
Reward Fn Trainer
🏋
Fine‑tune a language model with DPO using your dataset
published
a Space
about 2 months ago
Paused
Reward Fn Trainer
🏋
Fine‑tune a language model with DPO using your dataset
updated
a dataset
about 2 months ago
tanmayyyj/reward-fn-dpo
Viewer
•
Updated
Feb 28
•
161
•
14
published
a dataset
about 2 months ago
tanmayyyj/reward-fn-dpo
Viewer
•
Updated
Feb 28
•
161
•
14
updated
a dataset
about 2 months ago
tanmayyyj/reward-fn-sft
Viewer
•
Updated
Feb 28
•
33
•
13
published
a dataset
about 2 months ago
tanmayyyj/reward-fn-sft
Viewer
•
Updated
Feb 28
•
33
•
13
upvoted
an
article
10 months ago
view article
Article
The Annotated Diffusion Model
Jun 7, 2022
•
340
updated
a model
11 months ago
tanmayyyj/Qwen-3-math-reasoning
Updated
Jun 2, 2025
published
a model
11 months ago
tanmayyyj/Qwen-3-math-reasoning
Updated
Jun 2, 2025
updated
a model
almost 3 years ago
tanmayyyj/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Jul 4, 2023
•
16
liked
a model
almost 3 years ago
tanmayyyj/q-FrozenLake-v2-4x4-Non_Slippery
Reinforcement Learning
•
Updated
Jun 30, 2023
•
1
updated
3 models
almost 3 years ago
tanmayyyj/Taxi-v3
Updated
Jun 30, 2023
tanmayyyj/Taxi_v3
Reinforcement Learning
•
Updated
Jun 30, 2023
tanmayyyj/q-FrozenLake-v2-4x4-Non_Slippery
Reinforcement Learning
•
Updated
Jun 30, 2023
•
1
Load more