1 1

Tanmay Jain

tanmayyyj

AI & ML interests

Computer Vision, NLP, Reinforcement Learning

Recent Activity

updated a Space about 2 months ago

tanmayyyj/reward-fn-inference

published a Space about 2 months ago

tanmayyyj/reward-fn-inference

updated a model about 2 months ago

tanmayyyj/ministral-8b-reward-fn-dpo

View all activity

Organizations

updated a Space about 2 months ago

Reward Fn Inference

🤖

published a Space about 2 months ago

Reward Fn Inference

🤖

updated a model about 2 months ago

tanmayyyj/ministral-8b-reward-fn-dpo

Updated Feb 28

published a model about 2 months ago

tanmayyyj/ministral-8b-reward-fn-dpo

Updated Feb 28

updated a model about 2 months ago

tanmayyyj/ministral-8b-reward-fn-sft

Updated Feb 28

published a model about 2 months ago

tanmayyyj/ministral-8b-reward-fn-sft

Updated Feb 28

updated a Space about 2 months ago

Reward Fn Trainer

🏋

Fine‑tune a language model with DPO using your dataset

published a Space about 2 months ago

Reward Fn Trainer

🏋

Fine‑tune a language model with DPO using your dataset

updated a dataset about 2 months ago

tanmayyyj/reward-fn-dpo

Viewer • Updated Feb 28 • 161 • 14

published a dataset about 2 months ago

tanmayyyj/reward-fn-dpo

Viewer • Updated Feb 28 • 161 • 14

updated a dataset about 2 months ago

tanmayyyj/reward-fn-sft

Viewer • Updated Feb 28 • 33 • 13

published a dataset about 2 months ago

tanmayyyj/reward-fn-sft

Viewer • Updated Feb 28 • 33 • 13

upvoted an article 10 months ago

Article

The Annotated Diffusion Model

Jun 7, 2022

•

340

updated a model 11 months ago

tanmayyyj/Qwen-3-math-reasoning

Updated Jun 2, 2025

published a model 11 months ago

tanmayyyj/Qwen-3-math-reasoning

Updated Jun 2, 2025

updated a model almost 3 years ago

tanmayyyj/dqn-SpaceInvadersNoFrameskip-v4

Reinforcement Learning • Updated Jul 4, 2023 • 16

liked a model almost 3 years ago

tanmayyyj/q-FrozenLake-v2-4x4-Non_Slippery

Reinforcement Learning • Updated Jun 30, 2023 • 1

updated 3 models almost 3 years ago

Tanmay Jain

AI & ML interests

Recent Activity

Organizations

tanmayyyj's activity

Reward Fn Inference

Reward Fn Inference

Reward Fn Trainer

Reward Fn Trainer

The Annotated Diffusion Model