Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Tanmay Jain's picture
1 1

Tanmay Jain

tanmayyyj
·
  • tanmayyyyj
  • tanmayyyj

AI & ML interests

Computer Vision, NLP, Reinforcement Learning

Recent Activity

updated a Space about 2 months ago
tanmayyyj/reward-fn-inference
published a Space about 2 months ago
tanmayyyj/reward-fn-inference
updated a model about 2 months ago
tanmayyyj/ministral-8b-reward-fn-dpo
View all activity

Organizations

Mistral Hack-a-ton 2026's profile picture

spaces 2

pinned
Paused

Reward Fn Inference

🤖

Mar 1
pinned
Paused

Reward Fn Trainer

🏋

Fine‑tune a language model with DPO using your dataset

Feb 28

models 13

tanmayyyj/ministral-8b-reward-fn-dpo

Updated Feb 28

tanmayyyj/ministral-8b-reward-fn-sft

Updated Feb 28

tanmayyyj/Qwen-3-math-reasoning

Updated Jun 2, 2025

tanmayyyj/dqn-SpaceInvadersNoFrameskip-v4

Reinforcement Learning • Updated Jul 4, 2023 • 16

tanmayyyj/Taxi-v3

Updated Jun 30, 2023

tanmayyyj/Taxi_v3

Reinforcement Learning • Updated Jun 30, 2023

tanmayyyj/q-FrozenLake-v2-4x4-Non_Slippery

Reinforcement Learning • Updated Jun 30, 2023 • 1

tanmayyyj/ppo-PyramidsRND

Reinforcement Learning • Updated Jun 20, 2023

tanmayyyj/ppo-SnowballTarget

Reinforcement Learning • Updated Jun 20, 2023 • 1

tanmayyyj/Cartpole-v1

Reinforcement Learning • Updated Jun 19, 2023
View 13 models

datasets 2

tanmayyyj/reward-fn-dpo

Viewer • Updated Feb 28 • 161 • 14

tanmayyyj/reward-fn-sft

Viewer • Updated Feb 28 • 33 • 13
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs