Fahim Tajwar

ftajwar

https://tajwarfahim.github.io/

AI & ML interests

LLMs, RLHF

Recent Activity

updated a dataset about 15 hours ago

ftajwar/knights_and_knaves_fraction_reward

published a dataset about 15 hours ago

ftajwar/knights_and_knaves_fraction_reward

updated a dataset about 15 hours ago

ftajwar/knights_and_knaves

View all activity

Organizations

updated a dataset about 15 hours ago

ftajwar/knights_and_knaves_fraction_reward

Updated about 15 hours ago

published a dataset about 15 hours ago

ftajwar/knights_and_knaves_fraction_reward

Updated about 15 hours ago

updated a dataset about 15 hours ago

ftajwar/knights_and_knaves

Updated about 15 hours ago

published a dataset about 15 hours ago

ftajwar/knights_and_knaves

Updated about 15 hours ago

updated a collection about 2 months ago

MaxRL

Collection

Qwen3-Base post-trained checkpoints for our paper, Maximum Likelihood Reinforcement Learning [https://zanette-labs.github.io/MaxRL/] • 4 items • Updated Feb 26 • 2

updated a model about 2 months ago

ftajwar/qwen3_4B_Base_MaxRL_Polaris_1000_steps

Text Generation • 4B • Updated Feb 26 • 15

published a model about 2 months ago

ftajwar/qwen3_4B_Base_MaxRL_Polaris_1000_steps

Text Generation • 4B • Updated Feb 26 • 15

updated a model about 2 months ago

ftajwar/qwen3_1.7B_Base_MaxRL_Polaris_1000_steps

Text Generation • 2B • Updated Feb 26 • 5

published a model about 2 months ago

ftajwar/qwen3_1.7B_Base_MaxRL_Polaris_1000_steps

Text Generation • 2B • Updated Feb 26 • 5

updated a model about 2 months ago

ftajwar/qwen3_4B_Base_GRPO_Polaris_1000_steps

Text Generation • 4B • Updated Feb 26 • 8

published a model about 2 months ago

ftajwar/qwen3_4B_Base_GRPO_Polaris_1000_steps

Text Generation • 4B • Updated Feb 26 • 8

updated a model about 2 months ago

ftajwar/qwen3_1.7B_Base_GRPO_Polaris_1000_steps

Text Generation • 2B • Updated Feb 26 • 8

published a model about 2 months ago

ftajwar/qwen3_1.7B_Base_GRPO_Polaris_1000_steps

Text Generation • 2B • Updated Feb 26 • 8

updated a model 3 months ago

guanning-ai/SmolLM-Checkpoints-Final-0124

Updated Jan 23

published a model 3 months ago

guanning-ai/SmolLM-Checkpoints-Final-0124

Updated Jan 23

updated a model 3 months ago

guanning-ai/SmolLM-Checkpoints-Final-0123

Updated Jan 23

published a model 3 months ago

guanning-ai/SmolLM-Checkpoints-Final-0123

Updated Jan 23

Fahim Tajwar

AI & ML interests

Recent Activity

Organizations

ftajwar's activity