Mehul Damani's picture

Mehul Damani PRO

mehuldamani

·

https://damanimehul.github.io

AI & ML interests

Reinforcement Learning, Large Language Models

Recent Activity

updated a model about 11 hours ago

mehuldamani/bug_fixing_sft-v1

published a model about 11 hours ago

mehuldamani/bug_fixing_sft-v1

updated a model 1 day ago

mehuldamani/code_gen_arl-ast-addmultiply-7b-v1

View all activity

Organizations

None yet

upvoted a paper 21 days ago

Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

Paper • 2603.24844 • Published 23 days ago • 10

upvoted a collection 9 months ago

RLCR

Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty • 10 items • Updated Aug 6, 2025 • 7

upvoted a paper 9 months ago

Beyond Binary Rewards: Training LMs to Reason About Their Uncertainty

Paper • 2507.16806 • Published Jul 22, 2025 • 7