Tathagata Debnath

tathadn

AI & ML interests

LLM fine-tuning (DPO/RLHF with LoRA on multi-GPU H100s), agentic AI systems, Monte Carlo Tree Search for code generation, multi-agent orchestration, retrieval-augmented generation, and multimodal vision-language models. Built CodeQ, an MCTS + DPO self-improving code debugging agent that achieves an 84% fix rate on DebugBench using Qwen2.5-Coder-7B-Instruct. Currently working on VisionTriage (QLoRA fine-tuning of Qwen2.5-VL-7B-Instruct for visual bug triage). PhD candidate at NMSU. Published in IEEE TPAMI and IEEE/ACM TCBB (81+ citations). Author of two CRAN R packages.

Recent Activity

updated a dataset 4 days ago

tathadn/codeq-debugbench-dpo-pairs

updated a model 4 days ago

tathadn/codeq-qwen2.5-coder-7b-dpo-r2

published a dataset 4 days ago

tathadn/codeq-debugbench-dpo-pairs

models 1

tathadn/codeq-qwen2.5-coder-7b-dpo-r2

Text Generation • Updated 4 days ago • 12

datasets 1

tathadn/codeq-debugbench-dpo-pairs

Viewer • Updated 4 days ago • 4.91k • 22