AI & ML interests

Length-aware reinforcement learning fine-tuning, reasoning models, efficient inference, post-training, controllable generation, LLM alignment.

Recent Activity