Vaidehi Patil's picture

Vaidehi Patil

vaidehi99

·

https://vaidehi99.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 21 hours ago

Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind

submitted a paper 1 day ago

Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind

authored a paper 7 months ago

Generalized Correctness Models: Learning Calibrated and Model-Agnostic Correctness Predictors from Historical Patterns

View all activity

Organizations

None yet

upvoted a paper about 21 hours ago

Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind

Paper • 2604.11666 • Published 3 days ago • 3

upvoted a collection about 2 years ago

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated Dec 18, 2025 • 120