93 894 1

Yury Panikov

panikov

panikov

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Training a Student Expert via Semi-Supervised Foundation Model Distillation

upvoted a paper 3 days ago

ViVa: A Video-Generative Value Model for Robot Reinforcement Learning

upvoted a paper 3 days ago

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

View all activity

Organizations

None yet

upvoted 4 papers 3 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 6 days ago • 306

upvoted 16 papers 5 days ago

CUE-R: Beyond the Final Answer in Retrieval-Augmented Generation

Paper • 2604.05467 • Published 7 days ago • 7

Can Natural Image Autoencoders Compactly Tokenize fMRI Volumes for Long-Range Dynamics Modeling?

Paper • 2604.03619 • Published 10 days ago • 7

Watch Before You Answer: Learning from Visually Grounded Post-Training

Paper • 2604.05117 • Published 8 days ago • 35

Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision

Paper • 2604.04934 • Published 8 days ago • 42

Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning

Paper • 2604.05404 • Published 7 days ago • 41

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

Paper • 2604.01591 • Published 12 days ago • 39

GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers

Paper • 2604.02648 • Published 11 days ago • 45

ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation

Paper • 2604.03922 • Published 9 days ago • 53

Learning to Retrieve from Agent Trajectories

Paper • 2604.04949 • Published 15 days ago • 68

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Paper • 2604.06132 • Published 7 days ago • 113

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 8 days ago • 231

BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs

Paper • 2604.02045 • Published 12 days ago • 33

Mimic Intent, Not Just Trajectories

Paper • 2602.08602 • Published 17 days ago • 14

Synthetic Sandbox for Training Machine Learning Engineering Agents

Paper • 2604.04872 • Published 8 days ago • 14

Scaling Teams or Scaling Time? Memory Enabled Lifelong Learning in LLM Multi-Agent Systems

Paper • 2604.03295 • Published 18 days ago • 10

Type-Checked Compliance: Deterministic Guardrails for Agentic Financial Systems Using Lean 4 Theorem Proving

Paper • 2604.01483 • Published 13 days ago • 7

Yury Panikov

AI & ML interests

Recent Activity

Organizations

panikov's activity