Yuseung "Phillip" Lee

phillipinseoul

https://phillipinseoul.github.io/

AI & ML interests

Computer Vision

Recent Activity

upvoted 3 papers 3 days ago

OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence

Paper • 2604.07296 • Published 5 days ago • 31

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 5 days ago • 276

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published 5 days ago • 151

upvoted a paper 4 days ago

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published 6 days ago • 55

upvoted 3 papers 5 days ago

Action Images: End-to-End Policy Learning via Multiview Video Generation

Paper • 2604.06168 • Published 6 days ago • 12

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

Paper • 2604.04323 • Published 7 days ago • 37

Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision

Paper • 2604.04934 • Published 7 days ago • 40

liked a Space 6 days ago

StyleRenderer

🎨

Generate stylized video from game G‑buffer inputs

upvoted a paper 7 days ago

Token Warping Helps MLLMs Look from Nearby Viewpoints

Paper • 2604.02870 • Published 10 days ago • 32

submitted a paper to Daily Papers 7 days ago

Token Warping Helps MLLMs Look from Nearby Viewpoints

Paper • 2604.02870 • Published 10 days ago • 32

upvoted 2 papers 7 days ago

A Simple Baseline for Streaming Video Understanding

Paper • 2604.02317 • Published 11 days ago • 71

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Paper • 2604.02029 • Published 11 days ago • 137

liked 2 datasets 9 days ago

ellisbrown/SIMS-VSI

Viewer • Updated Nov 7, 2025 • 242k • 171 • 7

rbler/MMSI-Video-Bench

Updated Feb 10 • 109 • 5

liked a dataset 10 days ago

bigai/SceneVersepp

Updated 10 days ago • 429 • 3

upvoted 2 papers 11 days ago

Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding

Paper • 2604.00528 • Published 12 days ago • 12

PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning

Paper • 2603.26653 • Published 16 days ago • 18

liked 2 datasets 11 days ago

nvidia/DLC-Bench

Viewer • Updated Apr 24, 2025 • 77 • 89 • 7

Journey9ni/vstibench

Viewer • Updated May 14, 2025 • 6.04k • 159 • 3

upvoted a paper 12 days ago

MolmoPoint: Better Pointing for VLMs with Grounding Tokens

Paper • 2603.28069 • Published 14 days ago • 8
