2 17 14

Gyuseong Lee

gses82

https://gseonglee.github.io

AI & ML interests

Diffusion Generative Models

Recent Activity

upvoted a paper 20 days ago

DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models

upvoted a paper 20 days ago

Repurposing Geometric Foundation Models for Multi-view Diffusion

upvoted a paper 27 days ago

WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation

View all activity

Organizations

upvoted 2 papers 20 days ago

DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models

Paper • 2603.23499 • Published 21 days ago • 51

Repurposing Geometric Foundation Models for Multi-view Diffusion

Paper • 2603.22275 • Published 22 days ago • 47

upvoted a paper 27 days ago

WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation

Paper • 2603.16871 • Published 28 days ago • 60

upvoted a paper 28 days ago

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published 29 days ago • 153

upvoted 5 papers 6 months ago

Exploring Conditions for Diffusion models in Robotic Control

Paper • 2510.15510 • Published Oct 17, 2025 • 40

Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation

Paper • 2510.23581 • Published Oct 27, 2025 • 42

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7, 2025 • 146

TAG:Tangential Amplifying Guidance for Hallucination-Resistant Diffusion Sampling

Paper • 2510.04533 • Published Oct 6, 2025 • 48

MATRIX: Mask Track Alignment for Interaction-aware Video Generation

Paper • 2510.07310 • Published Oct 8, 2025 • 36

liked a model 7 months ago

facebook/map-anything

Image-to-3D • 1B • Updated Jan 20 • 65.1k • 89

upvoted a paper 7 months ago

Visual Representation Alignment for Multimodal Large Language Models

Paper • 2509.07979 • Published Sep 9, 2025 • 84

upvoted 3 papers 10 months ago

Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation

Paper • 2506.11924 • Published Jun 13, 2025 • 35

Fine-Grained Perturbation Guidance via Attention Head Selection

Paper • 2506.10978 • Published Jun 12, 2025 • 25

Text-Aware Image Restoration with Diffusion Models

Paper • 2506.09993 • Published Jun 11, 2025 • 45

liked a model 12 months ago

naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B

Text Generation • Updated Sep 16, 2025 • 60.4k • 219

upvoted a paper about 1 year ago

URECA: Unique Region Caption Anything

Paper • 2504.05305 • Published Apr 7, 2025 • 35

liked a model about 1 year ago

kakaocorp/kanana-nano-2.1b-instruct

Text Generation • 2B • Updated Feb 27, 2025 • 1.5k • 75

upvoted a collection over 1 year ago

EXAONE-3.5

Collection

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B • 11 items • Updated Jul 7, 2025 • 121

upvoted a paper over 1 year ago

A Noise is Worth Diffusion Guidance

Paper • 2412.03895 • Published Dec 5, 2024 • 29

liked a model over 1 year ago

LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct

Text Generation • Updated Aug 8, 2024 • 29.4k • 417

Gyuseong Lee

AI & ML interests

Recent Activity

Organizations

gses82's activity