Kevin Zhang

Kevin-thu

https://kevin-thu.github.io/homepage

AI & ML interests

Computer Vision, Generation Models, Neural Rendering

Organizations

None yet

Recent Activity

upvoted a paper about 15 hours ago

Lyra 2.0: Explorable Generative 3D Worlds

Paper • 2604.13036 • Published 2 days ago • 17

upvoted a paper 7 days ago

Self-Distilled RLVR

Paper • 2604.03128 • Published 13 days ago • 160

upvoted a paper 9 days ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 10 days ago • 200

upvoted a paper 23 days ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published 29 days ago • 109

upvoted 3 papers 29 days ago

WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation

Paper • 2603.16871 • Published 29 days ago • 60

Demystifing Video Reasoning

Paper • 2603.16870 • Published 29 days ago • 369

Attention Residuals

Paper • 2603.15031 • Published about 1 month ago • 180

upvoted 2 papers about 1 month ago

MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

Paper • 2603.03756 • Published Mar 4 • 89

Mode Seeking meets Mean Seeking for Fast Long Video Generation

Paper • 2602.24289 • Published Feb 27 • 41

upvoted 3 papers about 2 months ago

upvoted a paper 3 months ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 176

upvoted 2 papers 4 months ago

Kling-Omni Technical Report

Paper • 2512.16776 • Published Dec 18, 2025 • 173

StoryMem: Multi-shot Long Video Storytelling with Memory

Paper • 2512.19539 • Published Dec 22, 2025 • 19

upvoted a paper 5 months ago

In-Video Instructions: Visual Signals as Generative Control

Paper • 2511.19401 • Published Nov 24, 2025 • 32

upvoted 4 papers 6 months ago

Video-As-Prompt: Unified Semantic Control for Video Generation

Paper • 2510.20888 • Published Oct 23, 2025 • 50

MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation

Paper • 2510.18692 • Published Oct 21, 2025 • 41

Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

Paper • 2510.09212 • Published Oct 10, 2025 • 18

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published Oct 9, 2025 • 127
