Sihui Ji

zjuJish

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Strips as Tokens: Artist Mesh Generation with Native UV Segmentation

Paper • 2604.09132 • Published 6 days ago • 47

upvoted a paper 23 days ago

Manifold-Aware Exploration for Reinforcement Learning in Video Generation

Paper • 2603.21872 • Published 23 days ago • 33

upvoted a paper 27 days ago

FASTER: Rethinking Real-Time Flow VLAs

Paper • 2603.19199 • Published 27 days ago • 58

upvoted a paper about 1 month ago

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published Mar 3 • 185

upvoted a paper 3 months ago

GARDO: Reinforcing Diffusion Models without Reward Hacking

Paper • 2512.24138 • Published Dec 30, 2025 • 30

upvoted a paper 4 months ago

Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Paper • 2512.22615 • Published Dec 27, 2025 • 50

updated a model 4 months ago

KlingTeam/MemFlow

Text-to-Video • Updated Dec 29, 2025 • 11

upvoted 2 papers 4 months ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published Dec 23, 2025 • 51

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published Dec 23, 2025 • 94

authored a paper 4 months ago

MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives

Paper • 2512.14699 • Published Dec 16, 2025 • 28

upvoted 6 papers 4 months ago

Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection

Paper • 2512.16905 • Published Dec 18, 2025 • 32

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

Paper • 2512.16915 • Published Dec 18, 2025 • 38

Kling-Omni Technical Report

Paper • 2512.16776 • Published Dec 18, 2025 • 173

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published Dec 16, 2025 • 121

MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives

Paper • 2512.14699 • Published Dec 16, 2025 • 28

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published Dec 15, 2025 • 76

authored 2 papers 4 months ago

LayerFlow: A Unified Model for Layer-aware Video Generation

Paper • 2506.04228 • Published Jun 4, 2025 • 13

DiffDoctor: Diagnosing Image Diffusion Models Before Treating

Paper • 2501.12382 • Published Jan 21, 2025

published a model 4 months ago