Xi Yang's picture

Xi Yang

ianyeung

·

IanYeung

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

tencent/HY-Embodied-0.5

upvoted a paper 6 days ago

Vero: An Open RL Recipe for General Visual Reasoning

upvoted a paper 6 days ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

View all activity

Organizations

None yet

upvoted 2 papers 6 days ago

Vero: An Open RL Recipe for General Visual Reasoning

Paper • 2604.04917 • Published 7 days ago • 28

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 7 days ago • 199

upvoted 6 papers 12 days ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Paper • 2603.28088 • Published 14 days ago • 85

CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

Paper • 2603.29664 • Published 12 days ago • 48

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

Paper • 2603.29620 • Published 12 days ago • 46

VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward

Paper • 2603.26599 • Published 16 days ago • 61

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 15 days ago • 137

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Paper • 2603.28767 • Published 13 days ago • 56

upvoted 2 papers 13 days ago

Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization

Paper • 2603.28342 • Published 13 days ago • 26

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published 17 days ago • 153

upvoted a paper 17 days ago

RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models

Paper • 2603.25502 • Published 17 days ago • 56

upvoted a paper 19 days ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published 20 days ago • 123

upvoted 4 papers about 1 month ago

ShotVerse: Advancing Cinematic Camera Control for Text-Driven Multi-Shot Video Creation

Paper • 2603.11421 • Published Mar 12 • 34

TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Paper • 2305.07759 • Published May 12, 2023 • 45

InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions

Paper • 2603.03646 • Published Mar 4 • 8

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published Mar 4 • 186

upvoted 3 papers about 2 months ago

Optimizing Few-Step Generation with Adaptive Matching Distillation

Paper • 2602.07345 • Published Feb 7 • 9

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published Feb 13 • 58

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published Feb 13 • 44

upvoted a paper 2 months ago

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Paper • 2602.06028 • Published Feb 5 • 36