yl-1993's picture

yl-1993

yl-1993

·

https://yanglei.me

yl-1993

AI & ML interests

None yet

Recent Activity

updated a collection 1 day ago

liked a model 16 days ago

sensenova/SenseNova-SI-1.5-InternVL3-8B

updated a collection 21 days ago

View all activity

Organizations

upvoted a paper 25 days ago

MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction

Paper • 2603.19231 • Published 29 days ago • 36

upvoted a paper 28 days ago

Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer

Paper • 2603.19227 • Published 29 days ago • 42

upvoted a paper 30 days ago

Kinema4D: Kinematic 4D World Modeling for Spatiotemporal Embodied Simulation

Paper • 2603.16669 • Published about 1 month ago • 70

upvoted 2 papers about 1 month ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published about 1 month ago • 369

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published Mar 16 • 152

upvoted an article about 1 month ago

Article

NEO-unify: Building Native Multimodal Unified Models End to End

Mar 5

•

125

upvoted 4 papers about 2 months ago

SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning

Paper • 2512.24330 • Published Dec 30, 2025 • 36

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 264

ConsistCompose: Unified Multimodal Layout Control for Image Composition

Paper • 2511.18333 • Published Nov 23, 2025 • 4

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519

upvoted a collection 3 months ago

NEO1_0

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale • 7 items • Updated Jan 27 • 9

upvoted 5 collections 4 months ago

Encoders-Lightx2v

2 items • Updated Dec 23, 2025 • 3

Wan2.1-Lightx2v

4 items • Updated Dec 23, 2025 • 2

Wan2.2-Lightx2v

4 items • Updated Dec 23, 2025 • 9

Qwen-Image-Lightx2v

4 items • Updated Feb 15 • 9

NVFP4-Lightx2v

1 item • Updated Dec 23, 2025 • 9

upvoted 2 papers 4 months ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published Dec 22, 2025 • 67

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published Dec 15, 2025 • 76

upvoted a collection 4 months ago

SenseNova-SI

Scaling Spatial Intelligence with Multimodal Foundation Models • 15 items • Updated about 23 hours ago • 16

upvoted a paper 5 months ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 96