Hiring 💼

Chengxuan Qian

Raymond-Qiancx

https://qiancx.com/

AI & ML interests

Vision-Language Models

Recent Activity

upvoted a paper 3 days ago

FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios

upvoted a paper 3 days ago

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

upvoted a paper 3 days ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Organizations

None yet

upvoted 3 papers 3 days ago

FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Paper • 2604.07413 • Published 9 days ago • 91

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Paper • 2604.08995 • Published 7 days ago • 44

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published 8 days ago • 232

commented a paper 3 days ago

Multi-User Large Language Model Agents

Paper • 2604.08567 • Published 29 days ago • 25

upvoted a paper 3 days ago

Multi-User Large Language Model Agents

Paper • 2604.08567 • Published 29 days ago • 25

upvoted 5 papers 4 days ago

WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling

Paper • 2512.14614 • Published Dec 16, 2025 • 73

upvoted 10 papers 5 days ago

Action100M: A Large-scale Video Action Dataset

Paper • 2601.10592 • Published Jan 15 • 31

WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

Paper • 2512.19678 • Published Dec 22, 2025 • 31

Spatia: Video Generation with Updatable Spatial Memory

Paper • 2512.15716 • Published Dec 17, 2025 • 34

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Paper • 2512.13507 • Published Dec 15, 2025 • 41

SpatialTree: How Spatial Abilities Branch Out in MLLMs

Paper • 2512.20617 • Published Dec 23, 2025 • 44

AnyDepth: Depth Estimation Made Easy

Paper • 2601.02760 • Published Jan 6 • 11

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published Jan 8 • 58

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published Jan 8 • 170

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 201

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published Jan 15 • 34

Raymond-Qiancx's activity