5 12 5

Ray Yang

rayruiyang

Yangr116

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

upvoted a paper 13 days ago

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

upvoted a paper 15 days ago

MolmoPoint: Better Pointing for VLMs with Grounding Tokens

View all activity

Organizations

None yet

upvoted a paper 8 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 9 days ago • 107

upvoted a paper 13 days ago

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

Paper • 2603.25823 • Published 19 days ago • 43

upvoted a paper 15 days ago

MolmoPoint: Better Pointing for VLMs with Grounding Tokens

Paper • 2603.28069 • Published 16 days ago • 8

upvoted a paper 20 days ago

ProAct: Agentic Lookahead in Interactive Environments

Paper • 2602.05327 • Published Feb 5 • 27

updated a dataset about 1 month ago

rayruiyang/vst_500k

Viewer • Updated Mar 13 • 563k • 2.67k • 3

upvoted 2 papers about 1 month ago

Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

Paper • 2603.07660 • Published Mar 8 • 86

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published Mar 3 • 185

updated a collection 2 months ago

VST

Collection

A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities. • 6 items • Updated Feb 1 • 6

updated a dataset 2 months ago

rayruiyang/vst_3d_grounding_benchmark

Preview • Updated Feb 1 • 8

published a dataset 2 months ago

rayruiyang/vst_3d_grounding_benchmark

Preview • Updated Feb 1 • 8

published a dataset 3 months ago

rayruiyang/vst_500k

Viewer • Updated Mar 13 • 563k • 2.67k • 3

upvoted a paper 4 months ago

DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning

Paper • 2512.12799 • Published Dec 14, 2025 • 12

upvoted a paper 5 months ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 216

updated 4 models 5 months ago

New activity in rayruiyang/VST-7B-RL 5 months ago

Add pipeline tag and library name to model card

#1 opened 5 months ago by

nielsr

New activity in rayruiyang/VST-3B-RL 5 months ago

Add `library_name` and `pipeline_tag` metadata

#1 opened 5 months ago by

nielsr

New activity in rayruiyang/VST-7B-SFT 5 months ago

Improve VST-7B-SFT model card with metadata, paper link, and usage clarity

#1 opened 5 months ago by

nielsr

Ray Yang

AI & ML interests

Recent Activity

Organizations

rayruiyang's activity

Add pipeline tag and library name to model card

Add `library_name` and `pipeline_tag` metadata

Improve VST-7B-SFT model card with metadata, paper link, and usage clarity