Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
Yuhao Dong PRO
THUdyh
AI & ML interests
None yet
Recent Activity
upvoted a paper about 12 hours ago
Prompt Relay: Inference-Time Temporal Control for Multi-Event Video Generation upvoted a paper 1 day ago
WildDet3D: Scaling Promptable 3D Detection in the Wild upvoted a paper 1 day ago
VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images