Zhaokai Yin's picture

5 1

Zhaokai Yin

QiuGuangwww

·

https://blog.qiuguang.top/

AI & ML interests

VLA/VLM/Embodied Intelligence

Recent Activity

upvoted a collection 17 days ago

upvoted a paper 2 months ago

FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

upvoted a paper 3 months ago

Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance

View all activity

Organizations

None yet

upvoted a collection 17 days ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 206

upvoted a paper 2 months ago

FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

Paper • 2504.09925 • Published Apr 14, 2025 • 39

upvoted a paper 3 months ago

Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance

Paper • 2601.14171 • Published Jan 20 • 53

liked a dataset 3 months ago

lerobot/libero_spatial_image

Viewer • Updated Mar 9 • 53k • 4.35k • 7

upvoted a collection 3 months ago

timm DINOv3

Meta AI's DINOv3 weights in timm. ViTs with `qkvb` have a zero QV bias present, otherwise bias is disabled. QKV bias are all 0 in original weights. • 18 items • Updated Sep 19, 2025 • 33

upvoted a paper 3 months ago

InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams

Paper • 2601.02281 • Published Jan 5 • 33