xiaokun sun
yscsb
AI & ML interests
None yet
Recent Activity
upvoted a paper about 10 hours ago
VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model
upvoted a paper about 10 hours ago
DiG: Differential Grounding for Enhancing Fine-Grained Perception in Multimodal Large Language Model
upvoted a paper about 10 hours ago
RISE-Video: Can Video Generators Decode Implicit World Rules?
Organizations
None yet