Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Xiaojie Jin's picture
6

Xiaojie Jin

xjjin
zhangysk's profile picture
·
https://scholar.google.com/citations?view_op=list_works&hl=en&hl=en&user=OEZ816YAAAAJ
  • XiaojieJin
  • xiaojie-jin-b10513121

AI & ML interests

Multimodal Reasoning & Decision, GenAI, Computer Vision

Organizations

None yet

upvoted a paper 2 months ago

VideoWorld 2: Learning Transferable Knowledge from Real-world Videos

Paper • 2602.10102 • Published Feb 10 • 14
upvoted a paper 4 months ago

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published Dec 10, 2025 • 74
upvoted a paper 12 months ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

Paper • 2504.15415 • Published Apr 21, 2025 • 23
upvoted a paper about 1 year ago

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

Paper • 2501.09781 • Published Jan 16, 2025 • 27
upvoted a paper almost 2 years ago

Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams

Paper • 2406.08085 • Published Jun 12, 2024 • 17
upvoted a paper about 2 years ago

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12, 2024 • 30
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs