Yifan Xu

geekifan

AI & ML interests

None yet

Organizations

Multimedia Computing Group-Nanjing University, Yifan's Organization

upvoted a paper 2 months ago

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

Paper • 2602.08711 • Published Feb 9 • 28
upvoted 2 papers 4 months ago

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

Paper • 2512.14698 • Published Dec 16, 2025 • 21

SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation

Paper • 2511.19320 • Published Nov 24, 2025 • 43
upvoted a paper 5 months ago

UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions

Paper • 2511.03334 • Published Nov 5, 2025 • 54
upvoted a paper 9 months ago

PixNerd: Pixel Neural Field Diffusion

Paper • 2507.23268 • Published Jul 31, 2025 • 52
upvoted a collection about 1 year ago

CaRe

Collection
CaReBench data, CaRe models, and all the contrastively trained MLLMs (including InternVL2, MiniCPM-V 2.6, LLaVA NeXT Video, Qwen2-VL, and Tarsier). • 6 items • Updated Mar 17, 2025 • 2
upvoted a paper over 1 year ago

CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation

Paper • 2410.23090 • Published Oct 30, 2024 • 55