Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Junjie Fei's picture
1 4 2

Junjie Fei

FeiElysia

AI & ML interests

None yet

Recent Activity

liked a model about 3 hours ago
Vision-CAIR/Tempo-6B
upvoted a collection about 3 hours ago
LongVU
upvoted a collection about 3 hours ago
Tempo
View all activity

Organizations

None yet

authored 5 papers 5 days ago

Caption Anything: Interactive Image Description with Diverse Multimodal Controls

Paper • 2305.02677 • Published May 4, 2023 • 1

Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents

Paper • 2411.16740 • Published Nov 23, 2024 • 2

WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation

Paper • 2503.19065 • Published Mar 24, 2025 • 11

Neural Computers

Paper • 2604.06425 • Published 9 days ago • 28

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Paper • 2604.08120 • Published 7 days ago • 20
submitted a paper to Daily Papers 5 days ago

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Paper • 2604.08120 • Published 7 days ago • 20
authored a paper over 1 year ago

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

Paper • 2307.16525 • Published Jul 31, 2023 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs