Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Qingkai Fang's picture
11 8 21

Qingkai Fang

poeroz
starkprince's profile picture SteveSHEN's profile picture 21world's profile picture
·
https://fangqingkai.github.io/
  • poeroz

AI & ML interests

Large Language Models, Speech-Language Models, Speech Translation

Organizations

Natural Language Processing Group, Institute of Computing Technology, Chinese Academy of Science's profile picture

poeroz 's collections 1

Paper list
  • Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

    Paper • 2403.02677 • Published Mar 5, 2024 • 18
  • Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models

    Paper • 2403.03003 • Published Mar 5, 2024 • 11
  • InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding

    Paper • 2403.01487 • Published Mar 3, 2024 • 16
  • VisionLLaMA: A Unified LLaMA Interface for Vision Tasks

    Paper • 2403.00522 • Published Mar 1, 2024 • 46
Paper list
  • Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

    Paper • 2403.02677 • Published Mar 5, 2024 • 18
  • Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models

    Paper • 2403.03003 • Published Mar 5, 2024 • 11
  • InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding

    Paper • 2403.01487 • Published Mar 3, 2024 • 16
  • VisionLLaMA: A Unified LLaMA Interface for Vision Tasks

    Paper • 2403.00522 • Published Mar 1, 2024 • 46
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs