SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models Paper • 2603.19028 • Published about 1 month ago • 18
ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models Paper • 2603.19466 • Published about 1 month ago • 41
Linear Model Merging Unlocks Simple and Scalable Multimodal Data Mixture Optimization Paper • 2602.04937 • Published Feb 4 • 1
Specificity-aware reinforcement learning for fine-grained open-world classification Paper • 2603.03197 • Published Mar 3 • 16
Compositional Caching for Training-free Open-vocabulary Attribute Detection Paper • 2503.19145 • Published Mar 24, 2025 • 3
Compositional Caching for Training-free Open-vocabulary Attribute Detection Paper • 2503.19145 • Published Mar 24, 2025 • 3
How to Take a Memorable Picture? Empowering Users with Actionable Feedback Paper • 2602.21877 • Published Feb 25 • 16