Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published about 1 month ago • 87
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content Paper • 2410.10783 • Published Oct 14, 2024 • 26
DifuzCam: Replacing Camera Lens with a Mask and a Diffusion Model Paper • 2408.07541 • Published Aug 14, 2024 • 1
Tell Me What You See: Text-Guided Real-World Image Denoising Paper • 2312.10191 • Published Dec 15, 2023