VCLab-HKPU 's Collections

Multimodal Perception, Understanding and Reasoning

VCLab's research in MLLM-based visual perception, understanding and reasoning, enhancing the multi-tasking capabilities of MLLMs.