PinpointQA: A Dataset and Benchmark for Small Object-Centric Spatial Understanding in Indoor Videos
Paper • 2604.08991 • Published
LoRA adapter weights for Qwen/Qwen3-VL-8B-Instruct, fine-tuned on PinpointQA.
The train.jsonl file in the dataset repository is not necessarily identical to the final serialized samples used for training.

Base model
Qwen/Qwen3-VL-8B-Instruct
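
Because this repository holds only LoRA adapter weights, they have to be attached to the base model before inference. The following is a minimal sketch of one way to do that with `transformers` and `peft`, not a procedure documented by the authors. It assumes a recent `transformers` release that supports Qwen3-VL through `AutoModelForImageTextToText`, and that `accelerate` is installed for `device_map="auto"`. `ADAPTER_ID` is a hypothetical placeholder (substitute this repository's id or a local path), and `load_adapter` is an illustrative helper name.

```python
# Sketch: attach the PinpointQA LoRA adapter to its base model.
BASE_MODEL_ID = "Qwen/Qwen3-VL-8B-Instruct"  # base model named in this card
ADAPTER_ID = "path/or/repo-id-of-this-adapter"  # hypothetical placeholder


def load_adapter(adapter_id=ADAPTER_ID, base_id=BASE_MODEL_ID):
    """Return (model, processor) with the LoRA weights applied to the base model."""
    # Heavy dependencies are imported lazily, only when loading is requested.
    from peft import PeftModel
    from transformers import AutoModelForImageTextToText, AutoProcessor

    # The processor (tokenizer plus image/video preprocessing) comes from the base model.
    processor = AutoProcessor.from_pretrained(base_id)

    # Assumption: the installed transformers version maps Qwen3-VL to this auto class.
    base = AutoModelForImageTextToText.from_pretrained(base_id, device_map="auto")

    # Wrap the base model with the adapter weights and switch to inference mode.
    model = PeftModel.from_pretrained(base, adapter_id)
    model.eval()
    return model, processor
```

Calling `model, processor = load_adapter()` downloads the base weights on first use. If a standalone checkpoint is preferred, `model.merge_and_unload()` folds the LoRA weights into the base model.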