MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation Paper โข 2601.06874 โข Published Jan 11 โข 12