SigLino: Vision Foundation Models (SigLIP2 + DINOv3) Collection Vision encoders distilled from DINOv3 and SigLIP2 (MoE & Dense). CVPR 2026. • 6 items • Updated 4 days ago • 17
4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation Paper • 2512.17012 • Published Dec 18, 2025 • 48
LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity Paper • 2404.03214 • Published Apr 4, 2024 • 3
VisRes Bench: On Evaluating the Visual Reasoning Capabilities of VLMs Paper • 2512.21194 • Published Dec 24, 2025
Falcon Perception Collection Falcon-Perception and Falcon-OCR model: early-fusion, natively multimodal, dense Autoregressive Transformer models. • 5 items • Updated 9 days ago • 13
SigLino: Vision Foundation Models (SigLIP2 + DINOv3) Collection Vision encoders distilled from DINOv3 and SigLIP2 (MoE & Dense). CVPR 2026. • 6 items • Updated 4 days ago • 17