ADE20K Segmentation Probe β€” DINOv3 ViT-B/16 @ 256px input

Linear segmentation probe on the spatial features of facebook/dinov3-vitb16-pretrain-lvd1689m.

Usage

uv add "canvit-pytorch @ git+https://github.com/m2b3/CanViT-PyTorch.git"
import torch
from canvit_pytorch.probes import SegmentationProbe

probe = SegmentationProbe.from_pretrained("canvit/probe-ade20k-40k-dv3b-256px").eval()

# [B, H, W, D] DINOv3 ViT-B/16 spatial features at 256px input
features = torch.randn(1, 16, 16, 768)
with torch.inference_mode():
    logits = probe(features)    # [B, num_classes, H, W]
assert logits.shape == (1, 150, 16, 16)

Training

Architecture: Dropout β†’ BatchNorm β†’ Conv1Γ—1.

Hyperparameter Value
Input size 256 Γ— 256 px
Optimizer AdamW
Peak LR 3Γ—10βˆ’4 3 \times 10^{-4}
Weight decay 10βˆ’3 10^{-3}
LR schedule 1,500-step warmup β†’ cosine decay
Batch size 16
Max steps 40,000
Dropout 0.1
Augmentation RandomResizedCrop scale [0.5, 2] + HFlip
Precision bf16 (AMP)
Downloads last month
54
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for canvit/probe-ade20k-40k-dv3b-256px

Dataset used to train canvit/probe-ade20k-40k-dv3b-256px

Collection including canvit/probe-ade20k-40k-dv3b-256px

Paper for canvit/probe-ade20k-40k-dv3b-256px