OccAny: Generalized Unconstrained Urban 3D Occupancy
Paper • 2603.23502 • Published • 1
Part of the ANIMA Intelligence Compiler Suite by Robot Flow Labs.
OccAny: Generalized Unconstrained Urban 3D Occupancy Prediction (CVPR 2026) Anh-Quan Cao, Tuan-Hung Vu — Valeo AI
OccAny predicts dense 3D occupancy from RGB camera inputs without LiDAR supervision. The ANIMA implementation uses:
| Benchmark | Metric | Paper | ANIMA Target |
|---|---|---|---|
| SemanticKITTI sequence | IoU | 25.91 | >= 24.5 |
| SemanticKITTI monocular | IoU | 24.03 | >= 22.5 |
| Occ3D-nuScenes surround | IoU | 34.15 | >= 32.0 |
| Format | File | Size | Use Case |
|---|---|---|---|
| PyTorch (.pth) | pytorch/vis_occany_v1.pth |
108 MB | Training, fine-tuning |
| SafeTensors | pytorch/vis_occany_v1.safetensors |
108 MB | Fast loading, safe |
| ONNX | onnx/vis_occany_v1.onnx |
66 MB | Cross-platform inference |
| TensorRT FP32 | tensorrt/vis_occany_v1_fp32.trt |
67 MB | Full precision inference |
| TensorRT FP16 | tensorrt/vis_occany_v1_fp16.trt |
35 MB | Edge deployment (Jetson/L4) |
import torch
from safetensors.torch import load_file
# Load weights
state = load_file("pytorch/vis_occany_v1.safetensors")
# Or with full model
from anima_vis_occany.model.reconstruction import ReconstructionStage
model = ReconstructionStage(hidden_dim=384, decoder_depth=6, decoder_heads=6)
model.load_state_dict({k.replace("reconstruction.", ""): v for k, v in state.items() if k.startswith("reconstruction.")})
configs/training.toml| Endpoint | Method | Description |
|---|---|---|
/health |
GET | Module health status |
/ready |
GET | Checkpoint readiness |
/info |
GET | Module metadata |
/infer |
POST | Run 3D occupancy inference |
docker compose -f docker-compose.serve.yml --profile serve up -d
Apache 2.0 — Robot Flow Labs / AIFLOW LABS LIMITED