GHOST: Fast Category-agnostic Hand-Object Interaction Reconstruction from RGB Videos using Gaussian Splatting
Paper • 2603.18912 • Published
Part of the ANIMA Perception Suite by Robot Flow Labs.
GHOST: Fast Category-agnostic Hand-Object Interaction Reconstruction from RGB Videos using Gaussian Splatting (arXiv:2603.18912)
Ahmed Tawfik Aboukhadra, Marcel Rogge, Nadia Robertini, Abdalla Arafa, Jameel Malik, Ahmed Elhayek, Didier Stricker
GHOST reconstructs hand-object interactions from monocular RGB video using 3D Gaussian Splatting:
Synthetic validation checkpoint — trained on procedural sphere data to validate the GPU pipeline. Real ARCTIC/HO3D training requires dataset provisioning.
| Format | File | Use Case |
|---|---|---|
| PyTorch (.pth) | pytorch/gs_ghost_v1.pth |
Training, fine-tuning |
| SafeTensors | pytorch/gs_ghost_v1.safetensors |
Fast loading, safe |
| ONNX | N/A | GS rasterizer is CUDA-only |
| TensorRT | N/A | Requires ONNX first |
import torch
ckpt = torch.load("pytorch/gs_ghost_v1.pth", map_location="cpu")
# Keys: xyz, scaling, rotation, opacity, features_dc, features_rest, n_gaussians, step
configs/training.toml| Metric | Paper | Our Target |
|---|---|---|
| ARCTIC CDh | 18.40 cm² | ≤ 20 cm² |
| ARCTIC PSNR | 25.93 | ≥ 25.0 |
| HO3D LPIPS | 0.03 | ≤ 0.04 |
Apache 2.0 — Robot Flow Labs / AIFLOW LABS LIMITED