File size: 684 Bytes
74d3370 04d6a69 74d3370 04d6a69 74d3370 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 | ---
license: mit
---
# Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding
[[`Code`](https://github.com/YilmazKadir/Volt)] [[`arXiv`](https://arxiv.org/abs/2604.19609)] [[`Project Page`](https://vision.rwth-aachen.de/Volt)] [[`BibTeX`](#-Citation)]
## 🎓 Citation
If you use our work in your research, please use the following BibTeX entry.
```
@article{yilmaz2026volt,
title = {{Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding}},
author = {Yilmaz, Kadir and Kruse, Adrian and Höfer, Tristan and de Geus, Daan and Leibe, Bastian},
journal = {arXiv preprint arXiv:2604.19609},
year = {2026}
}
``` |