--- license: mit --- # Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding [[`Code`](https://github.com/YilmazKadir/Volt)] [[`arXiv`](https://arxiv.org/abs/2604.19609)] [[`Project Page`](https://vision.rwth-aachen.de/Volt)] [[`BibTeX`](#-Citation)] ## 🎓 Citation If you use our work in your research, please use the following BibTeX entry. ``` @article{yilmaz2026volt, title = {{Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding}}, author = {Yilmaz, Kadir and Kruse, Adrian and Höfer, Tristan and de Geus, Daan and Leibe, Bastian}, journal = {arXiv preprint arXiv:2604.19609}, year = {2026} } ```