| --- |
| license: mit |
| --- |
| |
| # Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding |
|
|
| [[`Code`](https://github.com/YilmazKadir/Volt)] [[`arXiv`](https://arxiv.org/abs/2604.19609)] [[`Project Page`](https://vision.rwth-aachen.de/Volt)] [[`BibTeX`](#-Citation)] |
|
|
| ## 🎓 Citation |
|
|
| If you use our work in your research, please use the following BibTeX entry. |
|
|
| ``` |
| @article{yilmaz2026volt, |
| title = {{Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding}}, |
| author = {Yilmaz, Kadir and Kruse, Adrian and Höfer, Tristan and de Geus, Daan and Leibe, Bastian}, |
| journal = {arXiv preprint arXiv:2604.19609}, |
| year = {2026} |
| } |
| ``` |