metadata
license: mit
Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding
[Code] [arXiv] [Project Page] [BibTeX]
🎓 Citation
If you use our work in your research, please use the following BibTeX entry.
@article{yilmaz2026volt,
title = {{Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding}},
author = {Yilmaz, Kadir and Kruse, Adrian and Höfer, Tristan and de Geus, Daan and Leibe, Bastian},
journal = {arXiv preprint arXiv:2604.19609},
year = {2026}
}