File size: 684 Bytes
74d3370
 
 
 
 
 
04d6a69
74d3370
 
 
 
 
 
 
 
 
04d6a69
74d3370
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
---
license: mit
---

#  Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding

[[`Code`](https://github.com/YilmazKadir/Volt)] [[`arXiv`](https://arxiv.org/abs/2604.19609)] [[`Project Page`](https://vision.rwth-aachen.de/Volt)] [[`BibTeX`](#-Citation)]

## 🎓 Citation

If you use our work in your research, please use the following BibTeX entry.

```
@article{yilmaz2026volt,
  title     = {{Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding}},
  author    = {Yilmaz, Kadir and Kruse, Adrian and Höfer, Tristan and de Geus, Daan and Leibe, Bastian},
  journal   = {arXiv preprint arXiv:2604.19609},
  year      = {2026}
}
```