# AuralSAM2
> **[CVPRF'26]** [AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting](#)
>
> by Yuyuan Liu, Yuanhong Chen, Chong Wang, Junlin Han, Junde Wu, Can Peng, Jingkun Chen, Yu Tian and Gustavo Carneiro
>
<img src="./docs/overview.png" width="850" height="300" />

## Installation
Please install the dependencies and set up the dataset by following the [***installation***](./docs/installation.md) document.

## Getting Started
Please follow this [***instruction***](./docs/before_start.md) document to reproduce our results.

## Citation
Please consider citing our work in your publications if it helps your research.

```bibtex
@article{liu2025auralsam2,
  title={AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting},
  author={Liu, Yuyuan and Chen, Yuanhong and Wang, Chong and Han, Junlin and Wu, Junde and Peng, Can and Chen, Jingkun and Tian, Yu and Carneiro, Gustavo},
  journal={arXiv preprint arXiv:2506.01015},
  year={2025}
}
```