File size: 1,651 Bytes
c6dfc69
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
# Before Start

This document provides a concise workflow to run AuralSAM2 experiments.

## โš™๏ธ Prepare environment and data

Please complete all setup steps in [installation](./installation.md) first.

## ๐Ÿš€ Training

Use the unified launcher script:

```bash
cd scripts
./run_avs_train.sh <v1s|v1m|v2> [gpus]
./run_ref_train.sh [gpus]
```
The experiments are implemented by 4 GPUs by default.

## ๐Ÿ” Inference (example)

```bash
cd avs.code/v2.code
python inference.py --gpus 1 --batch_size 1 --inference_ckpt /absolute/path/to/checkpoint.pth
```

## ๐Ÿ“Š Training Logs (Reproducibility)

Some examples of training details, please see [this wandb link](https://wandb.ai/pyedog1976/AVS-final-report/workspace?nw=nwuserpyedog1976).

In details, after clicking the run (e.g., [v1m-hiera-l](https://wandb.ai/pyedog1976/AVS-final-report/runs/gzp5dmwi/logs?nw=nwuserpyedog1976)), you can checkout:

1) <img src="https://user-images.githubusercontent.com/102338056/167979073-1c1b3144-8a72-4d8d-9084-31d7fdab3e9b.png" width="26" height="22"> overall information (e.g., command line, hardware information and training time).
2) <img src="https://user-images.githubusercontent.com/102338056/167978940-8c1f3d79-d062-4e7b-b56e-30b97d273ae8.png" width="26" height="22"> training curves and validation visualisation.
3) <img src="https://user-images.githubusercontent.com/102338056/167979238-4847430f-aa0b-483d-b735-8a10b43293a1.png" width="26" height="22"> output logs.


## ๐Ÿ’พ Checkpoints
We release both checkpoints and training logs in this [Google Drive link](https://drive.google.com/drive/folders/1n0HaCHMn48KaImXvX2mu4qKHUQg4mo9R?usp=sharing).