
Before Start

This document provides a concise workflow to run AuralSAM2 experiments.

⚙️ Prepare environment and data

Please complete all setup steps in the installation instructions first.

🚀 Training

Use the unified launcher scripts:

cd scripts
./run_avs_train.sh <v1s|v1m|v2> [gpus]
./run_ref_train.sh [gpus]

The experiments run on 4 GPUs by default.
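To make the launcher's argument pattern concrete, here is a minimal Python sketch that assembles the AVS training command. The helper `build_avs_train_cmd` is hypothetical (it is not part of the repository), and it assumes that the optional `[gpus]` argument is a positive GPU count; check the script itself if your setup differs.

```python
import shlex

# Variants accepted by run_avs_train.sh, taken from its usage line.
AVS_VARIANTS = ("v1s", "v1m", "v2")


def build_avs_train_cmd(variant, gpus=None):
    """Hypothetical helper: return the argv list for ./run_avs_train.sh.

    Assumption: [gpus] is a positive integer GPU count. When it is omitted,
    the script falls back to its own default.
    """
    if variant not in AVS_VARIANTS:
        raise ValueError(f"variant must be one of {AVS_VARIANTS}, got {variant!r}")
    cmd = ["./run_avs_train.sh", variant]
    if gpus is not None:
        if gpus < 1:
            raise ValueError("gpus must be a positive integer")
        cmd.append(str(gpus))
    return cmd


# Print the shell command for training the v2 variant on 4 GPUs.
print(shlex.join(build_avs_train_cmd("v2", 4)))
```

The resulting list can be passed to `subprocess.run(cmd, cwd="scripts", check=True)` so that the script runs from the `scripts` directory, matching the `cd scripts` step.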

🔍 Inference (example)

cd avs.code/v2.code
python inference.py --gpus 1 --batch_size 1 --inference_ckpt /absolute/path/to/checkpoint.pth
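If you launch inference from another script, the following sketch builds the same command and rejects relative checkpoint paths up front. The helper `build_inference_cmd` is hypothetical, and the absolute-path check is an assumption drawn from the placeholder path in the example, not a documented requirement of `inference.py`.

```python
import os


def build_inference_cmd(ckpt_path, gpus=1, batch_size=1):
    """Hypothetical helper: return the argv list for inference.py.

    Only the flags shown in the example command are used. Requiring an
    absolute checkpoint path is an assumption based on the placeholder.
    """
    if not os.path.isabs(ckpt_path):
        raise ValueError(f"checkpoint path should be absolute, got {ckpt_path!r}")
    return [
        "python", "inference.py",
        "--gpus", str(gpus),
        "--batch_size", str(batch_size),
        "--inference_ckpt", ckpt_path,
    ]


cmd = build_inference_cmd("/absolute/path/to/checkpoint.pth")
print(" ".join(cmd))
# To actually run it from the v2 code directory:
# import subprocess
# subprocess.run(cmd, cwd="avs.code/v2.code", check=True)
```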

📊 Training Logs (Reproducibility)

For examples of training details, please see this wandb link.

In detail, after clicking a run (e.g., v1m-hiera-l), you can check out:

  1. overall information (e.g., command line, hardware information and training time).
  2. training curves and validation visualisation.
  3. output logs.

💾 Checkpoints

We release both checkpoints and training logs via this Google Drive link.