streamlit
torch>=2.0.0
torchvision>=0.15.0
torchaudio>=2.0.0
transformers>=4.30.0
diffusers>=0.21.0
accelerate>=0.21.0
huggingface_hub
librosa
soundfile
opencv-python-headless
pillow
numpy
scipy
ffmpeg-python
einops
xformers