torchvision==0.24 transformers>=4.50.0 accelerate gradio numpy<2.0 opencv-python-headless Pillow imageio imageio-ffmpeg openai huggingface_hub spaces tqdm