requests beautifulsoup4 datasets pandas numpy python-dotenv gradio bm25s[full] lxml PyMuPDF huggingface_hub==0.34.6