openai
PyPDF2
langchain
streamlit
python-dotenv
tiktoken
faiss-cpu
pdfplumber
python-docx
requests
beautifulsoup4