Spaces:

shiva0013
/

YT-AI-Automation

Running

App Files Files Community

YT-AI-Automation / backend /docs /SETUP_GUIDE.md

github-actions

Sync Docker Space

5f3e9f5 3 days ago

preview code

raw

history blame contribute delete

5.28 kB

Screenshot Studio - Setup Guide

Complete guide to set up and run Screenshot Studio.

Quick Start

# 1. Install dependencies
pip install -r requirements.txt

# 2. Install Playwright browsers
playwright install chromium

# 3. Configure API
cp config/config.example.py config/config.py
# Edit config/config.py with your API credentials

# 4. Start the application
python start.py

Detailed Setup

Step 1: System Requirements

Python: 3.8 or higher
pip: Latest version
Operating System: Windows, macOS, or Linux
RAM: Minimum 2GB
Disk Space: ~500MB for dependencies

Step 2: Install Python Dependencies

pip install -r requirements.txt

This installs:

Flask (web framework)
Playwright (screenshot engine)
Pillow (image processing)
Requests (HTTP client)

Step 3: Install Playwright Browsers

Playwright needs browser binaries:

playwright install chromium

This downloads Chromium (~150MB). You can also install other browsers:

playwright install firefox
playwright install webkit

Step 4: Configure API Settings

Copy the example config:

cp config/config.example.py config/config.py

Edit config/config.py:

API_KEY = "your-actual-api-key"
BASE_URL = "https://your-api-endpoint.com/v1"
MODEL_NAME = "your-model-name"

Get API credentials:
- For Groq: https://console.groq.com/keys
- For OpenAI: https://platform.openai.com/api-keys
- For other providers: Check their documentation

Step 5: Verify Installation

Run the startup script with checks:

python start.py

This will:

✅ Check all dependencies
✅ Verify configuration
✅ Test Playwright installation
✅ Create output folders
🚀 Start the server

Step 6: Access the Application

Open your browser to:

http://localhost:5000

Configuration Options

Basic Settings (config/config.py)

# API Configuration
API_KEY = "sk-..."              # Your API key
BASE_URL = "https://..."        # API endpoint
MODEL_NAME = "llama-3.1-70b"    # Model to use

# Server Settings
DEBUG = True                    # Enable debug mode
PORT = 5000                     # Server port
HOST = "0.0.0.0"               # Listen on all interfaces

# Output Settings
DEFAULT_SCREENSHOT_FOLDER = "output/screenshots"
DEFAULT_HTML_FOLDER = "output/html"

Advanced Settings

# Screenshot Quality
DEFAULT_VIEWPORT_WIDTH = 1920   # Screenshot width
DEFAULT_VIEWPORT_HEIGHT = 1080  # Screenshot height
DEFAULT_DEVICE_SCALE = 2.5      # Quality multiplier
DEFAULT_FONT_SIZE = 250         # Font size percentage
DEFAULT_OVERLAP = 35            # Overlap between shots

# Limits
MAX_SCREENSHOTS_LIMIT = 50      # Maximum screenshots per request
MAX_TOKENS = 100000             # Maximum AI tokens

Troubleshooting

Issue: "Module not found"

Solution:

pip install -r requirements.txt

Issue: "Playwright executable not found"

Solution:

playwright install chromium

Issue: "API key invalid"

Solution:

Check config/config.py has correct API key
Verify API key is active on provider's dashboard
Check BASE_URL is correct

Issue: "Permission denied" on output folders

Solution:

# Windows
icacls output /grant Users:F /T

# Linux/Mac
chmod -R 755 output

Issue: Port 5000 already in use

Solution: Edit config/config.py:

PORT = 5001  # Use different port

Or stop the process using port 5000:

# Windows
netstat -ano | findstr :5000
taskkill /PID <PID> /F

# Linux/Mac
lsof -ti:5000 | xargs kill -9

Running in Production

Using Gunicorn (Linux/Mac)

gunicorn -w 4 -b 0.0.0.0:5000 app:app

Using Waitress (Windows)

pip install waitress
waitress-serve --port=5000 app:app

Environment Variables

Set sensitive data via environment variables:

# Linux/Mac
export API_KEY="your-key"
export BASE_URL="your-url"

# Windows
set API_KEY=your-key
set BASE_URL=your-url

Then in config/config.py:

import os
API_KEY = os.getenv('API_KEY', 'default-key')
BASE_URL = os.getenv('BASE_URL', 'default-url')

Updating

Update Python packages

pip install --upgrade -r requirements.txt

Update Playwright

playwright install chromium --force

Uninstalling

# Remove Python packages
pip uninstall -r requirements.txt -y

# Remove Playwright browsers
playwright uninstall --all

# Delete project folder
cd ..
rm -rf screenshot-studio

Getting Help

Check terminal output for detailed error messages
Review logs in the console
Verify configuration in config/config.py
Test API separately to isolate issues

Next Steps

After setup:

Read README.md for usage instructions
Check static/README.md for frontend customization
Explore the UI at http://localhost:5000
Try the Text to Image tool with sample content
Experiment with advanced settings

Support

For issues:

Check error messages in terminal
Verify all setup steps completed
Review troubleshooting section
Check API provider status