Spaces:

Param20h
/

sql-query-optimizer

Sleeping

App Files Files Community

sql-query-optimizer / README.md

Param20h

Upload folder using huggingface_hub

2541228 verified 14 days ago

preview code

raw

history blame contribute delete

6.07 kB

metadata

title: SQL Query Optimizer Environment Server
emoji: 🐳
colorFrom: blue
colorTo: indigo
sdk: docker
pinned: false
app_port: 7860
base_path: /web
tags:
  - openenv

SQL Query Optimizer — OpenEnv Environment

An OpenEnv-compliant environment where AI agents learn to review, rewrite, and optimise SQL queries across three real-world failure patterns.

HF Spaces: param20h/sql-query-optimizer

Environment Description

Real-world SQL anti-patterns cost companies millions in infrastructure. This environment teaches agents to identify and fix them through a reward-shaped episode loop. Each episode presents the agent with a broken or unoptimised query alongside schema context; the agent iteratively rewrites it until done or max steps are reached.

Why this domain?

Used by data engineers and DBAs every day
Deterministically gradeable (no ambiguous LLM judging)
Natural difficulty progression from syntax errors to multi-factor optimisation

Observation Space

Field	Type	Description
`task_id`	`int`	Task number (1–3)
`task_name`	`str`	Slug identifier
`task_description`	`str`	What the agent must accomplish
`query`	`str`	The SQL to fix
`schema_context`	`str`	Relevant DDL / table definitions
`hint`	`str \| null`	Optional hint (tasks 1 & 2 only)
`step_number`	`int`	Current step (0-indexed)
`max_steps`	`int`	Steps allowed per episode
`done`	`bool`	Whether episode has ended

Action Space

Field	Type	Description
`rewritten_query`	`str`	The agent's improved SQL
`explanation`	`str`	Brief description of changes made
`is_done`	`bool`	`true` when the agent believes the query is fully fixed

Reward Design

The reward is shaped (not sparse) — the agent receives signal every step:

Component	Value	Trigger
Delta reward	+0.0–0.50 × Δgrader	Grader score improves
Completion bonus	+0.50	`is_done=True` and grader ≥ 0.80
Partial completion	+grader × 0.30	`is_done=True` (always)
Step penalty	−0.02 / step	After halfway point, if not done
Invalid penalty	−0.10	Empty or unparseable query

Final score per step is clamped to [0.0, 1.0].

Tasks

Task 1 — `fix-broken-join` (Easy)

The query uses a comma-separated cross-join (FROM orders, customers) without any join condition, causing a Cartesian product. The agent must rewrite with INNER JOIN … ON o.customer_id = c.customer_id.

Max steps: 3 | Grader: checks JOIN keyword + ON clause with correct key

Task 2 — `eliminate-n-plus-one` (Medium)

A correlated scalar subquery in the SELECT list executes once per row (N+1 problem). The agent must collapse it into a single LEFT JOIN departments ON e.dept_id = d.dept_id.

Max steps: 4 | Grader: checks subquery removal + JOIN on dept_id

Task 3 — `full-optimization` (Hard)

Four independent issues to fix:

Remove redundant DISTINCT (PK join makes it unnecessary)
Replace SELECT * with explicit columns
Replace CAST(price AS VARCHAR) LIKE '1%' → price >= 100 AND price < 200 (sargable)
Add an index hint comment for (category, price)

Max steps: 5 | Grader: 4 × 0.25 sub-criteria, fully independent

API Endpoints

Method	Path	Description
`GET`	`/`	Health check
`POST`	`/reset`	Start episode `{ "task_id": 1 }`
`POST`	`/step`	Submit action `{ "rewritten_query": "...", "explanation": "...", "is_done": true }`
`GET`	`/state`	Current internal state
`GET`	`/tasks`	All tasks + action schema
`GET`	`/grader`	Grader score for current episode
`POST`	`/baseline`	Run baseline inference (requires `OPENAI_API_KEY`)

Interactive docs: http://localhost:7860/docs

Setup & Usage

Prerequisites

Python 3.10+
Docker
API_BASE_URL (OpenAI-compatible endpoint for inference)
MODEL_NAME (model identifier for inference)
HF_TOKEN (API key / bearer token for inference)

Local (Python)

pip install -r requirements.txt
uvicorn server:app --host 0.0.0.0 --port 7860 --reload

Local (Docker)

docker build -t sql-optimizer-env .
docker run -p 7860:7860 -e OPENAI_API_KEY=sk-... sql-optimizer-env

Baseline Inference

$env:API_BASE_URL="https://api.openai.com/v1"
$env:MODEL_NAME="gpt-4o-mini"
$env:HF_TOKEN="hf_or_openai_api_key_here"
python inference.py

OpenEnv Validation

pip install openenv-core
openenv validate

Deploy to HF Spaces

pip install huggingface_hub
huggingface-cli login
openenv push --repo-id your-username/sql-query-optimizer

Environment Configuration

Define these variables before running inference or /baseline:

$env:API_BASE_URL = "https://api.openai.com/v1"
$env:MODEL_NAME = "gpt-4o-mini"
$env:HF_TOKEN = "your_api_key"

Baseline Scores

Measured with gpt-4o-mini at temperature=0, single-pass:

Task	Name	Difficulty	Grader Score
1	fix-broken-join	Easy	0.86
2	eliminate-n-plus-one	Medium	0.72
3	full-optimization	Hard	0.50
—	Average	—	0.69

Scores are reproducible: same model, same temperature, same grader → same output.

Project Structure

metaXscaler/
├── env/
│   ├── __init__.py
│   ├── environment.py   # reset(), step(), state()
│   ├── models.py        # Observation, Action, Reward (Pydantic)
│   ├── tasks.py         # Task definitions + graders
│   └── reward.py        # Shaped reward function
├── server.py            # FastAPI app
├── baseline.py          # Baseline inference script
├── openenv.yaml         # OpenEnv spec metadata
├── Dockerfile
├── requirements.txt
└── README.md

License

MIT