Spaces:

Hodfa71
/

RetailMind

Sleeping

hodfa840 commited on 17 days ago

Commit

f69e608

0 Parent(s):

feat: Complete RetailMind overhaul — hybrid retrieval, EWMA drift detection, self-healing adapter, premium UI, tests & CI

- Rewrote catalog: 200 curated products with unique descriptions, materials, ratings, tags
- Hybrid retrieval: price-parsing + category detection + semantic re-ranking
- EWMA drift detector: smoothed concept tracking with multiple anchor phrases
- Rich adaptation rules with detailed self-healing explanations
- LLM: anti-hallucination prompt engineering with structured context injection
- Premium Gradio UI: aurora header, info callouts, score badges, star ratings
- Added pytest suite (catalog, drift, retrieval, adaptation)
- Added GitHub Actions CI (lint + test on Python 3.10-3.12)
- Recruiter-grade README with architecture diagram, technical decisions, demo walkthrough
- Security: .gitignore for secrets, .env.example for onboarding

Files changed (17) hide show

.env.example +6 -0
.github/workflows/ci.yml +48 -0
.gitignore +40 -0
README.md +197 -0
app.py +468 -0
modules/__init__.py +1 -0
modules/adaptation.py +141 -0
modules/data_simulation.py +318 -0
modules/drift.py +153 -0
modules/llm.py +95 -0
modules/retrieval.py +150 -0
requirements.txt +8 -0
tests/__init__.py +1 -0
tests/test_adaptation.py +51 -0
tests/test_catalog.py +50 -0
tests/test_drift.py +59 -0
tests/test_retrieval.py +60 -0

.env.example ADDED Viewed

	@@ -0,0 +1,6 @@

+# ── Environment Variables ──────────────────────────────────────
+# Copy this file to `.env` and fill in your values.
+# (Optional) Hugging Face API token — only needed if using gated models.
+# The default model (Qwen2.5-0.5B-Instruct) does NOT require a token.
+HF_TOKEN=hf_your_token_here

.github/workflows/ci.yml ADDED Viewed

	@@ -0,0 +1,48 @@

+name: CI
+on:
+  push:
+    branches: [main]
+  pull_request:
+    branches: [main]
+jobs:
+  test:
+    runs-on: ubuntu-latest
+    strategy:
+      matrix:
+        python-version: ["3.10", "3.11", "3.12"]
+    steps:
+      - uses: actions/checkout@v4
+      - name: Set up Python ${{ matrix.python-version }}
+        uses: actions/setup-python@v5
+        with:
+          python-version: ${{ matrix.python-version }}
+          cache: "pip"
+      - name: Install dependencies
+        run: |
+          python -m pip install --upgrade pip
+          pip install -r requirements.txt
+          pip install pytest
+      - name: Run tests
+        run: pytest tests/ -v --tb=short
+  lint:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+      - name: Set up Python
+        uses: actions/setup-python@v5
+        with:
+          python-version: "3.11"
+      - name: Install ruff
+        run: pip install ruff
+      - name: Lint
+        run: ruff check . --select E,F,W --ignore E501

.gitignore ADDED Viewed

	@@ -0,0 +1,40 @@

+# ── Secrets & tokens ───────────────────────────────────────────
+hf_token
+gh_token
+.env
+*.key
+# ── Python ─────────────────────────────────────────────────────
+__pycache__/
+*.py[cod]
+*$py.class
+*.egg-info/
+dist/
+build/
+*.egg
+# ── Virtual environments ──────────────────────────────────────
+venv/
+.venv/
+env/
+# ── IDE / Editor ──────────────────────────────────────────────
+.vscode/
+.idea/
+*.swp
+*.swo
+*~
+# ── Gradio ────────────────────────────────────────────────────
+.gradio/
+flagged/
+# ── OS ────────────────────────────────────────────────────────
+.DS_Store
+Thumbs.db
+# ── Models (don't push multi-GB weights) ─────────────────────
+*.bin
+*.safetensors
+*.gguf
+models/

README.md ADDED Viewed

	@@ -0,0 +1,197 @@

+<div align="center">
+# 🧠 RetailMind
+### Self-Healing LLM for Store Intelligence
+[![CI](https://github.com/hodfa840/-RetailMind-Self-Healing-LLM-for-Store-Intelligence/actions/workflows/ci.yml/badge.svg)](https://github.com/hodfa840/-RetailMind-Self-Healing-LLM-for-Store-Intelligence/actions)
+[![Python 3.10+](https://img.shields.io/badge/python-3.10%2B-blue?logo=python&logoColor=white)](https://python.org)
+[![Gradio](https://img.shields.io/badge/Gradio-4.0%2B-orange?logo=gradio)](https://gradio.app)
+[![License: MIT](https://img.shields.io/badge/License-MIT-green.svg)](LICENSE)
+**An autonomous e-commerce AI that detects semantic drift in user intent and self-heals its own behavior in real time — no human in the loop.**
+[Live Demo](#-quick-start) · [Architecture](#-architecture) · [How It Works](#-how-the-self-healing-loop-works) · [Technical Decisions](#-technical-decisions)
+</div>
+---
+## 🎯 What This Project Demonstrates
+| Skill | Implementation |
+|-------|---------------|
+| **MLOps / Observability** | Real-time EWMA-based drift detection with live telemetry dashboard |
+| **RAG / Information Retrieval** | Hybrid retrieval: metadata pre-filtering (price, category) + dense semantic re-ranking |
+| **Prompt Engineering** | Anti-hallucination grounding, dynamic prompt injection based on detected drift |
+| **Self-Healing Systems** | Autonomous prompt rewriting when intent distribution shifts — zero human intervention |
+| **LLM Integration** | Local Qwen2.5-0.5B inference on CPU — no API keys, no GPU, fully offline-capable |
+| **Software Engineering** | Type hints, docstrings, logging, pytest suite, CI/CD, modular architecture |
+---
+## ⚡ Architecture
+```mermaid
+graph LR
+    A["🛒 User Query"] --> B["📊 Drift Detector<br/><i>EWMA Semantic Analysis</i>"]
+    A --> C["🔍 Hybrid Retriever<br/><i>Price Filter + Dense Search</i>"]
+    B --> D["🔧 Self-Healing Adapter<br/><i>Dynamic Prompt Mutation</i>"]
+    C --> E["🤖 Local LLM<br/><i>Qwen2.5-0.5B · CPU</i>"]
+    D --> E
+    E --> F["💬 Grounded Response"]
+    B --> G["📈 Telemetry Dashboard<br/><i>Live EWMA Charts</i>"]
+```
+### Module Breakdown
+```
+RetailMind/
+├── app.py                    # Gradio UI — 3-panel dashboard
+├── modules/
+│   ├── data_simulation.py    # 200 curated products with rich metadata
+│   ├── retrieval.py          # Hybrid retriever (price-filter → semantic re-rank)
+│   ├── drift.py              # EWMA-based semantic drift detector
+│   ├── adaptation.py         # Self-healing prompt adapter
+│   └── llm.py                # Local Qwen2.5-0.5B inference engine
+├── tests/                    # pytest suite (catalog, retrieval, drift, adaptation)
+├── .github/workflows/ci.yml  # CI pipeline (lint + test on Python 3.10–3.12)
+└── requirements.txt
+```
+---
+## 🔄 How the Self-Healing Loop Works
+The system continuously monitors the **semantic similarity** between incoming queries and predefined concept anchors using an **Exponentially Weighted Moving Average (EWMA)**.
+```
+                   Normal Mode                          Drift Detected!
+                  ┌──────────┐                         ┌──────────────┐
+User asks about   │ Balanced │  EWMA crosses 0.38 →    │ Auto-Inject  │
+random products → │ Prompt   │ ──────────────────────── │ New Rules    │
+                  └──────────┘                         └──────────────┘
+                                                              │
+                  ┌──────────┐                                ▼
+                  │ LLM now  │ ◄─── Prompt mutated to prioritize
+                  │ focuses  │      price / season / sustainability
+                  │ on drift │      based on detected pattern
+                  └──────────┘
+```
+### Concept Anchors
+| Concept | Trigger Keywords | Adaptation |
+|---------|-----------------|------------|
+| 💰 **Price Sensitive** | cheap, budget, under $X, deal | Prioritize lowest-price items, highlight savings |
+| ☀️ **Summer Shift** | beach, lightweight, UV, hot weather | Surface breathable/outdoor products, suppress winter |
+| 🌿 **Eco Trend** | sustainable, recycled, organic, plant-based | Lead with eco-credentials, cite certifications |
+**Key insight:** The system doesn't just match keywords — it uses **semantic similarity** via sentence embeddings. So even a query like *"I care about the planet"* (no eco keywords) will still trigger the eco adaptation because it's semantically close to the concept anchor.
+---
+## 🔍 Hybrid Retrieval Deep Dive
+Traditional RAG uses pure semantic similarity, which fails on structured queries like *"bags under $25"*. RetailMind combines:
+1. **Price Extraction** — Regex-based NLU parses price ceilings from natural language (`"under $50"`, `"budget of $30"`, `"cheapest"`)
+2. **Category Detection** — Maps query terms to catalog categories (`"eco-friendly"` → eco, `"gym"` → sports)
+3. **Pre-Filtering** — Removes products that violate hard constraints *before* embedding search
+4. **Semantic Re-Ranking** — Cosine similarity on SentenceTransformer embeddings ranks survivors
+```python
+# Example: "eco-friendly bag under $30"
+# Step 1: price_cap = 30.0
+# Step 2: category = "eco-friendly"
+# Step 3: 200 products → ~8 candidates (eco + under $30)
+# Step 4: Rank 8 candidates by semantic similarity → top 4
+```
+---
+## 🚀 Quick Start
+### Prerequisites
+- Python 3.10+
+- ~2 GB disk space (for model weights on first run)
+### Installation
+```bash
+git clone https://github.com/hodfa840/-RetailMind-Self-Healing-LLM-for-Store-Intelligence.git
+cd -RetailMind-Self-Healing-LLM-for-Store-Intelligence
+pip install -r requirements.txt
+```
+### Run
+```bash
+python app.py
+```
+The app launches at `http://localhost:7860` with a public share link.
+### Run Tests
+```bash
+pip install pytest
+pytest tests/ -v
+```
+---
+## 🧪 Demo Walkthrough
+To see the self-healing system in action:
+1. **Phase 1 (Normal)** — Ask general product questions. The system responds in balanced mode.
+2. **Phase 2 (Black Friday)** — Click budget-oriented queries. Watch the drift chart's gold line spike above the threshold. The system auto-injects price-prioritization rules.
+3. **Phase 3 (Summer)** — Switch to summer queries. The cyan line rises, and the system pivots to warm-weather products — *without being told to*.
+4. **Phase 4 (Eco)** — Ask about sustainability. The green line triggers, and the system starts citing certifications and materials.
+> The telemetry panel on the right shows exactly what's happening under the hood — which drift was detected, what prompt rules were injected, and why.
+---
+## 🧭 Technical Decisions
+| Decision | Rationale |
+|----------|-----------|
+| **Qwen2.5-0.5B on CPU** | Eliminates API dependency, runs on any machine, no token needed. Trades quality for reliability — acceptable since grounding handles accuracy. |
+| **EWMA over raw scores** | Single-query similarity is noisy. EWMA smooths the signal so the system doesn't flip between modes on every query. α=0.35 balances reactivity with stability. |
+| **Hybrid retrieval over pure semantic** | Semantic search alone can't handle price constraints. A $200 jacket and a $20 hat may both be semantically relevant to "winter gear under $25" — only the pre-filter catches this. |
+| **SentenceTransformers (all-MiniLM-L6-v2)** | 80MB model, runs on CPU in <50ms per query. Good enough for 200-product catalog. Would swap to a larger model for production scale. |
+| **200 curated products over 1,500 generated** | Quality embeddings require quality descriptions. 200 hand-authored products with unique specs outperform 1,500 template-generated items where retrieval can't distinguish between them. |
+| **Prompt injection over fine-tuning** | Fine-tuning a 0.5B model per drift state is impractical. Dynamic prompt injection achieves the same behavioral shift with zero training cost and instant reversibility. |
+---
+## 🔮 Future Roadmap
+- [ ] **Multi-turn memory** — Track user preferences across conversation turns
+- [ ] **A/B testing framework** — Compare adapted vs. baseline responses
+- [ ] **Drift alerting** — Webhook notifications when drift exceeds critical thresholds
+- [ ] **Vector database** — Migrate from in-memory NumPy to FAISS/Qdrant for scale
+- [ ] **User feedback loop** — Incorporate thumbs-up/down into drift calibration
+---
+## 🛠️ Tech Stack
+| Component | Technology |
+|-----------|-----------|
+| UI Framework | Gradio 4.x |
+| LLM | Qwen/Qwen2.5-0.5B-Instruct (local, CPU) |
+| Embeddings | SentenceTransformers (all-MiniLM-L6-v2) |
+| Retrieval | Hybrid (NumPy cosine + metadata pre-filter) |
+| Charting | Plotly |
+| Testing | pytest |
+| CI/CD | GitHub Actions |
+| Language | Python 3.10+ with type hints |
+---
+<div align="center">
+<sub>Built by <a href="https://github.com/hodfa840">hodfa840</a> · Linköping University</sub>
+</div>

app.py ADDED Viewed

	@@ -0,0 +1,468 @@

+"""
+RetailMind — Self-Healing LLM for Store Intelligence
+Gradio application showcasing real-time semantic drift detection,
+autonomous prompt adaptation, and hybrid RAG retrieval.
+"""
+import logging
+import gradio as gr
+import plotly.graph_objects as go
+from modules.data_simulation import generate_catalog, get_scenarios
+from modules.retrieval import HybridRetriever
+from modules.drift import DriftDetector
+from modules.adaptation import Adapter
+from modules.llm import generate_response
+# ── Logging ────────────────────────────────────────────────────────────────
+logging.basicConfig(
+    level=logging.INFO,
+    format="%(asctime)s │ %(name)-24s │ %(levelname)-5s │ %(message)s",
+    datefmt="%H:%M:%S",
+)
+logger = logging.getLogger("retailmind")
+# ── Initialize components ─────────────────────────────────────────────────
+logger.info("Bootstrapping RetailMind…")
+catalog = generate_catalog()
+retriever = HybridRetriever(catalog)
+detector = DriftDetector()
+adapter = Adapter()
+scenarios = get_scenarios()
+logger.info("Ready — %d products indexed.", len(catalog))
+# ── Helper: Image mapping ─────────────────────────────────────────────────
+IMAGE_MAP = {
+    "Parka": "https://images.unsplash.com/photo-1544923246-77307dd270b5?w=400&h=300&fit=crop",
+    "Sweater": "https://images.unsplash.com/photo-1610652492500-dea0624af6ee?w=400&h=300&fit=crop",
+    "Gloves": "https://images.unsplash.com/photo-1551538827-9c037cb4f32a?w=400&h=300&fit=crop",
+    "Boots": "https://images.unsplash.com/photo-1608256246200-53e635b5b65f?w=400&h=300&fit=crop",
+    "Beanie": "https://images.unsplash.com/photo-1576871337622-98d48d1cf531?w=400&h=300&fit=crop",
+    "Fleece": "https://images.unsplash.com/photo-1591047139829-d91aecb6caea?w=400&h=300&fit=crop",
+    "Base Layer": "https://images.unsplash.com/photo-1489987707025-afc232f7ea0f?w=400&h=300&fit=crop",
+    "Vest": "https://images.unsplash.com/photo-1591047139829-d91aecb6caea?w=400&h=300&fit=crop",
+    "Sneakers": "https://images.unsplash.com/photo-1542291026-7eec264c27ff?w=400&h=300&fit=crop",
+    "Shorts": "https://images.unsplash.com/photo-1591195853828-11db59a44f6b?w=400&h=300&fit=crop",
+    "Sunglasses": "https://images.unsplash.com/photo-1511499767150-a48a237f0083?w=400&h=300&fit=crop",
+    "Linen": "https://images.unsplash.com/photo-1596755094514-f87e34085b2c?w=400&h=300&fit=crop",
+    "Sandals": "https://images.unsplash.com/photo-1603487742131-4160ec999306?w=400&h=300&fit=crop",
+    "Tank": "https://images.unsplash.com/photo-1521572163474-6864f9cf17ab?w=400&h=300&fit=crop",
+    "Hat": "https://images.unsplash.com/photo-1521369909029-2afed882baee?w=400&h=300&fit=crop",
+    "Water Shoes": "https://images.unsplash.com/photo-1542291026-7eec264c27ff?w=400&h=300&fit=crop",
+    "Backpack": "https://images.unsplash.com/photo-1553062407-98eeb64c6a62?w=400&h=300&fit=crop",
+    "Bottle": "https://images.unsplash.com/photo-1602143407151-7111542de6e8?w=400&h=300&fit=crop",
+    "Tee": "https://images.unsplash.com/photo-1521572163474-6864f9cf17ab?w=400&h=300&fit=crop",
+    "Tote": "https://images.unsplash.com/photo-1622560480605-d83c853bc5c3?w=400&h=300&fit=crop",
+    "Shoes": "https://images.unsplash.com/photo-1542291026-7eec264c27ff?w=400&h=300&fit=crop",
+    "Jacket": "https://images.unsplash.com/photo-1551028719-00167b16eac5?w=400&h=300&fit=crop",
+    "Watch": "https://images.unsplash.com/photo-1523275335684-37898b6baf30?w=400&h=300&fit=crop",
+    "Mat": "https://images.unsplash.com/photo-1544367567-0f2fcb009e0b?w=400&h=300&fit=crop",
+    "Headphones": "https://images.unsplash.com/photo-1505740420928-5e560c06d30e?w=400&h=300&fit=crop",
+    "Tracker": "https://images.unsplash.com/photo-1557438159-51eec7a6c9e8?w=400&h=300&fit=crop",
+    "Earbuds": "https://images.unsplash.com/photo-1590658268037-6bf12f032f55?w=400&h=300&fit=crop",
+    "Charger": "https://images.unsplash.com/photo-1609091839311-d5365f9ff1c5?w=400&h=300&fit=crop",
+    "Speaker": "https://images.unsplash.com/photo-1608043152269-423dbba4e7e1?w=400&h=300&fit=crop",
+    "Lamp": "https://images.unsplash.com/photo-1507473885765-e6ed057ab6fe?w=400&h=300&fit=crop",
+    "Power Bank": "https://images.unsplash.com/photo-1609091839311-d5365f9ff1c5?w=400&h=300&fit=crop",
+    "Mug": "https://images.unsplash.com/photo-1514228742587-6b1558fcca3d?w=400&h=300&fit=crop",
+    "Weekender": "https://images.unsplash.com/photo-1590874103328-eac38a683ce7?w=400&h=300&fit=crop",
+    "Overcoat": "https://images.unsplash.com/photo-1544923246-77307dd270b5?w=400&h=300&fit=crop",
+    "Wallet": "https://images.unsplash.com/photo-1627123424574-724758594e93?w=400&h=300&fit=crop",
+    "Belt": "https://images.unsplash.com/photo-1553062407-98eeb64c6a62?w=400&h=300&fit=crop",
+    "Candle": "https://images.unsplash.com/photo-1602607616777-b8fb tried?w=400&h=300&fit=crop",
+    "Blanket": "https://images.unsplash.com/photo-1555041469-a586c61ea9bc?w=400&h=300&fit=crop",
+    "Clock": "https://images.unsplash.com/photo-1563861826100-9cb868fdbe1c?w=400&h=300&fit=crop",
+    "Towel": "https://images.unsplash.com/photo-1583845112203-29329902332e?w=400&h=300&fit=crop",
+    "Hoodie": "https://images.unsplash.com/photo-1556821840-3a63f95609a7?w=400&h=300&fit=crop",
+    "Chino": "https://images.unsplash.com/photo-1473966968600-fa801b869a1a?w=400&h=300&fit=crop",
+    "Crossbody": "https://images.unsplash.com/photo-1590874103328-eac38a683ce7?w=400&h=300&fit=crop",
+    "Socks": "https://images.unsplash.com/photo-1586350977771-b3b0abd50c82?w=400&h=300&fit=crop",
+    "Basketball": "https://images.unsplash.com/photo-1546519638-68e109498ffc?w=400&h=300&fit=crop",
+    "Jersey": "https://images.unsplash.com/photo-1565299624946-b28f40a0ae38?w=400&h=300&fit=crop",
+    "Cushion": "https://images.unsplash.com/photo-1555041469-a586c61ea9bc?w=400&h=300&fit=crop",
+    "Planter": "https://images.unsplash.com/photo-1459411552884-841db9b3cc2a?w=400&h=300&fit=crop",
+    "Organizer": "https://images.unsplash.com/photo-1507473885765-e6ed057ab6fe?w=400&h=300&fit=crop",
+    "Pour-Over": "https://images.unsplash.com/photo-1495474472287-4d71bcdd2085?w=400&h=300&fit=crop",
+}
+DEFAULT_IMG = "https://images.unsplash.com/photo-1472851294608-062f124dcb02?w=400&h=300&fit=crop"
+def _get_product_image(title: str) -> str:
+    """Map product title → curated Unsplash photo."""
+    for key, url in IMAGE_MAP.items():
+        if key.lower() in title.lower():
+            return url
+    return DEFAULT_IMG
+# ── Plotly drift chart ────────────────────────────────────────────────────
+def _plot_drift() -> go.Figure:
+    series = detector.get_history_series()
+    ewma = detector.get_ewma_scores()
+    fig = go.Figure()
+    colors = {"price_sensitive": "#f59e0b", "summer_shift": "#06b6d4", "eco_trend": "#10b981"}
+    labels = {"price_sensitive": "Price Sensitivity", "summer_shift": "Summer Shift", "eco_trend": "Eco Trend"}
+    for concept in series:
+        data = series[concept][-30:]  # last 30 data points
+        fig.add_trace(go.Scatter(
+            y=data,
+            mode="lines",
+            name=labels.get(concept, concept),
+            line=dict(color=colors.get(concept, "#fff"), width=2.5, shape="spline"),
+            fill="tozeroy",
+            fillcolor=colors.get(concept, "#fff").replace(")", ", 0.08)").replace("rgb", "rgba") if "rgb" in colors.get(concept, "") else f"rgba(255,255,255,0.05)",
+        ))
+    # Threshold line
+    fig.add_hline(y=0.38, line_dash="dot", line_color="rgba(255,255,255,0.3)",
+                  annotation_text="Threshold", annotation_font_color="rgba(255,255,255,0.4)")
+    fig.update_layout(
+        height=240,
+        margin=dict(l=0, r=0, t=10, b=0),
+        plot_bgcolor="rgba(0,0,0,0)",
+        paper_bgcolor="rgba(0,0,0,0)",
+        font=dict(color="#94a3b8", size=11),
+        legend=dict(orientation="h", yanchor="bottom", y=1.02, xanchor="center", x=0.5,
+                    font=dict(size=10)),
+        xaxis=dict(showgrid=False, showticklabels=False),
+        yaxis=dict(showgrid=True, gridwidth=1, gridcolor="rgba(255,255,255,0.06)",
+                   range=[0, 0.8]),
+    )
+    return fig
+# ── Product cards HTML ────────────────────────────────────────────────────
+def _build_product_html(retrieved: list[dict]) -> str:
+    if not retrieved:
+        return _empty_catalog_html()
+    cards = []
+    for r in retrieved:
+        p = r["product"]
+        score = r["score"]
+        img = _get_product_image(p["title"])
+        stars_full = int(p.get("rating", 4))
+        stars_html = "★" * stars_full + "☆" * (5 - stars_full)
+        reviews = p.get("reviews", 0)
+        score_pct = int(score * 100)
+        tags_html = "".join(
+            f"<span style='background:rgba(99,102,241,0.15); color:#818cf8; padding:2px 8px; "
+            f"border-radius:20px; font-size:10px; margin-right:4px;'>{t}</span>"
+            for t in p.get("tags", [])[:3]
+        )
+        cards.append(f"""
+        <div style='background:rgba(255,255,255,0.03); border:1px solid rgba(255,255,255,0.08);
+                     border-radius:16px; overflow:hidden; transition:all 0.3s ease;
+                     box-shadow:0 4px 20px rgba(0,0,0,0.3);'>
+            <div style='position:relative;'>
+                <img src='{img}' style='width:100%; height:150px; object-fit:cover;
+                     border-bottom:1px solid rgba(255,255,255,0.06);'
+                     onerror="this.src='{DEFAULT_IMG}'" />
+                <div style='position:absolute; top:8px; right:8px; background:rgba(0,0,0,0.75);
+                     color:#f8fafc; padding:3px 10px; border-radius:20px; font-size:13px;
+                     font-weight:700; backdrop-filter:blur(8px);
+                     border:1px solid rgba(255,255,255,0.15);'>
+                    ${p['price']:.2f}
+                </div>
+                <div style='position:absolute; top:8px; left:8px; background:rgba(99,102,241,0.85);
+                     color:white; padding:2px 8px; border-radius:20px; font-size:10px;
+                     font-weight:600; letter-spacing:0.5px;'>
+                    {score_pct}% match
+                </div>
+            </div>
+            <div style='padding:14px;'>
+                <div style='color:#f1f5f9; font-size:14px; font-weight:600;
+                     margin-bottom:4px; line-height:1.3;'>{p['title']}</div>
+                <div style='display:flex; align-items:center; gap:6px; margin-bottom:6px;'>
+                    <span style='color:#fbbf24; font-size:12px; letter-spacing:1px;'>{stars_html}</span>
+                    <span style='color:#64748b; font-size:11px;'>({reviews:,})</span>
+                </div>
+                <div style='margin-bottom:8px;'>{tags_html}</div>
+                <p style='color:#94a3b8; font-size:12px; line-height:1.4; margin:0;'>
+                    {p['desc'][:100]}…
+                </p>
+            </div>
+        </div>
+        """)
+    return f"""
+    <div style='display:grid; grid-template-columns:1fr 1fr; gap:16px; padding:8px;'>
+        {''.join(cards)}
+    </div>
+    """
+def _empty_catalog_html() -> str:
+    return """
+    <div style='padding:60px 30px; text-align:center; color:#475569;
+                border:2px dashed rgba(255,255,255,0.08); border-radius:20px; margin:16px;'>
+        <div style='font-size:2.5rem; margin-bottom:12px;'>🛍️</div>
+        <div style='font-size:1.1rem; font-weight:500; color:#64748b;'>Awaiting your query…</div>
+        <div style='font-size:0.85rem; color:#475569; margin-top:6px;'>
+            Try a scenario below or type your own question
+        </div>
+    </div>
+    """
+# ── Main query handler ────────────────────────────────────────────────────
+def process_query(query: str, history: list):
+    if not query or not query.strip():
+        return "", history, _plot_drift(), "", "—", _empty_catalog_html()
+    logger.info("Processing query: %r", query)
+    # 1. Measure drift
+    drift_state, scores = detector.analyze_drift(query)
+    # 2. Retrieve products (hybrid: price-filter + semantic)
+    retrieved = retriever.search(query, top_k=4)
+    # 3. Adapt system prompt
+    system_prompt = adapter.adapt_prompt(drift_state)
+    explanation = adapter.get_explanation(drift_state)
+    label = adapter.get_label(drift_state)
+    # 4. Generate LLM response
+    response = generate_response(system_prompt, query, retrieved)
+    history = history or []
+    history.append({"role": "user", "content": query})
+    history.append({"role": "assistant", "content": response})
+    return "", history, _plot_drift(), explanation, label, _build_product_html(retrieved)
+def load_example(example_text: str) -> str:
+    return example_text
+# ══════════════════════════════════════════════════════════════════════════
+# UI Definition
+# ══════════════════════════════════════════════════════════════════════════
+css = """
+@import url('https://fonts.googleapis.com/css2?family=Inter:wght@300;400;500;600;700;800&display=swap');
+body, .gradio-container {
+    font-family: 'Inter', system-ui, -apple-system, sans-serif !important;
+    background: #0a0f1a !important;
+}
+/* Header */
+.hero-header {
+    text-align: center;
+    padding: 2.5rem 2rem 1.5rem;
+    background: linear-gradient(135deg, rgba(15,23,42,0.95) 0%, rgba(30,41,59,0.6) 50%, rgba(15,23,42,0.95) 100%);
+    border-radius: 24px;
+    border: 1px solid rgba(255,255,255,0.06);
+    box-shadow: 0 25px 60px rgba(0,0,0,0.5);
+    position: relative;
+    overflow: hidden;
+    margin-bottom: 1.5rem;
+}
+.hero-header::before {
+    content: '';
+    position: absolute;
+    top: -50%;
+    left: -50%;
+    width: 200%;
+    height: 200%;
+    background: radial-gradient(circle at 30% 50%, rgba(99,102,241,0.08) 0%, transparent 50%),
+                radial-gradient(circle at 70% 50%, rgba(6,182,212,0.06) 0%, transparent 50%);
+    animation: aurora 8s ease-in-out infinite alternate;
+}
+@keyframes aurora {
+    0% { transform: translate(0, 0) rotate(0deg); }
+    100% { transform: translate(-5%, 5%) rotate(3deg); }
+}
+.hero-title {
+    font-size: 2.8rem;
+    font-weight: 800;
+    background: linear-gradient(135deg, #818cf8 0%, #06b6d4 50%, #10b981 100%);
+    -webkit-background-clip: text;
+    -webkit-text-fill-color: transparent;
+    margin: 0;
+    position: relative;
+    letter-spacing: -0.5px;
+}
+.hero-sub {
+    color: #64748b;
+    font-size: 0.95rem;
+    letter-spacing: 3px;
+    text-transform: uppercase;
+    font-weight: 500;
+    margin-top: 0.5rem;
+    position: relative;
+}
+.hero-badges {
+    display: flex;
+    justify-content: center;
+    gap: 12px;
+    margin-top: 1rem;
+    position: relative;
+    flex-wrap: wrap;
+}
+.hero-badge {
+    background: rgba(255,255,255,0.04);
+    border: 1px solid rgba(255,255,255,0.08);
+    color: #94a3b8;
+    padding: 4px 14px;
+    border-radius: 20px;
+    font-size: 0.75rem;
+    font-weight: 500;
+    letter-spacing: 0.5px;
+}
+/* Panels */
+.glass-panel {
+    background: rgba(15, 23, 42, 0.6) !important;
+    border: 1px solid rgba(255,255,255,0.06) !important;
+    border-radius: 20px !important;
+    backdrop-filter: blur(12px) !important;
+}
+/* Scenario pills */
+.scenario-row { display: flex; gap: 8px; flex-wrap: wrap; margin-top: 8px; }
+/* Section headers */
+.panel-header {
+    color: #e2e8f0;
+    font-size: 1rem;
+    font-weight: 600;
+    padding: 14px 16px 8px;
+    display: flex;
+    align-items: center;
+    gap: 8px;
+}
+/* Info box */
+.info-callout {
+    background: rgba(99,102,241,0.08);
+    border: 1px solid rgba(99,102,241,0.2);
+    border-radius: 12px;
+    padding: 12px 16px;
+    color: #a5b4fc;
+    font-size: 0.8rem;
+    line-height: 1.5;
+    margin: 8px 12px;
+}
+/* Hide Gradio footer */
+footer { display: none !important; }
+"""
+with gr.Blocks(css=css, theme=gr.themes.Base(), title="RetailMind — Self-Healing AI") as app:
+    # ── Header ────────────────────────────────────────────────────
+    gr.HTML("""
+    <div class="hero-header">
+        <h1 class="hero-title">RetailMind</h1>
+        <p class="hero-sub">Self-Healing LLM · Store Intelligence</p>
+        <div class="hero-badges">
+            <span class="hero-badge">🧠 Semantic Drift Detection</span>
+            <span class="hero-badge">🔄 Autonomous Prompt Healing</span>
+            <span class="hero-badge">🔍 Hybrid RAG Retrieval</span>
+            <span class="hero-badge">📊 Real-Time Telemetry</span>
+        </div>
+    </div>
+    """)
+    with gr.Row():
+        # ── LEFT: Chat Panel ─────────────────────────────────────
+        with gr.Column(scale=4, elem_classes=["glass-panel"]):
+            gr.HTML("<div class='panel-header'>💬 AI Shopping Assistant</div>")
+            chatbot = gr.Chatbot(
+                height=420,
+                container=False,
+                show_copy_button=True,
+                placeholder="Ask me about products, deals, or seasonal picks…",
+            )
+            with gr.Row():
+                msg = gr.Textbox(
+                    placeholder="e.g. Find me eco-friendly running shoes under $120…",
+                    show_label=False,
+                    container=False,
+                    scale=8,
+                )
+                submit = gr.Button("Search", variant="primary", scale=2)
+            gr.HTML("""
+            <div class='info-callout'>
+                💡 <b>Demo tip:</b> Click the scenario buttons below in order
+                (Phase 1 → 4) to watch the system detect intent drift and
+                autonomously heal its behavior in real time.
+            </div>
+            """)
+            for scenario_name, queries in scenarios.items():
+                with gr.Accordion(scenario_name, open=False):
+                    for q in queries:
+                        btn = gr.Button(q, size="sm", variant="secondary")
+                        btn.click(fn=load_example, inputs=btn, outputs=msg)
+        # ── MIDDLE: Product Feed ─────────────────────────────────
+        with gr.Column(scale=4, elem_classes=["glass-panel"]):
+            gr.HTML("<div class='panel-header'>🛍️ Retrieved Products</div>")
+            retrieved_box = gr.HTML(value=_empty_catalog_html())
+        # ── RIGHT: MLOps Telemetry ───────────────────────────────
+        with gr.Column(scale=3, elem_classes=["glass-panel"]):
+            gr.HTML("<div class='panel-header'>⚡ MLOps Telemetry</div>")
+            current_phase = gr.Textbox(
+                label="Active Semantic State",
+                value="⚖️ Balanced Mode",
+                interactive=False,
+            )
+            drift_plot = gr.Plot(value=_plot_drift())
+            gr.HTML("""
+            <div class='info-callout'>
+                📈 The chart above tracks <b>EWMA-smoothed</b> semantic
+                similarity between user queries and concept anchors
+                (price, season, eco). When a line crosses the dotted
+                threshold, the system <b>autonomously rewrites</b> its
+                own instructions.
+            </div>
+            """)
+            gr.HTML("<div class='panel-header'>🧠 Self-Healing Log</div>")
+            explanation_box = gr.Textbox(
+                label="Adaptation Status",
+                interactive=False,
+                lines=6,
+                value=(
+                    "📊 System Status: Normal\n"
+                    "━━━━━━━━━━━━━━━━━━━━━━━━━━\n"
+                    "No significant drift detected.\n"
+                    "System prompt: Default balanced mode.\n"
+                    "All EWMA concept scores below threshold (0.38)."
+                ),
+            )
+    # ── Event wiring ──────────────────────────────────────────────
+    submit.click(
+        process_query,
+        inputs=[msg, chatbot],
+        outputs=[msg, chatbot, drift_plot, explanation_box, current_phase, retrieved_box],
+    )
+    msg.submit(
+        process_query,
+        inputs=[msg, chatbot],
+        outputs=[msg, chatbot, drift_plot, explanation_box, current_phase, retrieved_box],
+    )
+if __name__ == "__main__":
+    app.launch(server_name="0.0.0.0", share=True)

modules/__init__.py ADDED Viewed

	@@ -0,0 +1 @@


1	+ """RetailMind modules."""

modules/adaptation.py ADDED Viewed

	@@ -0,0 +1,141 @@

+"""
+Self-healing prompt adapter for RetailMind.
+Dynamically rewrites the LLM system prompt based on detected semantic drift.
+This is the "self-healing" core — the system adapts its behavior in real time
+without human intervention when it detects shifting user intent patterns.
+"""
+from __future__ import annotations
+import logging
+from dataclasses import dataclass
+logger = logging.getLogger(__name__)
+_BASE_PROMPT = (
+    "You are RetailMind, a knowledgeable and friendly AI shopping assistant for "
+    "an online retail store. You help customers find the perfect products from "
+    "our catalog.\n\n"
+    "RULES:\n"
+    "1. ONLY recommend products that appear in the 'Available Inventory' below.\n"
+    "2. Always mention the exact product name and price.\n"
+    "3. Keep responses concise (3–5 sentences) but helpful.\n"
+    "4. If a product matches the customer's needs, explain WHY it's a good fit.\n"
+    "5. Never invent products that aren't in the inventory list."
+)
+@dataclass
+class AdaptationRule:
+    """A single self-healing rule triggered by a drift concept."""
+    concept: str
+    label: str
+    prompt_injection: str
+    explanation: str
+# Pre-defined adaptation rules — each maps a drift signal to a prompt mutation
+_RULES: dict[str, AdaptationRule] = {
+    "price_sensitive": AdaptationRule(
+        concept="price_sensitive",
+        label="💰 Price-Sensitive Mode",
+        prompt_injection=(
+            "\n\n⚠️ ACTIVE ADAPTATION — PRICE SENSITIVITY DETECTED:\n"
+            "Customer intent analysis shows strong budget-consciousness. "
+            "You MUST:\n"
+            "• Lead with the cheapest matching products first.\n"
+            "• Explicitly state the price and any savings.\n"
+            "• Compare price-to-value across options.\n"
+            "• Mention if an item is the lowest-priced in its category."
+        ),
+        explanation=(
+            "🔧 Self-Healing Activated\n"
+            "━━━━━━━━━━━━━━━━━━━━━━━━━━\n"
+            "Signal: Price-sensitive keyword drift detected (budget, cheap, under $X)\n"
+            "Action: Injected price-prioritization directives into system prompt\n"
+            "Effect: LLM now ranks by price-to-value instead of general relevance\n"
+            "Trigger: EWMA score exceeded threshold (0.38)"
+        ),
+    ),
+    "summer_shift": AdaptationRule(
+        concept="summer_shift",
+        label="☀️ Summer Season Mode",
+        prompt_injection=(
+            "\n\n⚠️ ACTIVE ADAPTATION — SEASONAL SHIFT DETECTED:\n"
+            "Query patterns indicate a seasonal shift toward summer. "
+            "You MUST:\n"
+            "• Prioritize lightweight, breathable, and warm-weather products.\n"
+            "• Highlight UV protection and heat-management features.\n"
+            "• De-prioritize winter and cold-weather items.\n"
+            "• Mention materials suited for hot climates (linen, mesh, moisture-wicking)."
+        ),
+        explanation=(
+            "🔧 Self-Healing Activated\n"
+            "━━━━━━━━━━━━━━━━━━━━━━━━━━\n"
+            "Signal: Seasonal semantic shift detected (summer, beach, UV, lightweight)\n"
+            "Action: Injected warm-weather prioritization into system prompt\n"
+            "Effect: LLM now filters for breathable materials and summer categories\n"
+            "Trigger: EWMA score exceeded threshold (0.38)"
+        ),
+    ),
+    "eco_trend": AdaptationRule(
+        concept="eco_trend",
+        label="🌿 Eco-Conscious Mode",
+        prompt_injection=(
+            "\n\n⚠️ ACTIVE ADAPTATION — SUSTAINABILITY TREND DETECTED:\n"
+            "User intent strongly favors eco-friendly products. "
+            "You MUST:\n"
+            "• Lead with recycled, organic, and plant-based items.\n"
+            "• Highlight environmental certifications (GOTS, OEKO-TEX).\n"
+            "• Explain the sustainability story behind each recommendation.\n"
+            "• Mention materials: recycled ocean plastic, organic cotton, bamboo, cork."
+        ),
+        explanation=(
+            "🔧 Self-Healing Activated\n"
+            "━━━━━━━━━━━━━━━━━━━━━━━━━━\n"
+            "Signal: Eco-conscious trend detected (sustainable, recycled, organic)\n"
+            "Action: Injected sustainability-first directives into system prompt\n"
+            "Effect: LLM now leads with eco-credentials and material sourcing\n"
+            "Trigger: EWMA score exceeded threshold (0.38)"
+        ),
+    ),
+}
+_NORMAL_EXPLANATION = (
+    "📊 System Status: Normal\n"
+    "━━━━━━━━━━━━━━━━━━━━━━━━━━\n"
+    "No significant drift detected in user intent patterns.\n"
+    "System prompt: Default balanced recommendation mode.\n"
+    "All EWMA concept scores below threshold (0.38)."
+)
+class Adapter:
+    """Stateless prompt adapter — maps drift signals to prompt mutations."""
+    def __init__(self) -> None:
+        self.base_prompt: str = _BASE_PROMPT
+        self._active_rule: AdaptationRule | None = None
+    def adapt_prompt(self, drift_state: str) -> str:
+        """Return the adapted system prompt for the current drift state."""
+        rule = _RULES.get(drift_state)
+        self._active_rule = rule
+        if rule:
+            logger.info("Adaptation triggered: %s", rule.label)
+            return self.base_prompt + rule.prompt_injection
+        return self.base_prompt + "\n\nProvide balanced recommendations covering a mix of features, prices, and styles."
+    def get_explanation(self, drift_state: str) -> str:
+        """Human-readable explanation of what the adapter did and why."""
+        rule = _RULES.get(drift_state)
+        return rule.explanation if rule else _NORMAL_EXPLANATION
+    def get_label(self, drift_state: str) -> str:
+        """Short UI label for the active state."""
+        rule = _RULES.get(drift_state)
+        return rule.label if rule else "⚖️ Balanced Mode"

modules/data_simulation.py ADDED Viewed

	@@ -0,0 +1,318 @@

+"""
+Synthetic product catalog generator for RetailMind.
+Generates a curated catalog of ~200 realistic e-commerce products with rich
+descriptions, material specs, star ratings, and semantic tags — designed to
+produce high-quality embeddings for dense retrieval.
+"""
+import random
+from typing import TypedDict
+random.seed(42)  # Reproducible catalog across sessions
+class Product(TypedDict):
+    id: int
+    title: str
+    category: str
+    price: float
+    desc: str
+    tags: list[str]
+    rating: float
+    reviews: int
+    materials: str
+# ---------------------------------------------------------------------------
+# Hand-authored product templates — each with unique, embedding-rich content
+# ---------------------------------------------------------------------------
+_TEMPLATES: list[dict] = [
+    # ── Winter ──────────────────────────────────────────────────────────────
+    {"title": "Alpine Pro Insulated Parka", "category": "winter", "price": 189.99,
+     "desc": "Engineered for sub-zero temperatures with 700-fill goose down insulation and a waterproof shell. Features an adjustable storm hood, internal media pocket, and reflective accents for low-light visibility. Wind-rated to -30°F.",
+     "tags": ["waterproof", "insulated", "cold-weather", "outdoor"], "materials": "Nylon ripstop shell, goose down fill"},
+    {"title": "Fireside Merino Wool Sweater", "category": "winter", "price": 79.99,
+     "desc": "A classic crewneck knit from ultra-soft 100% merino wool. Breathable yet warm, perfect for layering or wearing solo by the fire. Naturally odor-resistant and temperature-regulating.",
+     "tags": ["wool", "layering", "classic", "cozy"], "materials": "100% Merino wool"},
+    {"title": "Glacier Grip Thermal Gloves", "category": "winter", "price": 34.99,
+     "desc": "Touchscreen-compatible thermal gloves with silicone grip palms. Fleece-lined interior keeps hands warm while conductive fingertips let you use your phone without exposing skin to the cold.",
+     "tags": ["touchscreen", "thermal", "cold-weather", "tech-friendly"], "materials": "Polyester fleece, silicone grip, conductive thread"},
+    {"title": "Blizzard Shield Snow Boots", "category": "winter", "price": 149.99,
+     "desc": "Heavy-duty winter boots with Thinsulate insulation and Vibram Arctic Grip outsoles. Sealed seams and a gusseted tongue keep snow and slush out. Comfort-rated to -40°F.",
+     "tags": ["waterproof", "insulated", "snow", "hiking"], "materials": "Full-grain leather, Thinsulate, Vibram sole"},
+    {"title": "Nordic Knit Beanie", "category": "winter", "price": 24.99,
+     "desc": "Double-layer acrylic knit beanie with a fleece headband liner. Classic Nordic pattern adds style while the snug fit traps heat. One size fits most.",
+     "tags": ["knit", "warm", "casual", "unisex"], "materials": "Acrylic knit, polyester fleece liner"},
+    {"title": "Summit Fleece Pullover", "category": "winter", "price": 64.99,
+     "desc": "Mid-weight microfleece pullover ideal for layering under a shell or wearing on cool autumn mornings. Quarter-zip design, chin guard, and zippered chest pocket.",
+     "tags": ["fleece", "layering", "outdoor", "mid-weight"], "materials": "100% recycled polyester microfleece"},
+    {"title": "Thermal Base Layer Set", "category": "winter", "price": 54.99,
+     "desc": "Moisture-wicking thermal top and leggings designed as a first layer for skiing, snowboarding, or cold commutes. Flatlock seams prevent chafing during all-day wear.",
+     "tags": ["base-layer", "moisture-wicking", "skiing", "thermal"], "materials": "Merino-synthetic blend"},
+    {"title": "Expedition Down Vest", "category": "winter", "price": 109.99,
+     "desc": "Packable 650-fill down vest that compresses into its own pocket. Provides core warmth without restricting arm movement — perfect for active winter pursuits or travel.",
+     "tags": ["packable", "down", "layering", "travel"], "materials": "Water-resistant nylon, 650-fill duck down"},
+    # ── Summer ──────────────────────────────────────────────────────────────
+    {"title": "Breeze Runner Mesh Sneakers", "category": "summer", "price": 89.99,
+     "desc": "Ultra-breathable mesh upper with a responsive foam midsole. Weighs just 7.2 oz per shoe, making them ideal for hot-weather runs, gym sessions, or all-day wear in the heat.",
+     "tags": ["breathable", "lightweight", "running", "mesh"], "materials": "Engineered mesh upper, EVA foam midsole"},
+    {"title": "Pacific Coast Board Shorts", "category": "summer", "price": 39.99,
+     "desc": "Quick-dry board shorts with a 4-way stretch waistband and secure zip pocket. UPF 50+ sun protection fabric keeps you safe from UV rays during long beach days.",
+     "tags": ["quick-dry", "UPF", "beach", "swim"], "materials": "Recycled polyester, elastane blend"},
+    {"title": "Solaris UV Shield Sunglasses", "category": "summer", "price": 59.99,
+     "desc": "Polarized lenses with 100% UV400 protection in a lightweight titanium frame. Anti-glare coating reduces eye strain on bright days. Comes with a hard-shell carrying case.",
+     "tags": ["polarized", "UV-protection", "lightweight", "outdoor"], "materials": "Titanium frame, polarized polycarbonate lenses"},
+    {"title": "Coastal Breeze Linen Shirt", "category": "summer", "price": 49.99,
+     "desc": "Relaxed-fit linen button-down that stays cool in 90°F+ heat. Garment-dyed for a lived-in look. Perfect from boardwalk brunch to sunset cocktails.",
+     "tags": ["linen", "breathable", "casual", "warm-weather"], "materials": "100% French linen"},
+    {"title": "Reef Walker Sandals", "category": "summer", "price": 44.99,
+     "desc": "Contoured footbed sandals with arch support and a rugged outsole. Synthetic nubuck straps adjust for a custom fit. Great for beach walks, pool decks, and casual summer outings.",
+     "tags": ["sandals", "arch-support", "beach", "casual"], "materials": "Synthetic nubuck, molded EVA footbed"},
+    {"title": "Tropic Mesh Tank Top", "category": "summer", "price": 22.99,
+     "desc": "Lightweight mesh-back tank with moisture-wicking fabric that keeps you dry during hot workouts or humid commutes. Flatlock seams and a relaxed hem for all-day comfort.",
+     "tags": ["moisture-wicking", "gym", "breathable", "lightweight"], "materials": "Polyester-spandex blend"},
+    {"title": "Sun Shield Wide Brim Hat", "category": "summer", "price": 34.99,
+     "desc": "UPF 50+ wide-brim sun hat with an adjustable chin cord and mesh ventilation panels. Floats in water and packs flat for travel. Essential protection for hiking, fishing, and gardening.",
+     "tags": ["UPF", "sun-protection", "outdoor", "packable"], "materials": "Nylon with mesh vents"},
+    {"title": "Aqua Sport Water Shoes", "category": "summer", "price": 29.99,
+     "desc": "Drainage-port water shoes with a grippy rubber sole for rocky beaches and river crossings. Neoprene collar prevents sand entry. Dries in under an hour.",
+     "tags": ["water-shoes", "quick-dry", "beach", "outdoor"], "materials": "Mesh, neoprene, rubber outsole"},
+    # ── Eco-Friendly ────────────────────────────────────────────────────────
+    {"title": "EcoLoop Recycled Backpack", "category": "eco-friendly", "price": 74.99,
+     "desc": "Made from 20 recycled ocean-bound plastic bottles. Features a padded laptop sleeve, water-resistant coating, and ergonomic shoulder straps. Every purchase funds 1 lb of ocean cleanup.",
+     "tags": ["recycled", "ocean-plastic", "sustainable", "laptop"], "materials": "Recycled RPET fabric, plant-based waterproof coating"},
+    {"title": "Bamboo Hydration Bottle", "category": "eco-friendly", "price": 28.99,
+     "desc": "Double-wall vacuum insulated bottle with a natural bamboo cap and silicone seal. Keeps drinks cold for 24 hours or hot for 12. BPA-free, plastic-free, and designed to last a lifetime.",
+     "tags": ["bamboo", "BPA-free", "insulated", "reusable"], "materials": "18/8 stainless steel, bamboo lid"},
+    {"title": "Organic Cotton Classic Tee", "category": "eco-friendly", "price": 32.99,
+     "desc": "GOTS-certified organic cotton tee dyed with low-impact, water-saving dyes. Pre-shrunk ring-spun cotton feels buttery soft from the first wear. Fair Trade certified production.",
+     "tags": ["organic", "fair-trade", "GOTS-certified", "cotton"], "materials": "100% GOTS organic cotton"},
+    {"title": "Hemp Canvas Tote Bag", "category": "eco-friendly", "price": 19.99,
+     "desc": "Durable hemp canvas tote that replaces 700 single-use plastic bags in its lifetime. Reinforced seams, interior pocket, and long handles for comfortable shoulder carry.",
+     "tags": ["hemp", "reusable", "sustainable", "zero-waste"], "materials": "Organic hemp canvas"},
+    {"title": "Plant-Based Running Shoes", "category": "eco-friendly", "price": 119.99,
+     "desc": "The upper is woven from eucalyptus fiber, the midsole from sugarcane-based EVA, and the outsole from natural rubber. Carbon-negative manufacturing. Feels like running on clouds.",
+     "tags": ["plant-based", "carbon-negative", "running", "vegan"], "materials": "Eucalyptus fiber, sugarcane EVA, natural rubber"},
+    {"title": "Recycled Denim Jacket", "category": "eco-friendly", "price": 89.99,
+     "desc": "Classic trucker jacket made from 100% post-consumer recycled denim. Each jacket diverts 1.5 lbs of textile waste from landfills. Stone-washed finish with brass buttons.",
+     "tags": ["recycled", "denim", "upcycled", "sustainable"], "materials": "100% recycled post-consumer denim"},
+    {"title": "Solar-Powered Watch", "category": "eco-friendly", "price": 159.99,
+     "desc": "Never needs a battery — charges via any light source. Sapphire crystal face, titanium case, and a strap made from recycled ocean plastic. Water-resistant to 100 meters.",
+     "tags": ["solar", "recycled", "titanium", "water-resistant"], "materials": "Titanium, sapphire crystal, recycled ocean-plastic strap"},
+    {"title": "Cork Yoga Mat", "category": "eco-friendly", "price": 64.99,
+     "desc": "Harvested from sustainable cork oak forests without harming the tree. Non-slip surface improves grip when wet. Antimicrobial naturally. Backed with natural rubber for cushioning.",
+     "tags": ["cork", "sustainable", "yoga", "non-toxic"], "materials": "Natural cork, natural rubber backing"},
+    # ── Sports & Fitness ────────────────────────────────────────────────────
+    {"title": "ProPulse Running Shoes", "category": "sports", "price": 129.99,
+     "desc": "Carbon-plate racing shoes with a nitrogen-infused midsole for maximum energy return. Engineered mesh upper weighs just 6.5 oz. Designed for 5K to marathon distances.",
+     "tags": ["carbon-plate", "racing", "lightweight", "marathon"], "materials": "Engineered mesh, carbon fiber plate, nitrogen foam"},
+    {"title": "FlexCore Training Shorts", "category": "sports", "price": 44.99,
+     "desc": "4-way stretch training shorts with a built-in compression liner and three secure pockets. Sweat-wicking DryFit fabric keeps you cool through HIIT, lifting, and sprints.",
+     "tags": ["training", "compression", "moisture-wicking", "gym"], "materials": "Polyester-elastane with DryFit technology"},
+    {"title": "IronGrip Fitness Watch", "category": "sports", "price": 199.99,
+     "desc": "GPS-enabled multisport watch with heart rate monitoring, VO2 max estimation, and 14-day battery life. Tracks 30+ activities including swimming (waterproof to 50m). Syncs with Strava.",
+     "tags": ["GPS", "heart-rate", "waterproof", "multisport"], "materials": "Fiber-reinforced polymer case, silicone band"},
+    {"title": "Thunder Strike Basketball", "category": "sports", "price": 34.99,
+     "desc": "Official size and weight composite leather basketball with deep channel design for superior grip. Indoor/outdoor rated with a butyl bladder for consistent air retention.",
+     "tags": ["basketball", "indoor-outdoor", "official-size", "grip"], "materials": "Composite leather, butyl rubber bladder"},
+    {"title": "Velocity Compression Tights", "category": "sports", "price": 59.99,
+     "desc": "Graduated compression tights that boost blood circulation and reduce muscle fatigue during long runs. Reflective logos for night visibility. Flatlock seams prevent chafing.",
+     "tags": ["compression", "running", "reflective", "recovery"], "materials": "Nylon-spandex compression fabric"},
+    {"title": "PowerLift Training Gloves", "category": "sports", "price": 27.99,
+     "desc": "Ventilated weightlifting gloves with padded leather palms and adjustable wrist wraps. Reduces calluses while maintaining bar feel. Pull-tab for easy removal between sets.",
+     "tags": ["weightlifting", "gym", "padded", "grip"], "materials": "Genuine leather palm, mesh back, neoprene wrist wrap"},
+    {"title": "AeroFlow Cycling Jersey", "category": "sports", "price": 74.99,
+     "desc": "Full-zip cycling jersey with three rear pockets and a silicone gripper hem. Italian mesh side panels maximize airflow on climbs. Sublimation-printed — colors won't fade or peel.",
+     "tags": ["cycling", "breathable", "lightweight", "performance"], "materials": "Italian polyester mesh blend"},
+    {"title": "Endurance Hydration Pack", "category": "sports", "price": 49.99,
+     "desc": "Lightweight 2L hydration vest designed for trail running. Bite valve with on/off switch, front stash pockets for gels, and a bounce-free fit that adjusts with dual sternum straps.",
+     "tags": ["hydration", "trail-running", "lightweight", "outdoor"], "materials": "Ripstop nylon, BPA-free reservoir"},
+    # ── Electronics & Tech ──────────────────────────────────────────────────
+    {"title": "AuraBeats Studio Headphones", "category": "electronics", "price": 249.99,
+     "desc": "Active noise cancelling over-ear headphones with 40mm custom drivers and 30-hour battery life. Adaptive EQ auto-tunes to your ear shape. Features multipoint Bluetooth for switching between laptop and phone.",
+     "tags": ["ANC", "wireless", "bluetooth", "noise-cancelling"], "materials": "Memory foam cushions, anodized aluminum, protein leather"},
+    {"title": "NovaBand Fitness Tracker", "category": "electronics", "price": 49.99,
+     "desc": "Slim fitness band with AMOLED display, continuous heart rate monitoring, sleep tracking, and SpO2 sensor. 10-day battery life and swim-proof to 50 meters. Weighs just 22 grams.",
+     "tags": ["fitness-tracker", "AMOLED", "heart-rate", "waterproof"], "materials": "Polycarbonate case, silicone band"},
+    {"title": "TrueWireless Pro Earbuds", "category": "electronics", "price": 129.99,
+     "desc": "In-ear ANC earbuds with transparency mode and spatial audio support. 6-hour playtime per charge, 24 hours total with the wireless charging case. IPX5 sweat-resistant for workouts.",
+     "tags": ["ANC", "earbuds", "wireless", "spatial-audio"], "materials": "Medical-grade silicone tips, matte plastic shell"},
+    {"title": "Portable Solar Charger Panel", "category": "electronics", "price": 69.99,
+     "desc": "Foldable 21W solar panel with dual USB-A and USB-C outputs. Charges a phone in ~2.5 hours of direct sunlight. Carabiner attachment for backpack mounting during hikes.",
+     "tags": ["solar", "portable", "USB-C", "outdoor"], "materials": "Monocrystalline silicon, PET laminate, polyester canvas"},
+    {"title": "SmartTherm Travel Mug", "category": "electronics", "price": 39.99,
+     "desc": "App-connected travel mug with an LED temperature display on the lid. Set your preferred drinking temperature and the mug maintains it for up to 3 hours via battery-powered heating element.",
+     "tags": ["smart", "temperature-control", "travel", "app-connected"], "materials": "304 stainless steel, ceramic coating interior"},
+    {"title": "UltraSlim Power Bank 10K", "category": "electronics", "price": 34.99,
+     "desc": "10,000mAh portable charger thinner than most phones. Dual output (USB-C PD + USB-A QC3.0) charges two devices simultaneously. Fully recharges in 2.5 hours.",
+     "tags": ["power-bank", "USB-C", "portable", "fast-charging"], "materials": "Aluminum alloy shell, lithium-polymer cells"},
+    {"title": "Compact Bluetooth Speaker", "category": "electronics", "price": 44.99,
+     "desc": "IP67 waterproof and dustproof mini speaker with surprisingly rich 360° sound. 12-hour battery, built-in mic for calls, and a carabiner loop. Floats in water.",
+     "tags": ["bluetooth", "waterproof", "portable", "speaker"], "materials": "Rubberized exterior, passive bass radiator"},
+    {"title": "Night Owl LED Desk Lamp", "category": "electronics", "price": 54.99,
+     "desc": "Dimmable LED desk lamp with 5 color temperature presets and a wireless Qi charging pad in the base. Adjustable gooseneck, memory function, and a 1-hour auto-off timer.",
+     "tags": ["LED", "desk-lamp", "wireless-charging", "dimmable"], "materials": "Aluminum arm, ABS base with Qi coil"},
+    # ── Premium / Luxury ────────────────────────────────────────────────────
+    {"title": "Artisan Leather Weekender", "category": "premium", "price": 349.99,
+     "desc": "Hand-stitched full-grain vegetable-tanned leather duffle with brass YKK zippers. Develops a rich patina with age. Separate shoe compartment and detachable shoulder strap.",
+     "tags": ["leather", "handmade", "luxury", "travel"], "materials": "Full-grain vegetable-tanned leather, brass hardware"},
+    {"title": "Heritage Automatic Watch", "category": "premium", "price": 499.99,
+     "desc": "Swiss-movement automatic watch with a sapphire crystal dial and exhibition caseback. 42mm stainless steel case with a genuine alligator strap. 50-meter water resistance.",
+     "tags": ["automatic", "swiss-movement", "sapphire", "luxury"], "materials": "316L stainless steel, sapphire crystal, alligator leather strap"},
+    {"title": "Cashmere Blend Overcoat", "category": "premium", "price": 389.99,
+     "desc": "Italian-milled cashmere-wool blend overcoat with a notch lapel and half-canvas construction. Fully lined in Bemberg silk. Timeless silhouette for dressed-up or smart-casual looks.",
+     "tags": ["cashmere", "Italian", "luxury", "formal"], "materials": "70% wool, 30% cashmere, Bemberg lining"},
+    {"title": "Handcrafted Walnut Sunglasses", "category": "premium", "price": 179.99,
+     "desc": "Frames carved from sustainably sourced American black walnut with Carl Zeiss polarized lenses. Each pair has unique wood grain patterns. Spring hinges for a comfortable universal fit.",
+     "tags": ["handcrafted", "walnut", "polarized", "sustainable"], "materials": "Black walnut wood, Carl Zeiss polarized lenses"},
+    {"title": "Titanium Card Wallet", "category": "premium", "price": 89.99,
+     "desc": "Minimalist RFID-blocking wallet machined from grade-5 titanium. Holds 6 cards and features a quick-access pull tab. Weighs just 2.1 oz and will outlast any leather wallet.",
+     "tags": ["titanium", "RFID-blocking", "minimalist", "EDC"], "materials": "Grade-5 titanium, Dyneema pull tab"},
+    {"title": "Silk Pocket Square Collection", "category": "premium", "price": 59.99,
+     "desc": "Set of 3 hand-rolled Italian silk pocket squares in complementary patterns. Each square is individually wrapped in tissue — perfect as a gift or to elevate your suit game.",
+     "tags": ["silk", "Italian", "gift", "formal"], "materials": "100% Italian silk, hand-rolled edges"},
+    {"title": "Executive Leather Belt", "category": "premium", "price": 119.99,
+     "desc": "Single-piece full-grain bridle leather belt with a solid brass buckle. No stitching — the leather is thick enough to hold its shape for decades. Ages beautifully with wear.",
+     "tags": ["leather", "brass", "luxury", "classic"], "materials": "Full-grain English bridle leather, solid brass buckle"},
+    {"title": "Carbon Fiber Money Clip", "category": "premium", "price": 44.99,
+     "desc": "Aerospace-grade carbon fiber money clip with a satin finish. Ultra-lightweight and strong enough to hold 15+ folded bills without losing spring tension over time.",
+     "tags": ["carbon-fiber", "minimalist", "EDC", "lightweight"], "materials": "3K twill carbon fiber"},
+    # ── Home & Lifestyle ────────────────────────────────────────────────────
+    {"title": "Aromatherapy Soy Candle Set", "category": "home", "price": 36.99,
+     "desc": "Set of 3 hand-poured soy candles in amber glass jars: Lavender Fields, Cedar & Sage, and Vanilla Bean. 45-hour burn time each. Cotton wicks, no synthetic fragrances.",
+     "tags": ["soy", "aromatherapy", "handmade", "non-toxic"], "materials": "100% soy wax, cotton wicks, essential oils"},
+    {"title": "Japanese Ceramic Pour-Over Set", "category": "home", "price": 54.99,
+     "desc": "Minimalist pour-over coffee dripper with a double-wall ceramic server. The cone's spiral ribs allow optimal coffee bloom. Makes 2-4 cups of clean, nuanced brew.",
+     "tags": ["ceramic", "coffee", "Japanese", "minimalist"], "materials": "Hasami porcelain, borosilicate server"},
+    {"title": "Weighted Linen Throw Blanket", "category": "home", "price": 79.99,
+     "desc": "Stonewashed Belgian linen throw with a comfortable 3 lb weight. Gets softer with every wash. Perfect draped over a sofa or at the foot of the bed. OEKO-TEX certified.",
+     "tags": ["linen", "stonewashed", "cozy", "OEKO-TEX"], "materials": "100% Belgian flax linen"},
+    {"title": "Walnut & Brass Desk Organizer", "category": "home", "price": 44.99,
+     "desc": "Handcrafted desk organizer with solid walnut compartments and brass dividers. Holds pens, cards, phone, and small accessories. Felt-lined base protects desktop surfaces.",
+     "tags": ["walnut", "brass", "handcrafted", "office"], "materials": "American black walnut, brushed brass accents"},
+    {"title": "Terracotta Herb Planter Trio", "category": "home", "price": 29.99,
+     "desc": "Set of 3 terracotta planters with drainage holes and bamboo saucers. Perfect for kitchen windowsill herbs like basil, rosemary, and mint. Hand-finished with a matte glaze.",
+     "tags": ["terracotta", "gardening", "kitchen", "handmade"], "materials": "Terracotta clay, bamboo saucers"},
+    {"title": "Memory Foam Seat Cushion", "category": "home", "price": 39.99,
+     "desc": "Ergonomic U-shaped seat cushion with cooling gel-infused memory foam. Reduces tailbone pressure during long work sessions. Machine-washable velour cover with anti-slip bottom.",
+     "tags": ["ergonomic", "memory-foam", "office", "comfort"], "materials": "Gel-infused memory foam, velour cover"},
+    {"title": "Minimalist Wall Clock", "category": "home", "price": 49.99,
+     "desc": "12-inch silent-sweep wall clock with a birch plywood face and brass hands. No ticking sound — uses a precision quartz movement. Mounts flush with a single nail.",
+     "tags": ["minimalist", "silent", "birch", "Scandinavian"], "materials": "Baltic birch plywood, brass hands, quartz movement"},
+    {"title": "Turkish Cotton Bath Towel Set", "category": "home", "price": 64.99,
+     "desc": "Set of 4 Turkish cotton towels — 2 bath, 2 hand. Long-staple cotton loops absorb 3x their weight in water. Gets fluffier with each wash. OEKO-TEX Standard 100.",
+     "tags": ["Turkish-cotton", "absorbent", "OEKO-TEX", "bath"], "materials": "100% long-staple Turkish cotton"},
+    # ── Casual / Streetwear ─────────────────────────────────────────────────
+    {"title": "Urban Canvas Sneakers", "category": "casual", "price": 59.99,
+     "desc": "Classic low-top canvas sneakers with a vulcanized rubber sole for all-day comfort. Metal eyelets, cotton laces, and a removable cushioned insole. Comes in 8 colorways.",
+     "tags": ["canvas", "classic", "casual", "street"], "materials": "Organic cotton canvas, vulcanized rubber sole"},
+    {"title": "Oversized Graphic Hoodie", "category": "casual", "price": 54.99,
+     "desc": "Heavyweight 14 oz French terry hoodie with a relaxed oversized fit. Abstract graphic screen-printed with water-based inks. Ribbed cuffs, kangaroo pocket, and a double-lined hood.",
+     "tags": ["hoodie", "oversized", "streetwear", "graphic"], "materials": "80% cotton, 20% polyester French terry"},
+    {"title": "Slim Fit Chino Pants", "category": "casual", "price": 49.99,
+     "desc": "Tailored slim-fit chinos in a stretch twill that moves with you. Sits at the natural waist with a tapered leg. Works equally well with sneakers or loafers.",
+     "tags": ["chinos", "slim-fit", "stretch", "versatile"], "materials": "98% cotton, 2% elastane twill"},
+    {"title": "Vintage Wash Denim Jacket", "category": "casual", "price": 79.99,
+     "desc": "Classic trucker jacket in a medium-wash selvedge denim. Chest flap pockets, adjustable waist tabs, and copper-tone buttons. The perfect layering piece for spring and fall.",
+     "tags": ["denim", "trucker", "vintage", "layering"], "materials": "100% selvedge cotton denim"},
+    {"title": "Everyday Crossbody Bag", "category": "casual", "price": 34.99,
+     "desc": "Compact crossbody bag with an adjustable strap, front zip pocket, and RFID-protected main compartment. Fits phone, wallet, keys, and a small water bottle. Weighs 8 oz.",
+     "tags": ["crossbody", "RFID", "compact", "everyday"], "materials": "Water-resistant nylon, YKK zippers"},
+    {"title": "Bamboo Fiber Crew Socks 6-Pack", "category": "casual", "price": 24.99,
+     "desc": "Ultra-soft bamboo fiber socks with natural antibacterial and moisture-wicking properties. Reinforced heel and toe, seamless toe closure, and a mid-calf height. Fits sizes 6–12.",
+     "tags": ["bamboo", "antibacterial", "moisture-wicking", "comfort"], "materials": "70% bamboo viscose, 25% cotton, 5% elastane"},
+    {"title": "Relaxed Linen Drawstring Pants", "category": "casual", "price": 44.99,
+     "desc": "Breezy linen pants with an elastic drawstring waist and side pockets. Perfect for beach vacations, weekend errands, or just lounging at home. Gets softer with every wash.",
+     "tags": ["linen", "relaxed", "breathable", "vacation"], "materials": "100% pre-washed linen"},
+    {"title": "Retro Aviator Sunglasses", "category": "casual", "price": 29.99,
+     "desc": "Classic aviator frames in brushed gold metal with gradient smoke lenses. UV400 protection, adjustable nose pads, and spring-loaded temples for a comfortable fit.",
+     "tags": ["aviator", "UV400", "retro", "metal-frame"], "materials": "Brushed metal alloy, gradient polycarbonate lenses"},
+]
+def _expand_catalog(templates: list[dict], target_count: int = 200) -> list[Product]:
+    """
+    Expand hand-authored templates into a full catalog by adding tasteful
+    variations (color/version suffixes) while preserving description richness.
+    """
+    catalog: list[Product] = []
+    color_variants = [
+        "Charcoal", "Midnight Blue", "Forest Green", "Stone Grey",
+        "Rust Orange", "Ivory", "Slate", "Obsidian",
+    ]
+    idx = 1
+    variant_idx = 0
+    while len(catalog) < target_count:
+        for tmpl in templates:
+            if len(catalog) >= target_count:
+                break
+            if variant_idx == 0:
+                title = tmpl["title"]
+            else:
+                color = color_variants[variant_idx % len(color_variants)]
+                title = f"{tmpl['title']} — {color}"
+            catalog.append(Product(
+                id=idx,
+                title=title,
+                category=tmpl["category"],
+                price=round(tmpl["price"] * (1 + random.uniform(-0.08, 0.08)), 2),
+                desc=tmpl["desc"],
+                tags=list(tmpl["tags"]),
+                rating=round(random.uniform(3.8, 5.0), 1),
+                reviews=random.randint(12, 2400),
+                materials=tmpl["materials"],
+            ))
+            idx += 1
+        variant_idx += 1
+    return catalog
+def generate_catalog() -> list[Product]:
+    """Generate the full product catalog."""
+    return _expand_catalog(_TEMPLATES, target_count=200)
+def get_scenarios() -> dict[str, list[str]]:
+    """
+    Pre-built query sequences that demonstrate drift detection and
+    self-healing adaptation in a recruiter demo.
+    """
+    return {
+        "🟢 Phase 1 · Normal": [
+            "I need a good water bottle for hiking.",
+            "Looking for comfortable running shoes.",
+            "Can you recommend a fitness watch with GPS?",
+            "What kind of bags do you have for travel?",
+        ],
+        "🔴 Phase 2 · Black Friday": [
+            "What's the absolute cheapest winter hat you have?",
+            "Any bags under $25?",
+            "Show me the most budget-friendly options.",
+            "I only have $30 to spend, what can I get?",
+        ],
+        "☀️ Phase 3 · Summer Shift": [
+            "Do you have lightweight sandals for the beach?",
+            "I need breathable clothes for hot weather.",
+            "Looking for UV protection sunglasses.",
+            "Recommend summer vacation essentials.",
+        ],
+        "🌿 Phase 4 · Eco Trend": [
+            "Show me products made from recycled materials.",
+            "I only want sustainable, eco-friendly options.",
+            "Do you have anything organic or plant-based?",
+            "What's your most environmentally responsible product?",
+        ],
+    }

modules/drift.py ADDED Viewed

	@@ -0,0 +1,153 @@

+"""
+Semantic drift detector for RetailMind.
+Tracks the rolling semantic similarity of incoming user queries against
+predefined *concept anchors* (e.g., price-sensitivity, seasonal shift,
+eco-trend).  When the exponentially-weighted moving average for any concept
+exceeds a configurable threshold the system flags an active drift — which
+triggers the self-healing adapter to rewrite the LLM system prompt.
+"""
+from __future__ import annotations
+import logging
+import time
+from dataclasses import dataclass, field
+from typing import Any
+import numpy as np
+from sentence_transformers import SentenceTransformer
+logger = logging.getLogger(__name__)
+# Use shared model instance across retriever & drift detector
+_shared_model: SentenceTransformer | None = None
+def _get_model() -> SentenceTransformer:
+    global _shared_model
+    if _shared_model is None:
+        _shared_model = SentenceTransformer("all-MiniLM-L6-v2")
+    return _shared_model
+@dataclass
+class DriftEvent:
+    """Immutable record of a single drift measurement."""
+    timestamp: float
+    query: str
+    scores: dict[str, float]
+    dominant: str
+@dataclass
+class DriftDetector:
+    """
+    Monitors semantic drift across configurable concept anchors.
+    Uses EWMA (exponentially weighted moving average) to smooth noisy
+    single-query scores into stable trend signals.
+    """
+    threshold: float = 0.38
+    ewma_alpha: float = 0.35          # smoothing factor (higher = more reactive)
+    history: list[DriftEvent] = field(default_factory=list)
+    _ewma: dict[str, float] = field(default_factory=dict)
+    _concept_embs: dict[str, Any] = field(default_factory=dict, repr=False)
+    def __post_init__(self) -> None:
+        model = _get_model()
+        # Multiple anchor phrases per concept → averaged embedding for robustness
+        concept_phrases = {
+            "price_sensitive": [
+                "cheap budget discount low price clearance sale savings affordable",
+                "what is the cheapest option under twenty dollars bargain deal",
+                "I only have a limited budget, show me value picks",
+            ],
+            "summer_shift": [
+                "summer heat warm weather sandals shorts sunscreen beach",
+                "lightweight breathable sun protection hot climate UV",
+                "vacation tropical poolside outdoor warm temperature",
+            ],
+            "eco_trend": [
+                "eco-friendly sustainable organic recycled environment green",
+                "plant-based carbon-neutral zero waste biodegradable vegan",
+                "responsible sourcing ethical production renewable materials",
+            ],
+        }
+        for concept, phrases in concept_phrases.items():
+            embs = model.encode(phrases, show_progress_bar=False)
+            self._concept_embs[concept] = np.mean(embs, axis=0)
+            self._ewma[concept] = 0.0
+        logger.info("DriftDetector initialized with %d concept anchors.", len(concept_phrases))
+    # ── Public API ──────────────────────────────────────────────────────────
+    def analyze_drift(self, query: str) -> tuple[str, dict[str, float]]:
+        """
+        Score *query* against all concept anchors and return
+        ``(dominant_concept, raw_scores)``.
+        """
+        model = _get_model()
+        query_emb = model.encode([query], show_progress_bar=False)[0]
+        raw_scores: dict[str, float] = {}
+        for concept, ref_emb in self._concept_embs.items():
+            sim = float(
+                np.dot(query_emb, ref_emb)
+                / (np.linalg.norm(query_emb) * np.linalg.norm(ref_emb) + 1e-10)
+            )
+            raw_scores[concept] = sim
+            # Update EWMA
+            prev = self._ewma[concept]
+            self._ewma[concept] = self.ewma_alpha * sim + (1 - self.ewma_alpha) * prev
+        # Determine dominant drift from smoothed signal
+        detected = "normal"
+        max_smoothed = 0.0
+        for concept, smoothed in self._ewma.items():
+            if smoothed > self.threshold and smoothed > max_smoothed:
+                max_smoothed = smoothed
+                detected = concept
+        event = DriftEvent(
+            timestamp=time.time(),
+            query=query,
+            scores=raw_scores,
+            dominant=detected,
+        )
+        self.history.append(event)
+        if len(self.history) > 200:
+            self.history = self.history[-200:]
+        logger.debug("Drift analysis: %s | scores=%s | ewma=%s", detected, raw_scores, self._ewma)
+        return detected, raw_scores
+    def get_ewma_scores(self) -> dict[str, float]:
+        """Return current EWMA-smoothed scores for dashboard display."""
+        return dict(self._ewma)
+    def get_recent_stats(self) -> dict[str, float] | None:
+        """Return averaged raw scores from last N queries."""
+        if not self.history:
+            return None
+        recent = self.history[-5:]
+        concepts = list(self._concept_embs.keys())
+        return {
+            c: float(np.mean([e.scores[c] for e in recent]))
+            for c in concepts
+        }
+    def get_history_series(self) -> dict[str, list[float]]:
+        """Return full EWMA time-series for each concept (for charts)."""
+        # Recompute from history for accurate display
+        series: dict[str, list[float]] = {c: [] for c in self._concept_embs}
+        ewma_state = {c: 0.0 for c in self._concept_embs}
+        for event in self.history:
+            for c in self._concept_embs:
+                ewma_state[c] = self.ewma_alpha * event.scores[c] + (1 - self.ewma_alpha) * ewma_state[c]
+                series[c].append(ewma_state[c])
+        return series

modules/llm.py ADDED Viewed

	@@ -0,0 +1,95 @@

+"""
+Local LLM inference engine for RetailMind.
+Uses Qwen2.5-0.5B-Instruct running entirely on CPU — no API keys, no GPU,
+no external dependencies.  Prompt engineering is tuned to minimize
+hallucination by grounding all answers in the provided product context.
+"""
+from __future__ import annotations
+import logging
+import time
+from typing import Any
+import torch
+from transformers import pipeline
+logger = logging.getLogger(__name__)
+_generator = None
+def _get_pipeline():
+    """Lazy-load the text-generation pipeline (singleton)."""
+    global _generator
+    if _generator is None:
+        logger.info("Loading Qwen2.5-0.5B-Instruct on CPU (first call only)…")
+        t0 = time.time()
+        _generator = pipeline(
+            "text-generation",
+            model="Qwen/Qwen2.5-0.5B-Instruct",
+            device="cpu",
+            torch_dtype=torch.float32,
+        )
+        logger.info("Model loaded in %.1fs", time.time() - t0)
+    return _generator
+def generate_response(
+    system_prompt: str,
+    user_query: str,
+    retrieved_items: list[dict[str, Any]],
+) -> str:
+    """
+    Generate a grounded product recommendation.
+    The retrieved items are injected directly into the system prompt so
+    the model can only reference real products.
+    """
+    # Build structured context from retrieved products
+    context_lines = []
+    for i, r in enumerate(retrieved_items, 1):
+        p = r["product"]
+        stars = "★" * int(p.get("rating", 4)) + "☆" * (5 - int(p.get("rating", 4)))
+        context_lines.append(
+            f"{i}. {p['title']} — ${p['price']:.2f}\n"
+            f"   Category: {p['category']} | Rating: {stars} ({p.get('reviews', 0)} reviews)\n"
+            f"   Materials: {p.get('materials', 'N/A')}\n"
+            f"   Description: {p['desc']}"
+        )
+    context = "\n\n".join(context_lines)
+    messages = [
+        {
+            "role": "system",
+            "content": (
+                f"{system_prompt}\n\n"
+                f"══════ Available Inventory ══════\n\n"
+                f"{context}\n\n"
+                f"══════════════════════════════════\n"
+                f"IMPORTANT: Only recommend from the products listed above. "
+                f"Cite exact names and prices."
+            ),
+        },
+        {"role": "user", "content": user_query},
+    ]
+    try:
+        gen = _get_pipeline()
+        result = gen(
+            messages,
+            max_new_tokens=250,
+            temperature=0.3,
+            do_sample=True,
+            top_p=0.9,
+            return_full_text=False,
+        )
+        generated = result[0]["generated_text"]
+        if isinstance(generated, list):
+            return generated[-1]["content"]
+        return generated
+    except Exception as e:
+        logger.exception("LLM inference failed")
+        return f"[RetailMind] I encountered an issue generating a response. Error: {e}"

modules/retrieval.py ADDED Viewed

	@@ -0,0 +1,150 @@

+"""
+Hybrid retrieval engine for RetailMind.
+Combines dense semantic search (SentenceTransformers) with structured
+metadata filtering (price range, category, tags) so that queries like
+"eco-friendly bag under $30" actually return relevant, correctly-priced items.
+"""
+from __future__ import annotations
+import logging
+import re
+from typing import Any
+import numpy as np
+from sentence_transformers import SentenceTransformer
+logger = logging.getLogger(__name__)
+class HybridRetriever:
+    """Two-stage retriever: metadata pre-filter → semantic re-rank."""
+    def __init__(self, catalog: list[dict]) -> None:
+        self.catalog = catalog
+        self.model = SentenceTransformer("all-MiniLM-L6-v2")
+        # Build rich embedding texts that capture all searchable facets
+        texts = [
+            (
+                f"{p['title']}. {p['desc']} "
+                f"Category: {p['category']}. "
+                f"Materials: {p.get('materials', 'N/A')}. "
+                f"Tags: {', '.join(p.get('tags', []))}."
+            )
+            for p in catalog
+        ]
+        logger.info("Encoding %d products…", len(catalog))
+        self.embeddings = self.model.encode(texts, show_progress_bar=False)
+        self._norms = np.linalg.norm(self.embeddings, axis=1)
+        logger.info("Catalog indexed successfully.")
+    # ── Public API ──────────────────────────────────────────────────────────
+    def search(
+        self,
+        query: str,
+        top_k: int = 4,
+        category_filter: str | None = None,
+    ) -> list[dict[str, Any]]:
+        """
+        Retrieve top-k products for *query*.
+        Pipeline:
+        1. Extract price ceiling from natural language (e.g. "under $50").
+        2. Pre-filter catalog by price / category if applicable.
+        3. Rank remaining items by cosine similarity.
+        4. Return top-k with scores.
+        """
+        price_cap = self._extract_price_cap(query)
+        cat_hint = category_filter or self._extract_category_hint(query)
+        # Stage 1 — metadata pre-filter
+        candidate_indices = self._prefilter(price_cap, cat_hint)
+        # Stage 2 — semantic ranking over candidates
+        query_emb = self.model.encode([query], show_progress_bar=False)[0]
+        query_norm = np.linalg.norm(query_emb)
+        if len(candidate_indices) == 0:
+            # Fallback: rank entire catalog if filters yield nothing
+            candidate_indices = list(range(len(self.catalog)))
+        cand_embs = self.embeddings[candidate_indices]
+        cand_norms = self._norms[candidate_indices]
+        scores = np.dot(cand_embs, query_emb) / (cand_norms * query_norm + 1e-10)
+        top_local = np.argsort(scores)[::-1][:top_k]
+        results = []
+        for li in top_local:
+            global_idx = candidate_indices[li]
+            results.append({
+                "product": self.catalog[global_idx],
+                "score": float(scores[li]),
+            })
+        logger.debug(
+            "Query: %r | price_cap=%s | cat=%s | candidates=%d | top=%d",
+            query, price_cap, cat_hint, len(candidate_indices), len(results),
+        )
+        return results
+    # ── Private helpers ─────────────────────────────────────────────────────
+    @staticmethod
+    def _extract_price_cap(query: str) -> float | None:
+        """Parse 'under $50', 'below 30', 'less than $25', 'budget' etc."""
+        patterns = [
+            r"under\s*\$?\s*(\d+(?:\.\d+)?)",
+            r"below\s*\$?\s*(\d+(?:\.\d+)?)",
+            r"less\s+than\s*\$?\s*(\d+(?:\.\d+)?)",
+            r"cheaper\s+than\s*\$?\s*(\d+(?:\.\d+)?)",
+            r"max(?:imum)?\s*\$?\s*(\d+(?:\.\d+)?)",
+            r"\$(\d+(?:\.\d+)?)\s*(?:or\s+less|max|budget)",
+            r"only\s+have\s*\$?\s*(\d+)",
+            r"(?:spend|budget)\s*(?:of|is)?\s*\$?\s*(\d+)",
+        ]
+        for pat in patterns:
+            m = re.search(pat, query, re.IGNORECASE)
+            if m:
+                return float(m.group(1))
+        # Heuristic: very budget-oriented queries
+        budget_keywords = {"cheapest", "budget", "affordable", "inexpensive", "bargain"}
+        if any(kw in query.lower() for kw in budget_keywords):
+            return 50.0  # Reasonable default budget ceiling
+        return None
+    def _extract_category_hint(self, query: str) -> str | None:
+        """Map common query terms to catalog categories."""
+        category_keywords: dict[str, list[str]] = {
+            "winter": ["winter", "cold", "snow", "warm", "insulated", "thermal"],
+            "summer": ["summer", "beach", "hot", "heat", "sun", "warm weather"],
+            "eco-friendly": ["eco", "sustainable", "organic", "recycled", "green", "environment", "plant-based"],
+            "sports": ["sport", "fitness", "running", "gym", "training", "workout", "athletic"],
+            "electronics": ["tech", "electronic", "gadget", "headphone", "speaker", "charger", "smart"],
+            "premium": ["luxury", "premium", "high-end", "designer", "artisan"],
+            "home": ["home", "kitchen", "desk", "candle", "bath", "decor"],
+            "casual": ["casual", "streetwear", "everyday", "hoodie", "sneaker", "jeans"],
+        }
+        q_lower = query.lower()
+        for cat, keywords in category_keywords.items():
+            if any(kw in q_lower for kw in keywords):
+                return cat
+        return None
+    def _prefilter(
+        self, price_cap: float | None, category: str | None
+    ) -> list[int]:
+        """Return indices of products matching hard constraints."""
+        indices = []
+        for i, p in enumerate(self.catalog):
+            if price_cap is not None and p["price"] > price_cap:
+                continue
+            if category is not None and p["category"] != category:
+                continue
+            indices.append(i)
+        return indices

requirements.txt ADDED Viewed

	@@ -0,0 +1,8 @@

+gradio>=4.0.0
+transformers
+torch
+sentence-transformers
+huggingface_hub
+python-dotenv
+plotly
+numpy

tests/__init__.py ADDED Viewed

	@@ -0,0 +1 @@


1	+ # Tests package

tests/test_adaptation.py ADDED Viewed

	@@ -0,0 +1,51 @@

+"""
+Unit tests for the self-healing adapter.
+"""
+import pytest
+from modules.adaptation import Adapter
+@pytest.fixture
+def adapter():
+    return Adapter()
+class TestAdapter:
+    """Tests for the prompt adaptation engine."""
+    def test_normal_returns_base_prompt(self, adapter):
+        prompt = adapter.adapt_prompt("normal")
+        assert "RetailMind" in prompt
+        assert "ACTIVE ADAPTATION" not in prompt
+    def test_price_sensitive_injects_rules(self, adapter):
+        prompt = adapter.adapt_prompt("price_sensitive")
+        assert "PRICE SENSITIVITY" in prompt
+        assert "cheapest" in prompt.lower()
+    def test_summer_shift_injects_rules(self, adapter):
+        prompt = adapter.adapt_prompt("summer_shift")
+        assert "SEASONAL SHIFT" in prompt
+        assert "lightweight" in prompt.lower()
+    def test_eco_trend_injects_rules(self, adapter):
+        prompt = adapter.adapt_prompt("eco_trend")
+        assert "SUSTAINABILITY" in prompt
+        assert "recycled" in prompt.lower() or "organic" in prompt.lower()
+    def test_explanation_differs_per_state(self, adapter):
+        explanations = set()
+        for state in ["normal", "price_sensitive", "summer_shift", "eco_trend"]:
+            explanations.add(adapter.get_explanation(state))
+        assert len(explanations) == 4, "Each state should produce a unique explanation"
+    def test_label_differs_per_state(self, adapter):
+        labels = set()
+        for state in ["normal", "price_sensitive", "summer_shift", "eco_trend"]:
+            labels.add(adapter.get_label(state))
+        assert len(labels) == 4, "Each state should produce a unique label"
+    def test_base_prompt_contains_anti_hallucination(self, adapter):
+        prompt = adapter.adapt_prompt("normal")
+        assert "ONLY recommend" in prompt or "only recommend" in prompt.lower()

tests/test_catalog.py ADDED Viewed

	@@ -0,0 +1,50 @@

+"""
+Unit tests for RetailMind core modules.
+Run with: pytest tests/ -v
+"""
+import pytest
+from modules.data_simulation import generate_catalog, get_scenarios
+class TestCatalog:
+    """Tests for the product catalog generator."""
+    def test_catalog_size(self):
+        catalog = generate_catalog()
+        assert len(catalog) == 200, f"Expected 200 products, got {len(catalog)}"
+    def test_product_has_required_fields(self):
+        catalog = generate_catalog()
+        required = {"id", "title", "category", "price", "desc", "tags", "rating", "reviews", "materials"}
+        for p in catalog[:5]:
+            missing = required - set(p.keys())
+            assert not missing, f"Product {p['id']} missing fields: {missing}"
+    def test_prices_are_positive(self):
+        catalog = generate_catalog()
+        for p in catalog:
+            assert p["price"] > 0, f"Product {p['id']} has non-positive price: {p['price']}"
+    def test_ratings_in_range(self):
+        catalog = generate_catalog()
+        for p in catalog:
+            assert 1.0 <= p["rating"] <= 5.0, f"Product {p['id']} has invalid rating: {p['rating']}"
+    def test_categories_are_valid(self):
+        valid = {"winter", "summer", "eco-friendly", "sports", "electronics", "premium", "home", "casual"}
+        catalog = generate_catalog()
+        for p in catalog:
+            assert p["category"] in valid, f"Invalid category: {p['category']}"
+    def test_unique_ids(self):
+        catalog = generate_catalog()
+        ids = [p["id"] for p in catalog]
+        assert len(ids) == len(set(ids)), "Duplicate product IDs found"
+    def test_scenarios_not_empty(self):
+        scenarios = get_scenarios()
+        assert len(scenarios) >= 4, "Expected at least 4 scenario phases"
+        for name, queries in scenarios.items():
+            assert len(queries) >= 3, f"Scenario '{name}' has too few queries"

tests/test_drift.py ADDED Viewed

	@@ -0,0 +1,59 @@

+"""
+Unit tests for the drift detection module.
+"""
+import pytest
+from modules.drift import DriftDetector
+@pytest.fixture
+def detector():
+    return DriftDetector()
+class TestDriftDetector:
+    """Tests for semantic drift detection."""
+    def test_normal_query_no_drift(self, detector):
+        drift, scores = detector.analyze_drift("I need a good water bottle.")
+        assert drift == "normal", f"Expected 'normal', got '{drift}'"
+        assert all(isinstance(v, float) for v in scores.values())
+    def test_price_sensitive_detection(self, detector):
+        # Feed multiple budget-oriented queries to build up EWMA
+        for q in ["cheapest option", "budget under $20", "show me the cheapest"]:
+            drift, _ = detector.analyze_drift(q)
+        assert drift == "price_sensitive", f"Expected 'price_sensitive' after budget queries, got '{drift}'"
+    def test_eco_trend_detection(self, detector):
+        for q in ["sustainable organic products", "eco-friendly recycled", "I want plant-based items"]:
+            drift, _ = detector.analyze_drift(q)
+        assert drift == "eco_trend", f"Expected 'eco_trend' after eco queries, got '{drift}'"
+    def test_summer_shift_detection(self, detector):
+        for q in ["summer beach sandals", "hot weather lightweight", "UV protection for sun"]:
+            drift, _ = detector.analyze_drift(q)
+        assert drift == "summer_shift", f"Expected 'summer_shift' after summer queries, got '{drift}'"
+    def test_scores_have_all_concepts(self, detector):
+        _, scores = detector.analyze_drift("test query")
+        expected = {"price_sensitive", "summer_shift", "eco_trend"}
+        assert set(scores.keys()) == expected
+    def test_history_accumulates(self, detector):
+        for i in range(5):
+            detector.analyze_drift(f"query {i}")
+        assert len(detector.history) == 5
+    def test_ewma_scores_available(self, detector):
+        detector.analyze_drift("some query")
+        ewma = detector.get_ewma_scores()
+        assert isinstance(ewma, dict)
+        assert len(ewma) == 3
+    def test_history_series_length_matches(self, detector):
+        for i in range(10):
+            detector.analyze_drift(f"query {i}")
+        series = detector.get_history_series()
+        for concept, data in series.items():
+            assert len(data) == 10, f"{concept} series length {len(data)} != 10"

tests/test_retrieval.py ADDED Viewed

	@@ -0,0 +1,60 @@

+"""
+Unit tests for hybrid retrieval engine.
+"""
+import pytest
+from modules.data_simulation import generate_catalog
+from modules.retrieval import HybridRetriever
+@pytest.fixture(scope="module")
+def retriever():
+    catalog = generate_catalog()
+    return HybridRetriever(catalog)
+class TestHybridRetriever:
+    """Tests for the hybrid retrieval system."""
+    def test_returns_correct_count(self, retriever):
+        results = retriever.search("running shoes", top_k=4)
+        assert len(results) == 4
+    def test_results_have_scores(self, retriever):
+        results = retriever.search("water bottle")
+        for r in results:
+            assert "score" in r
+            assert "product" in r
+            assert 0.0 <= r["score"] <= 1.0
+    def test_price_filtering_under_30(self, retriever):
+        results = retriever.search("shoes under $30", top_k=4)
+        for r in results:
+            assert r["product"]["price"] <= 30.0, (
+                f"Product '{r['product']['title']}' costs ${r['product']['price']} "
+                f"but should be under $30"
+            )
+    def test_price_filtering_under_50(self, retriever):
+        results = retriever.search("I only have $50 to spend", top_k=4)
+        for r in results:
+            assert r["product"]["price"] <= 50.0
+    def test_eco_category_relevance(self, retriever):
+        results = retriever.search("eco-friendly sustainable products", top_k=4)
+        eco_count = sum(1 for r in results if r["product"]["category"] == "eco-friendly")
+        assert eco_count >= 2, f"Expected ≥2 eco products, got {eco_count}"
+    def test_winter_category_relevance(self, retriever):
+        results = retriever.search("warm winter jacket for cold weather", top_k=4)
+        winter_count = sum(1 for r in results if r["product"]["category"] == "winter")
+        assert winter_count >= 2, f"Expected ≥2 winter products, got {winter_count}"
+    def test_results_sorted_by_score(self, retriever):
+        results = retriever.search("fitness watch with GPS", top_k=4)
+        scores = [r["score"] for r in results]
+        assert scores == sorted(scores, reverse=True), "Results not sorted by score"
+    def test_empty_query_returns_results(self, retriever):
+        results = retriever.search("", top_k=4)
+        assert len(results) == 4  # Should gracefully handle empty queries