IntelliGuard / README.md
sarthak20P's picture
Update README.md
16ff4a3 verified
---
title: IntelliGuard Firewall
emoji: πŸ›‘οΈ
sdk: gradio
sdk_version: 4.44.1
python_version: "3.10"
app_file: app.py
pinned: false
license: mit
---
# πŸ›‘οΈ IntelliGuard | Enterprise Prompt Injection Firewall
**IntelliGuard** is a zero-trust, multi-layered AI security firewall designed to protect enterprise LLMs and autonomous agents from deep semantic jailbreaks, zero-click exploits, and multimodal prompt injections.
This Hugging Face Space serves as the lightweight frontend. All heavy inference is routed remotely to an **AMD Instinct MI300X** cloud instance, demonstrating production-grade, split-stack deployment.
## πŸš€ How to Use This Space
1. **Live Scanner:** Navigate to the first tab to manually type payloads or use the Quick Insert test vectors (e.g., Base64 Smuggling, Roleplay Jailbreaks).
2. **Batch Demo:** Run a high-speed test of 20 concurrent payloads to evaluate the throughput of the connected AMD hardware.
3. **API Integration:** This frontend defaults to a simulated local instance if the main cloud server spins down, but can be configured to point to any active backend via the `INTELLIGUARD_API` environment variable.
## 🧠 The 4-Layer Architecture
Instead of relying on a single, easily bypassed classifier, IntelliGuard forces all input through a specialized funnel:
```text
[User Prompt / Inbound Email]
β”‚
β–Ό
1. SPINE (DistilBERT) β€”β€”> Catches structural syntax & hacker code (90.4% F1)
β”‚
β–Ό
2. DECODER β€”β€”β€”β€”β€”β€”β€”β€”β€”β€”β€”β€”β€”> Unpacks Base64, Hex, and hidden text smuggling
β”‚
β–Ό
3. BRAIN (XLM-RoBERTa) β€”> Catches semantic roleplay & native languages (99.1% F1)
β”‚
β–Ό
4. JUDGE (Ensemble NN) β€”> Final consensus evaluation
β”‚
β–Ό
[EXECUTOR / AGENT] β€”β€”> Payload verified safe. Allowed to process.
```