metadata
title: IntelliGuard Firewall
emoji: 🛡️
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.1.0
python_version: 3.11
app_file: app.py
pinned: false
license: mit
🛡️ IntelliGuard | Enterprise Prompt Injection Firewall
IntelliGuard is a zero-trust, multi-layered AI security firewall designed to protect enterprise LLMs and autonomous agents from deep semantic jailbreaks, zero-click exploits, and multimodal prompt injections.
This Hugging Face Space serves as the lightweight frontend. All heavy inference is routed remotely to an AMD Instinct MI300X cloud instance, demonstrating production-grade, split-stack deployment.
How to Use This Space
- Live Scanner: Navigate to the first tab to manually type payloads or use the Quick Insert test vectors (e.g., Base64 Smuggling, Roleplay Jailbreaks).
- Batch Demo: Run a high-speed test of 20 concurrent payloads to evaluate the throughput of the connected AMD hardware.
- API Integration: This frontend falls back to a simulated local instance if the main cloud server spins down. To point it at any active backend instead, set the INTELLIGUARD_API environment variable.
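The backend selection described in the last bullet can be sketched as follows. This is a minimal illustration, not the Space's actual code: the `resolve_backend` helper and the `SIMULATED_BACKEND` address are hypothetical names, and only the `INTELLIGUARD_API` variable comes from this README.

```python
import os

# Hypothetical placeholder for the simulated local instance; the real
# fallback address used by app.py is not documented in this README.
SIMULATED_BACKEND = "http://127.0.0.1:8000"


def resolve_backend(env=None):
    """Return (base_url, is_simulated) for the scanner frontend.

    Assumption: a non-empty INTELLIGUARD_API value always wins; otherwise
    the frontend talks to the simulated local instance.
    """
    env = os.environ if env is None else env
    url = env.get("INTELLIGUARD_API", "").strip()
    if url:
        # Drop a trailing slash so later path joins stay predictable.
        return url.rstrip("/"), False
    return SIMULATED_BACKEND, True
```

Passing the environment as a parameter keeps the helper easy to test without mutating `os.environ`.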
🧠 The 4-Layer Architecture
Instead of relying on a single, easily bypassed classifier, IntelliGuard forces all input through a specialized funnel:
[User Prompt / Inbound Email]
        │
        ▼
1. SPINE (DistilBERT) ──> Catches structural syntax & hacker code (90.4% F1)
        │
        ▼
2. DECODER ─────────────> Unpacks Base64, Hex, and hidden text smuggling
        │
        ▼
3. BRAIN (XLM-RoBERTa) ─> Catches semantic roleplay & native languages (99.1% F1)
        │
        ▼
4. JUDGE (Ensemble NN) ─> Final consensus evaluation
        │
        ▼
[EXECUTOR / AGENT] ──> Payload verified safe. Allowed to process.
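The funnel above can be sketched in a few lines of Python. Everything here is an illustrative assumption rather than IntelliGuard's implementation: `decode_views`, `scan`, and `Verdict` are hypothetical names, the SPINE and BRAIN models are passed in as plain scoring callables, and the JUDGE is approximated by a simple average with a threshold instead of an ensemble network.

```python
import base64
import binascii
import re
from dataclasses import dataclass
from typing import Callable, Dict, List

Scorer = Callable[[str], float]  # returns a risk score in [0, 1]


@dataclass
class Verdict:
    allowed: bool
    scores: Dict[str, float]
    views: List[str]


def decode_views(text: str) -> List[str]:
    """Layer 2 stand-in: the raw text plus any Base64 or hex runs decoded as UTF-8."""
    views = [text]
    for token in re.findall(r"[A-Za-z0-9+/]{16,}={0,2}", text):
        try:
            views.append(base64.b64decode(token, validate=True).decode("utf-8"))
        except (binascii.Error, UnicodeDecodeError):
            pass  # not valid Base64 text; ignore this run
    for token in re.findall(r"(?:[0-9a-fA-F]{2}){8,}", text):
        try:
            views.append(bytes.fromhex(token).decode("utf-8"))
        except (ValueError, UnicodeDecodeError):
            pass  # not valid hex-encoded text; ignore this run
    return views


def scan(text: str, spine: Scorer, brain: Scorer, threshold: float = 0.5) -> Verdict:
    """Run one input through the four stages in the order shown in the diagram."""
    spine_score = spine(text)                    # 1. structural check on raw input
    views = decode_views(text)                   # 2. unpack smuggled payloads
    brain_score = max(brain(v) for v in views)   # 3. semantic check, worst view wins
    judge_score = (spine_score + brain_score) / 2  # 4. placeholder consensus
    scores = {"spine": spine_score, "brain": brain_score, "judge": judge_score}
    return Verdict(allowed=judge_score < threshold, scores=scores, views=views)
```

In this sketch the semantic scorer sees every decoded view, so an instruction hidden inside a Base64 string is scored as if it had been typed in plain text. Whether the real pipeline short-circuits after an early layer flags the input is not stated here, so the sketch always runs all four stages.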