Spaces:

lablab-ai-amd-developer-hackathon
/

IntelliGuard

Running

App Files Files Community

IntelliGuard / README.md

sarthak20P

Update README.md

42907ff verified 1 day ago

1.89 kB

title: IntelliGuard Firewall
emoji: 🛡️
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.1.0
python_version: 3.11
app_file: app.py
pinned: false
license: mit

🛡️ IntelliGuard | Enterprise Prompt Injection Firewall

IntelliGuard is a zero-trust, multi-layered AI security firewall designed to protect enterprise LLMs and autonomous agents from deep semantic jailbreaks, zero-click exploits, and multimodal prompt injections.

This Hugging Face Space serves as the lightweight frontend. All heavy inference is routed remotely to an AMD Instinct MI300X cloud instance, demonstrating production-grade, split-stack deployment.

🚀 How to Use This Space

Live Scanner: Navigate to the first tab to manually type payloads or use the Quick Insert test vectors (e.g., Base64 Smuggling, Roleplay Jailbreaks).
Batch Demo: Run a high-speed test of 20 concurrent payloads to evaluate the throughput of the connected AMD hardware.
API Integration: This frontend defaults to a simulated local instance if the main cloud server spins down, but can be configured to point to any active backend via the INTELLIGUARD_API environment variable.

🧠 The 4-Layer Architecture

Instead of relying on a single, easily bypassed classifier, IntelliGuard forces all input through a specialized funnel:

[User Prompt / Inbound Email] 
       │
       ▼
 1. SPINE (DistilBERT) ——> Catches structural syntax & hacker code (90.4% F1)
       │
       ▼
 2. DECODER —————————————> Unpacks Base64, Hex, and hidden text smuggling
       │
       ▼
 3. BRAIN (XLM-RoBERTa) —> Catches semantic roleplay & native languages (99.1% F1)
       │
       ▼
 4. JUDGE (Ensemble NN) —> Final consensus evaluation 
       │
       ▼
[EXECUTOR / AGENT] ——> Payload verified safe. Allowed to process.