IntelliGuard / README.md
sarthak20P's picture
Update README.md
42907ff verified
|
raw
history blame
1.89 kB
metadata
title: IntelliGuard Firewall
emoji: πŸ›‘οΈ
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.1.0
python_version: 3.11
app_file: app.py
pinned: false
license: mit

πŸ›‘οΈ IntelliGuard | Enterprise Prompt Injection Firewall

IntelliGuard is a zero-trust, multi-layered AI security firewall designed to protect enterprise LLMs and autonomous agents from deep semantic jailbreaks, zero-click exploits, and multimodal prompt injections.

This Hugging Face Space serves as the lightweight frontend. All heavy inference is routed remotely to an AMD Instinct MI300X cloud instance, demonstrating production-grade, split-stack deployment.

πŸš€ How to Use This Space

  1. Live Scanner: Navigate to the first tab to manually type payloads or use the Quick Insert test vectors (e.g., Base64 Smuggling, Roleplay Jailbreaks).
  2. Batch Demo: Run a high-speed test of 20 concurrent payloads to evaluate the throughput of the connected AMD hardware.
  3. API Integration: This frontend defaults to a simulated local instance if the main cloud server spins down, but can be configured to point to any active backend via the INTELLIGUARD_API environment variable.

🧠 The 4-Layer Architecture

Instead of relying on a single, easily bypassed classifier, IntelliGuard forces all input through a specialized funnel:

[User Prompt / Inbound Email] 
       β”‚
       β–Ό
 1. SPINE (DistilBERT) β€”β€”> Catches structural syntax & hacker code (90.4% F1)
       β”‚
       β–Ό
 2. DECODER β€”β€”β€”β€”β€”β€”β€”β€”β€”β€”β€”β€”β€”> Unpacks Base64, Hex, and hidden text smuggling
       β”‚
       β–Ό
 3. BRAIN (XLM-RoBERTa) β€”> Catches semantic roleplay & native languages (99.1% F1)
       β”‚
       β–Ό
 4. JUDGE (Ensemble NN) β€”> Final consensus evaluation 
       β”‚
       β–Ό
[EXECUTOR / AGENT] β€”β€”> Payload verified safe. Allowed to process.