File size: 1,862 Bytes
4d79edd
8e86148
 
4d79edd
16ff4a3
2dba678
4d79edd
bc6ce15
 
4d79edd
8e86148
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
---
title: IntelliGuard Firewall
emoji: πŸ›‘οΈ
sdk: gradio
sdk_version: 4.44.1
python_version: "3.10"
app_file: app.py
pinned: false
license: mit
---
# πŸ›‘οΈ IntelliGuard | Enterprise Prompt Injection Firewall

**IntelliGuard** is a zero-trust, multi-layered AI security firewall designed to protect enterprise LLMs and autonomous agents from deep semantic jailbreaks, zero-click exploits, and multimodal prompt injections. 

This Hugging Face Space serves as the lightweight frontend. All heavy inference is routed remotely to an **AMD Instinct MI300X** cloud instance, demonstrating production-grade, split-stack deployment.

## πŸš€ How to Use This Space
1. **Live Scanner:** Navigate to the first tab to manually type payloads or use the Quick Insert test vectors (e.g., Base64 Smuggling, Roleplay Jailbreaks).
2. **Batch Demo:** Run a high-speed test of 20 concurrent payloads to evaluate the throughput of the connected AMD hardware.
3. **API Integration:** This frontend defaults to a simulated local instance if the main cloud server spins down, but can be configured to point to any active backend via the `INTELLIGUARD_API` environment variable.

## 🧠 The 4-Layer Architecture

Instead of relying on a single, easily bypassed classifier, IntelliGuard forces all input through a specialized funnel:

```text
[User Prompt / Inbound Email] 
       β”‚
       β–Ό
 1. SPINE (DistilBERT) β€”β€”> Catches structural syntax & hacker code (90.4% F1)
       β”‚
       β–Ό
 2. DECODER β€”β€”β€”β€”β€”β€”β€”β€”β€”β€”β€”β€”β€”> Unpacks Base64, Hex, and hidden text smuggling
       β”‚
       β–Ό
 3. BRAIN (XLM-RoBERTa) β€”> Catches semantic roleplay & native languages (99.1% F1)
       β”‚
       β–Ό
 4. JUDGE (Ensemble NN) β€”> Final consensus evaluation 
       β”‚
       β–Ό
[EXECUTOR / AGENT] β€”β€”> Payload verified safe. Allowed to process.
```