LLM-Defense (english)
This is a simple classifier meant to filter out common attack vectors for LLMs.
Uses
The main usecase for this in AI agents. This model is best used as a gate between a outside input (via email, text, etc) and the inner model (Opus, Codex, etc) that actually will run the prompts. This is not a catchall for all of the attacks, but it akin to making sure the doors are locked to your house.
- Downloads last month
- 36
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for shariqtorres/llm-defense-eng
Base model
distilbert/distilbert-base-cased