LLM-Defense (english)

This is a simple classifier meant to filter out common attack vectors for LLMs.

Uses

The main usecase for this in AI agents. This model is best used as a gate between a outside input (via email, text, etc) and the inner model (Opus, Codex, etc) that actually will run the prompts. This is not a catchall for all of the attacks, but it akin to making sure the doors are locked to your house.

Downloads last month
36
Safetensors
Model size
65.8M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for shariqtorres/llm-defense-eng

Finetuned
(336)
this model

Dataset used to train shariqtorres/llm-defense-eng