Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

prodnull
/
minilm-prompt-injection-classifier

Text Classification
ONNX
English
prompt-injection
security
adversarial-robustness
Eval Results (legacy)
Model card Files Files and versions
xet
Community

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Gated model
You can list files but not access them

Preview of files found in this repository
  • .gitattributes
    1.52 kB
    initial commit about 1 month ago
  • README.md
    9.78 kB
    docs: update model card for v4 adversarial hardening (FreeLB+PWWS, Mahalanobis, adaptive benchmark) about 1 month ago
  • mini_semantic.onnx
    90.6 MB
    xet
    v4: adversarially hardened model (FreeLB + 2 rounds PWWS, 6,472 samples) about 1 month ago
  • special_tokens_map.json
    124 Bytes
    Initial release: fine-tuned MiniLM-L6-v2 ONNX classifier (87 MB, F1=95.80%) about 1 month ago
  • tokenizer.json
    712 kB
    Initial release: fine-tuned MiniLM-L6-v2 ONNX classifier (87 MB, F1=95.80%) about 1 month ago
  • tokenizer_config.json
    565 Bytes
    Initial release: fine-tuned MiniLM-L6-v2 ONNX classifier (87 MB, F1=95.80%) about 1 month ago
  • vocab.txt
    232 kB
    Initial release: fine-tuned MiniLM-L6-v2 ONNX classifier (87 MB, F1=95.80%) about 1 month ago