Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Rohan03
/
purpose-agent
like
0
Text Generation
English
purpose-agent
agents
self-improving
multi-agent
memory-system
local-first
slm
safety
event-driven
rag
tools
License:
mit
Model card
Files
Files and versions
xet
Community
12ff1aa
purpose-agent
/
benchmarks
30.3 kB
Ctrl+K
Ctrl+K
1 contributor
History:
4 commits
Rohan03
fix: real-model robustness โ benchmarks/validate_real.py
d7dc6c8
verified
14 days ago
results
Track 2: validation suite with improvement curves, cold/warm, transfer, adversarial
14 days ago
validate.py
Safe
15.5 kB
Track 2: validation suite with improvement curves, cold/warm, transfer, adversarial
14 days ago
validate_real.py
Safe
8.16 kB
fix: real-model robustness โ benchmarks/validate_real.py
14 days ago