YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Qwen3-4B-AgentBench-llm2025_advance_v5_ironguard-speed-fp8

This is the Speed-Optimized version of the V5 "Iron Guard" Agent. Generated by converting the V5-BF16 weights to FP8 (Float8) to overcome high latency per iteration.

🛡️ "Iron Guard" Discipline + Speed

This version maintains extreme discipline and strict Action format enforcement while reducing model size by 50% (4GB) for faster inference.

[日本語訳] 本モデルは、SQL等の推論遅延を解消するために、V5 (Iron Guard) を FP8 量子化 した高速版です。規律を削ることなく、軽量化(4GB)によってスループットを向上させています。

Downloads last month
1
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support