This model was converted to OpenVINO IR using the NNCF scale estimation algorithm, which minimizes the L2 error between the outputs of the original and compressed layers.
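To make the idea concrete, here is a minimal pure-Python sketch of choosing an int4 asymmetric scale by minimizing the L2 error of a layer's output. It is a toy illustration, not NNCF's implementation: the grid search, the single-row layer, the random data, and every function name below are assumptions made for this example. In NNCF itself, scale estimation is requested through the `scale_estimation` option of `nncf.compress_weights` together with a calibration dataset.

```python
import random


def quantize_asym_int4(weights, scale, zero_point):
    """Fake-quantize floats: map to integers in [0, 15], then back to floats."""
    restored = []
    for w in weights:
        q = round(w / scale) + zero_point
        q = max(0, min(15, q))  # clamp to the unsigned 4-bit range
        restored.append((q - zero_point) * scale)
    return restored


def layer_outputs(weights, samples):
    """Outputs of a one-row linear layer for each activation sample."""
    return [sum(w * x for w, x in zip(weights, sample)) for sample in samples]


def l2_error(weights, candidate, samples):
    """Squared L2 distance between original and compressed layer outputs."""
    reference = layer_outputs(weights, samples)
    compressed = layer_outputs(candidate, samples)
    return sum((r - c) ** 2 for r, c in zip(reference, compressed))


def evaluate_scale(weights, samples, scale):
    """Return (error, zero_point) for one candidate scale."""
    zero_point = max(0, min(15, round(-min(weights) / scale)))
    restored = quantize_asym_int4(weights, scale, zero_point)
    return l2_error(weights, restored, samples), zero_point


def estimate_scale(weights, samples, steps=41):
    """Hypothetical grid search around the plain min/max scale."""
    span = max(weights) - min(weights)
    base_scale = span / 15 if span > 0 else 1.0
    base_error, base_zero_point = evaluate_scale(weights, samples, base_scale)
    best = (base_error, base_scale, base_zero_point)
    for i in range(steps):
        # Candidate scales from 0.5x to 1.5x of the min/max scale.
        scale = base_scale * (0.5 + i / (steps - 1))
        error, zero_point = evaluate_scale(weights, samples, scale)
        if error < best[0]:
            best = (error, scale, zero_point)
    return best, base_error


# Synthetic weights and calibration activations (assumed data, for illustration only).
random.seed(0)
weights = [random.gauss(0.0, 1.0) for _ in range(32)]
samples = [[random.gauss(0.0, 1.0) for _ in range(32)] for _ in range(16)]

(best_error, best_scale, best_zero_point), base_error = estimate_scale(weights, samples)
print(f"min/max scale error:  {base_error:.4f}")
print(f"searched scale error: {best_error:.4f} "
      f"(scale={best_scale:.4f}, zero_point={best_zero_point})")
```

The search starts from the plain min/max scale, so the selected scale never gives a larger output error than that baseline on the calibration samples. Measuring error on layer outputs rather than on the weights themselves is what lets the scale adapt to the activations the layer actually sees.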


Model tree for Echo9Zulu/Nemotron-Cascade-14B-Thinking-int4_asym-se-ov
