This model was converted to openvino IR using NNCF scale estimation algorithm which minimizes L2 error between original and compressed layers.

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Echo9Zulu/Nemotron-Cascade-14B-Thinking-int4_asym-se-ov

Base model

Finetuned

(6)

this model

Dataset used to train Echo9Zulu/Nemotron-Cascade-14B-Thinking-int4_asym-se-ov