Onnx 4 Bit version of nvidia/Llama3-ChatQA-1.5-8B used by FusionQuill.AI
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Onnx 4 Bit version of nvidia/Llama3-ChatQA-1.5-8B used by FusionQuill.AI