noctrex
/

UI-Venus-1.5-30B-A3B-MXFP4_MOE-GGUF

Model card Files Files and versions

This is a MXFP4_MOE quantization of the model UI-Venus-1.5-30B-A3B

Usage Notes:

Download the latest llama.cpp to use these quantizations.
Try to use the best quality you can run.
For the mmproj file, the F32 version is recommended for best results (F32 > BF16 > F16).

Downloads last month: 31

GGUF

Model size

31B params

Architecture

qwen3vlmoe

Hardware compatibility

Log In to add your hardware

4-bit

Model tree for noctrex/UI-Venus-1.5-30B-A3B-MXFP4_MOE-GGUF

Base model

inclusionAI/UI-Venus-1.5-30B-A3B

Quantized

(5)

this model