This is a MXFP4_MOE quantization of the model UI-Venus-1.5-30B-A3B

Usage Notes:

  • Download the latest llama.cpp to use these quantizations.
  • Try to use the best quality you can run.
  • For the mmproj file, the F32 version is recommended for best results (F32 > BF16 > F16).
Downloads last month
31
GGUF
Model size
31B params
Architecture
qwen3vlmoe
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for noctrex/UI-Venus-1.5-30B-A3B-MXFP4_MOE-GGUF

Quantized
(5)
this model