This is a MXFP4_MOE quantization of the model UI-Venus-1.5-30B-A3B
Usage Notes:
- Download the latest llama.cpp to use these quantizations.
- Try to use the best quality you can run.
- For the
mmprojfile, the F32 version is recommended for best results (F32 > BF16 > F16).
- Downloads last month
- 31
Hardware compatibility
Log In to add your hardware
4-bit
Model tree for noctrex/UI-Venus-1.5-30B-A3B-MXFP4_MOE-GGUF
Base model
inclusionAI/UI-Venus-1.5-30B-A3B