Sulphur-2-Base (Dev) - GGUF

This repository (Abiray/Sulphur-2-base-GGUF) contains GGUF-format model files for SulphurAI's Sulphur-2-base.
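As a minimal sketch of fetching a single file rather than cloning the whole repository, the snippet below uses `hf_hub_download` from the third-party `huggingface_hub` package (which must be installed separately). The repo id is the one shown on the model page, and the filenames are those listed under Available Quantizations. The helper name `download_quant`, the `QUANT_FILES` mapping, and the `models` target directory are illustrative assumptions, not part of this repository.

```python
# Sketch only: download one GGUF file from the Hub.
# Assumes `pip install huggingface_hub`; helper names below are hypothetical.

REPO_ID = "Abiray/Sulphur-2-base-GGUF"

# Quantization type -> filename, copied from the Available Quantizations list.
QUANT_FILES = {
    "BF16": "sulphur_dev_bf16.gguf",
    "Q8_0": "sulphur_dev-Q8_0.gguf",
    "Q6_K": "sulphur_dev-Q6_K.gguf",
    "Q5_K_M": "sulphur_dev-Q5_K_M.gguf",
    "Q4_K_M": "sulphur_dev-Q4_K_M.gguf",
    "Q3_K_M": "sulphur_dev-Q3_K_M.gguf",
}


def download_quant(quant: str, local_dir: str = "models") -> str:
    """Download the file for `quant` and return its local path."""
    # Imported lazily so the mapping above is usable without the package.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(
        repo_id=REPO_ID,
        filename=QUANT_FILES[quant],
        local_dir=local_dir,
    )


if __name__ == "__main__":
    # Downloads several gigabytes; pick the tier that suits your hardware.
    print(download_quant("Q4_K_M"))
```

Calling `download_quant("Q4_K_M")` would fetch only the recommended-standard file; the other tiers are left on the Hub until requested.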

Model Details

Available Quantizations

The following quantization tiers are provided to accommodate different hardware capabilities and VRAM constraints.

| Filename | Quantization Type | Recommended Use |
|---|---|---|
| sulphur_dev_bf16.gguf | BF16 | Unquantized baseline (16-bit). Maximum quality and accuracy. Requires massive VRAM (42GB+). |
| sulphur_dev-Q8_0.gguf | Q8_0 | Extremely high quality, near unquantized performance. Requires high VRAM. |
| sulphur_dev-Q6_K.gguf | Q6_K | Very high quality, minimal precision loss. |
| sulphur_dev-Q5_K_M.gguf | Q5_K_M | Excellent balance of quality and performance. |
| sulphur_dev-Q4_K_M.gguf | Q4_K_M | Recommended standard. Fast inference with very low quality degradation. |
| sulphur_dev-Q3_K_M.gguf | Q3_K_M | High compression. Best for constrained environments with limited RAM/VRAM. |
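To make the VRAM trade-off between the tiers above more concrete, here is a rough back-of-the-envelope sketch. It multiplies the 21B parameter count reported for this model by the nominal bits per weight of each GGML block format. These are lower-bound estimates under stated assumptions: the `_M` variants keep some tensors at higher precision, so real files are typically somewhat larger, and inference needs extra memory for activations. The `headroom_gb` default and both function names are hypothetical choices for illustration.

```python
# Rough size estimate per quantization tier (sketch, not measured file sizes).

PARAMS = 21e9  # parameter count reported for this model

# Nominal bits per weight: block bytes * 8 / weights per block.
BITS_PER_WEIGHT = {
    "BF16": 16.0,
    "Q8_0": 8.5,       # 34 bytes per 32 weights
    "Q6_K": 6.5625,    # 210 bytes per 256 weights
    "Q5_K_M": 5.5,     # Q5_K base type: 176 bytes per 256 weights
    "Q4_K_M": 4.5,     # Q4_K base type: 144 bytes per 256 weights
    "Q3_K_M": 3.4375,  # Q3_K base type: 110 bytes per 256 weights
}


def estimated_size_gb(quant: str, params: float = PARAMS) -> float:
    """Approximate weight size in GB (1 GB = 1e9 bytes)."""
    return params * BITS_PER_WEIGHT[quant] / 8 / 1e9


def largest_fitting_quant(budget_gb: float, headroom_gb: float = 2.0):
    """Return the highest-precision tier whose estimate plus headroom fits, or None."""
    by_precision = sorted(BITS_PER_WEIGHT, key=BITS_PER_WEIGHT.get, reverse=True)
    for quant in by_precision:
        if estimated_size_gb(quant) + headroom_gb <= budget_gb:
            return quant
    return None


if __name__ == "__main__":
    for quant in BITS_PER_WEIGHT:
        print(f"{quant:8s} ~{estimated_size_gb(quant):.1f} GB")
    print("Suggested tier for a 24 GB budget:", largest_fitting_quant(24.0))
```

Under these assumptions the BF16 estimate comes to 42 GB, which agrees with the "42GB+" figure in the table; treat the other numbers as a starting point and check the actual file sizes before choosing a tier.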

Model size: 21B params
Architecture: ltxv