Qwen3-8B-NVFP4

NVFP4 quantized Qwen3-8B for NVIDIA Blackwell GPUs (RTX 5090, RTX PRO 4000).

Details

  • Format: NVFP4 (4-bit FP) + FP8 KV cache
  • Tools: TensorRT-LLM 1.2.0, ModelOpt 0.37.0
  • Calibration: 512 samples, cnn_dailymail

Usage

huggingface-cli download glux-cz/Qwen3-8B-NVFP4-Blackwell --local-dir ./checkpoint
trtllm-build --checkpoint_dir ./checkpoint --output_dir ./engine --gemm_plugin nvfp4
Downloads last month
5
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for glux-cz/Qwen3-8B-NVFP4-Blackwell

Finetuned
Qwen/Qwen3-8B
Finetuned
(1468)
this model