Auto-Quantized GGUF Model

This repository contains automatically generated GGUF quantizations of nbeerbower/Qwen3.5-9B-Writing-DPO.

The calibration data for the imatrix is targeted at Chinese novels and role-playing (RP), while preserving logic and common sense.
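A minimal sketch of how an imatrix-based quantization like this is typically produced with llama.cpp (the tool names `llama-imatrix` and `llama-quantize` and all file paths below are assumptions for illustration, not taken from this repository):

```python
# Sketch of an imatrix quantization workflow. Tool names and file paths are
# assumptions (llama.cpp conventions), not details confirmed by this repo.
def imatrix_commands(f16_gguf, calib_txt, imatrix_dat, out_gguf, qtype="Q5_K"):
    """Build the two commands: compute an importance matrix over the
    calibration text, then quantize the F16 model using that matrix."""
    gen = ["llama-imatrix", "-m", f16_gguf, "-f", calib_txt, "-o", imatrix_dat]
    quant = ["llama-quantize", "--imatrix", imatrix_dat,
             f16_gguf, out_gguf, qtype]
    return gen, quant

gen, quant = imatrix_commands(
    "Qwen3.5-9B-Writing-DPO-f16.gguf", "calibration.txt",
    "imatrix.dat", "Qwen3.5-9B-Writing-DPO-Q5_K.gguf")
print(" ".join(gen))
print(" ".join(quant))
```

The importance matrix records which weights matter most on the calibration text, so the quantizer can preserve precision where it affects that domain (here, Chinese fiction and RP).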

๐Ÿ“Š Perplexity Evaluation

(Tested against the provided calibration dataset)

  • Base (F16/BF16): PPL = 15.6960 +/- 0.13226
  • Q5_K: PPL = 14.7557 +/- 0.12028
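For reference, perplexity is the exponential of the average negative log-likelihood the model assigns to the evaluation tokens; a minimal sketch of that computation (not the exact procedure or error estimate used by the evaluation tool):

```python
import math

def perplexity(token_logprobs):
    """PPL = exp(-(1/N) * sum(log p_i)) over N evaluation tokens.
    token_logprobs: natural-log probabilities the model assigned."""
    n = len(token_logprobs)
    mean_nll = -sum(token_logprobs) / n
    return math.exp(mean_nll)

# If every token is assigned probability 0.5, perplexity is exactly 2.
print(perplexity([math.log(0.5)] * 8))  # -> 2.0
```

Lower is better: a PPL of ~14.8 means the quantized model is, on average, about as uncertain as choosing among ~15 equally likely tokens on this dataset.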
Model Details

  • Format: GGUF
  • Model size: 9B params
  • Architecture: qwen35
  • Quantizations: 4-bit, 5-bit

Model tree for nuofang/Qwen3.5-9B-Writing-DPO-GGUF

  • Base model: Qwen/Qwen3.5-9B
  • Finetuned: nbeerbower/Qwen3.5-9B-Writing-DPO
  • Quantized (4): this model