MetaphoricalCode/Dumpling-Qwen2.5-32B-v2-exl3-5bpw-hb6

Text Generation · Transformers · Safetensors · qwen2 · conversational · text-generation-inference · 5-bit · exl3

Quantized with exllamav3 (version 0.0.2) using its default quantization settings.

  • Original model: https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-32B-v2
  • exllamav3: https://github.com/turboderp-org/exllamav3
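The "5bpw" in the model name refers to roughly 5 bits per weight. As a rough illustration of what that buys, the sketch below estimates the on-disk weight footprint; the ~32.8B parameter count for the Qwen2.5-32B base and the uniform bits-per-weight assumption are illustrative, not stated in this card.

```python
# Hypothetical back-of-the-envelope estimate of quantized weight size.
# Assumes a uniform bits-per-weight rate; real exl3 quants mix rates
# across tensors (e.g. the "hb6" head bits), so treat this as a rough guide.

def quantized_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate on-disk size of the weights in gigabytes."""
    return n_params * bits_per_weight / 8 / 1e9

params = 32.8e9  # assumed parameter count for Qwen2.5-32B (illustrative)

fp16_gb = quantized_size_gb(params, 16)  # unquantized FP16 baseline
exl3_gb = quantized_size_gb(params, 5)   # ~5 bpw, per the model name

print(f"FP16: ~{fp16_gb:.1f} GB, 5 bpw exl3: ~{exl3_gb:.1f} GB")
# → FP16: ~65.6 GB, 5 bpw exl3: ~20.5 GB
```

The roughly 3x reduction versus FP16 is what lets a 32B model fit on a single 24–32 GB GPU, which is the usual motivation for quants at this bit rate.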


Dumpling-Qwen2.5-32B-v2

nbeerbower/Rombos-EVAGutenberg-TIES-Qwen2.5-32B finetuned on:

  • nbeerbower/GreatFirewall-DPO
  • nbeerbower/Schule-DPO
  • nbeerbower/Purpura-DPO
  • nbeerbower/Arkhaios-DPO
  • jondurbin/truthy-dpo-v0.1
  • antiven0m/physical-reasoning-dpo
  • flammenai/Date-DPO-NoAsterisks
  • flammenai/Prude-Phi3-DPO
  • Atsunori/HelpSteer2-DPO
  • jondurbin/gutenberg-dpo-v0.1
  • nbeerbower/gutenberg2-dpo
  • nbeerbower/gutenberg-moderne-dpo

Method

Tuned with QLoRA and ORPO on 8x A100 GPUs for 2 epochs, using a rank-64 LoRA adapter and a 2e-5 learning rate.
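A rank-64 LoRA trains only a small fraction of the model's weights. The sketch below counts the trainable parameters such an adapter adds; the hidden size (5120) and the choice of adapted projections are assumptions for illustration, not taken from Qwen2.5-32B's actual config.

```python
# Rough count of trainable parameters added by a rank-64 LoRA adapter.
# A LoRA update factors a weight delta as B @ A, where B is (d_out x r)
# and A is (r x d_in), so the adapter holds r * (d_in + d_out) parameters.

def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters for one LoRA-adapted linear layer."""
    return rank * (d_in + d_out)

hidden = 5120  # assumed hidden size (illustrative)
rank = 64      # from the model card

# e.g. adapting the q/k/v/o projections of one attention block,
# treating each as a square hidden x hidden matrix for simplicity
per_block = 4 * lora_params(hidden, hidden, rank)
print(f"~{per_block / 1e6:.1f}M trainable params per attention block")
# → ~2.6M trainable params per attention block
```

A few million trainable parameters per block is orders of magnitude below full finetuning, which is why QLoRA-style preference tuning of a 32B model is feasible on a single 8x A100 node.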


Model tree for MetaphoricalCode/Dumpling-Qwen2.5-32B-v2-exl3-5bpw-hb6

  • Base model: nbeerbower/Rombos-EVAGutenberg-TIES-Qwen2.5-32B
  • Finetuned: nbeerbower/Dumpling-Qwen2.5-32B-v2
  • Quantized: this model (one of 9 quantizations)
