Quantized EXAONE4
Collection
Quantized checkpoints for EXAONE4 series • 5 items • Updated
4-bit: mse=2.4 + group_size=32 + SmoothMSE(64,0.75) + EoRA(rank=64)
Expected: 98-99.5% quality × 2.8-3.2x = 2.74-3.18 score
from gptqmodel import GPTQModel
model = GPTQModel.from_quantized("namgyu-youn/EXAONE-4.0-1.2B-GPTQ-W4A16-EoRA", device="cuda:0")
from vllm import LLM
llm = LLM(model="namgyu-youn/EXAONE-4.0-1.2B-GPTQ-W4A16-EoRA", dtype="float16")
Base model
LGAI-EXAONE/EXAONE-4.0-1.2B