Suggest renaming files per GGUF naming convention

#6
by mishig - opened

Hi @antirez , small filename suggestion.

The current names encode the full per-tensor recipe (e.g. …IQ2XXS-w2Q2K-AProjQ8-SExpQ8-OutQ8-chat-v2.gguf). The convention used by Unsloth, Bartowski, and mradermacher (and described in the GGUF naming spec) is to label by the dominant body quant (matching general.file_type=19=MOSTLY_IQ2_XXS) and put the full recipe in the model card. They use _L / _XL when embed/output/critical tensors are bumped to Q8_0, exactly your situation. For imatrix variants, mradermacher marks them with an i1- prefix on the encoding.

By that convention:

Current Proposed
…IQ2XXS-w2Q2K-AProjQ8-SExpQ8-OutQ8-chat-v2.gguf DeepSeek-V4-Flash-IQ2_XXS_L.gguf
…IQ2XXS-w2Q2K-AProjQ8-SExpQ8-OutQ8-chat-v2-imatrix.gguf DeepSeek-V4-Flash-i1-IQ2_XXS_L.gguf
…Q4KExperts-F16HC-F16Compressor-F16Indexer-Q8Attn-Q8Shared-Q8Out-chat-v2.gguf DeepSeek-V4-Flash-Q4_K_XL.gguf
…Q4KExperts-F16HC-F16Compressor-F16Indexer-Q8Attn-Q8Shared-Q8Out-chat-v2-imatrix.gguf DeepSeek-V4-Flash-i1-Q4_K_XL.gguf
DeepSeek-V4-Flash-MTP-Q4K-Q8_0-F32.gguf DeepSeek-V4-Flash-MTP-Q4_K.gguf

Renames can be done in-place via the HF UI's GGUF Editor: no reupload needed. Happy to open a PR if useful 😊

for example, on https://huggingface.co/antirez/deepseek-v4-gguf/blob/main/DeepSeek-V4-Flash-IQ2XXS-w2Q2K-AProjQ8-SExpQ8-OutQ8-chat-v2-imatrix.gguf

image

Sign up or log in to comment