
GGUF My Repo re-design

#187

by olegshulyakov - opened Aug 9, 2025

Migrate the Docker image to the official llama.cpp CUDA image.
Rewrite app.py in an object-oriented style and redesign the method signatures.
Add llama-quantize options: --token-embedding-type, --leave-output-tensor, --output-tensor-type.
Make output options customizable: repo name, file name.
Upload different quants to the same repository.
Update the imatrix training file to calibration_data_v5_rc.txt.
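
For reviewers, here is a minimal sketch of how the new llama-quantize options might be assembled into a command line. It is not taken from this PR's app.py: the function name `build_quantize_command` and its parameters are hypothetical, and the only things assumed from llama.cpp are the flag names listed above, the existing `--imatrix` flag, and the positional order input file, output file, quantization type.

```python
from typing import List, Optional


def build_quantize_command(
    input_path: str,
    output_path: str,
    quant_type: str,
    imatrix_path: Optional[str] = None,
    leave_output_tensor: bool = False,
    output_tensor_type: Optional[str] = None,
    token_embedding_type: Optional[str] = None,
) -> List[str]:
    """Build an argument list for llama-quantize (hypothetical helper).

    Optional flags go first, followed by the positional arguments:
    input GGUF, output GGUF, quantization type.
    """
    cmd = ["llama-quantize"]
    if imatrix_path:
        cmd += ["--imatrix", imatrix_path]
    if leave_output_tensor:
        cmd.append("--leave-output-tensor")
    if output_tensor_type:
        cmd += ["--output-tensor-type", output_tensor_type]
    if token_embedding_type:
        cmd += ["--token-embedding-type", token_embedding_type]
    cmd += [input_path, output_path, quant_type]
    return cmd


if __name__ == "__main__":
    # Example file names are placeholders, not files from this PR.
    example = build_quantize_command(
        "model-f16.gguf",
        "model-Q4_K_M.gguf",
        "Q4_K_M",
        leave_output_tensor=True,
        token_embedding_type="q8_0",
    )
    print(" ".join(example))
```

Returning a list rather than a shell string means the result can be passed to `subprocess.run(cmd, check=True)` without shell quoting, which matters when repo or file names are user-supplied.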

olegshulyakov changed pull request status to open Aug 10, 2025


