inference
#1
pinned
by CemalSahin - opened
Thanks for quantizing this model!
I made a simple script to use these GGUF models easily: https://github.com/cmlshn/PromptEnhancer-GGUF
Just run `python inference/prompt_enhancer_gguf.py`. Works great on an H100, getting ~54 tok/s.
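For anyone trying this from scratch, a minimal sketch of the setup steps, assuming the repo layout from the link above and llama-cpp-python as the GGUF runtime (the actual dependencies may differ; check the repo's README):

```shell
# Clone the linked inference-script repo (path layout assumed from the post)
git clone https://github.com/cmlshn/PromptEnhancer-GGUF
cd PromptEnhancer-GGUF

# llama-cpp-python is a common runtime for GGUF models; assumed dependency here
pip install llama-cpp-python

# Run the inference helper mentioned in the post
python inference/prompt_enhancer_gguf.py
```

The GGUF files themselves would still need to be downloaded from this repo first if the script doesn't fetch them automatically.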
mradermacher pinned discussion