Tokenizer error?

#1
by 3ndetz - opened


Looks like the GGUF version is broken...

The safetensors model with the patched tokenizers (the one I converted from) worked fine, though.
I made some patches to the llama.cpp HF-to-GGUF conversion script to get it to work (replaced the SentencePiece tokenizer for T5 with a GPT-2 tokenizer, and set the pre-tokenizer to gpt2). The model launches now, but there seems to be garbage in its outputs.
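As a rough illustration of why that swap can produce garbage (this is a hypothetical sketch, not the actual llama.cpp code): SentencePiece and GPT-2 byte-level BPE mark word-leading spaces with different placeholder characters, so tokens emitted under one convention get mangled when detokenized under the other.

```python
# Hypothetical sketch: SentencePiece marks a leading space with "▁" (U+2581),
# while GPT-2 byte-level BPE uses "Ġ" (U+0120). Mixing the conventions
# corrupts the decoded text.

def sp_decode(tokens):
    # SentencePiece-style detokenization: "▁" becomes a space.
    return "".join(tokens).replace("\u2581", " ").strip()

def gpt2_decode(tokens):
    # GPT-2-style detokenization: "Ġ" becomes a space.
    return "".join(tokens).replace("\u0120", " ").strip()

# Tokens as a SentencePiece-based T5 vocab would store them:
sp_tokens = ["\u2581Hello", "\u2581world"]

print(sp_decode(sp_tokens))    # correct pairing: "Hello world"
print(gpt2_decode(sp_tokens))  # wrong decoder: "▁Hello▁world" survives intact
```

The real converter also has to get the pre-tokenizer regex and the merges right, so this only shows one of the ways the output can turn into trash.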

I'll look for a solution later, but for now I'm freezing this for a couple of days or longer. If anyone happens to read this (especially someone who has worked with FRED), I'd be glad for the help.
Here is the Colab script for the conversion.
