gguf when official? because the 3rd party one even a q8 is on drugs.

#1
by TomieLLM - opened

this is Q8 at https://huggingface.co/mazrba/YanoljaNEXT-Rosetta-4B-2510-Q8_0-GGUF
image

i dont know if they did wrong or something.

image

Yanolja org

Hi,
https://huggingface.co/yanolja/YanoljaNEXT-Rosetta-4B-2510-GGUF
Just uploaded. Since it is my first time creating GGUF files, there could be any mistakes. If so, please let us know. Thanks!

image
Damnโ€ฆ nice, nice. I knew it! Lesson learned, I should wait for the official GGUF. The people who make GGUFs sometimes mess things up. This one works fine though. โค๏ธ

Iโ€™m gonna do more testing since the system prompt is pretty sensitive.

Other models understand this: โ€œTranslate the following Japanese text to English. Output only the English translation, nothing else.โ€ I was wondering why it was losing BLEU and semantic scores.

Turns out it was outputting Japanese instead of English, so I had to change it to:
prompt="Translate the user's text to English."

from rank 23 it jump to rank 10 hehe

image

I might need to elaborate what I mean about "Turns out it was outputting Japanese instead of English"

image
image

TomieLLM changed discussion status to closed
Yanolja org

Yes, it could be very sensitive to the system prompt format because we did not make any variations of the system prompt in the training dataset.
However, it looks very weird that the Rosetta model underperformed compared to gemma-3-4b-it.
If possible, can you let us know how we can reproduce it? Thanks for testing!

Yes, it could be very sensitive to the system prompt format because we did not make any variations of the system prompt in the training dataset.
However, it looks very weird that the Rosetta model underperformed compared to gemma-3-4b-it.
If possible, can you let us know how we can reproduce it? Thanks for testing!

sure but ๐Ÿ˜… mind that this is like basic basic test https://github.com/TomieAi/basic-benchmark.

I have trouble telling rosetta model about references in rpgm like \N[idx] was place holder for a proper noun..

cant swap it sometimes because sometimes u can edit character name.. the only time it was static f it was a item/skill or place name.
It probably out of scope of the project. but it will be nice if it improve the IFEval performance

# additional prompt
system_lines.append(
        "RPGM GUIDE TRANSLATION\n\n"
        "Preserving \\N[...] codes as proper noun placeholder.\n"
        "Example:\n"
        "INPUT:ใ€Œใ“ใฎ\\N[4]ใฎๆŒ‡่ผชใฏใ€\\N[21]ใง็ฅ–ๆฏใŒ็ฅ–็ˆถใซ่ดˆใฃใŸใ‚‚ใฎใ ใ€‚ใ€\n"
        "OUTPUT: โ€œThis ring of \\\\N[4] was given by my grandmother to my grandfather in \\\\N[21].โ€\n\n"
        "Preserving \\C[...] codes as color\n"
        "Example:\n"
        "INPUT: \\\\C[4]ใ€ๆ‹“ๆตทใ€‘\\\\C[0] ใ€Œใฏใใ€ใฏใใ€\n"
        "OUTPUT: \\\\C[4]Takumi\\\\C[0] Haa, haa."
    )

Rosetta is hit or miss on this..

๐ŸŸฆ [1/1] Translating using model: yanoljanext-rosetta-4b-2510
โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”
๐Ÿ“œ Japanese: ใ€Œๆ›ธๅซšใŠไน…ใ—ใถใ‚Šใ€‚ไฟบใ€ใ‚ตใƒžใƒผใ‚ญใƒฃใƒณใƒ—ใงไผšใฃใŸ\N[1]ใ€‚ใ€
๐Ÿงญ Context: A friendly reunion scene where the speaker recognizes someone they met before at a summer camp.
๐ŸŽญ Tone: Warm and casual
๐Ÿ“˜ Glossary:
   - ๆ›ธๅซš โ†’ Amanda
   - ใ‚ตใƒžใƒผใ‚ญใƒฃใƒณใƒ— โ†’ summer camp
๐Ÿ—’๏ธ Other Context (do not translate):
   - \N[1] = person name
โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”
"Amanda, long time no see. I'm [Person Name], from the summer camp we met at."
--
"Amanda, long time no see. I'm the one you met at summer camp back in \\N[1]."
--
"Amanda, long time no see. I'm that guy you met at summer camp."

Gemma is consistent.

๐ŸŸฆ [1/1] Translating using model: gemma-3n-e4b-it
โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”
๐Ÿ“œ Japanese: ใ€Œๆ›ธๅซšใŠไน…ใ—ใถใ‚Šใ€‚ไฟบใ€ใ‚ตใƒžใƒผใ‚ญใƒฃใƒณใƒ—ใงไผšใฃใŸ\N[1]ใ€‚ใ€
๐Ÿงญ Context: A friendly reunion scene where the speaker recognizes someone they met before at a summer camp.
๐ŸŽญ Tone: Warm and casual
๐Ÿ“˜ Glossary:
   - ๆ›ธๅซš โ†’ Amanda
   - ใ‚ตใƒžใƒผใ‚ญใƒฃใƒณใƒ— โ†’ summer camp
๐Ÿ—’๏ธ Other Context (do not translate):
   - \N[1] = person name
โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”
โœ… Done โ†’ "Amanda, long time no see. I'm \\N[1] we met at summer camp."... 

Sign up or log in to comment