Preference data bootstrapping (cheap but effective)

#26

by jbakerx - opened Dec 17, 2025


Generate 2–4 candidates per prompt with different decoding settings, then auto-score the candidates with:
- repetition penalty metrics
- “modern slang” detectors
- language ID + Cyrillic cleanliness

Keep the top-scoring candidates and have humans rank a smaller subset of them.
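
A rough sketch of the auto-scoring and filtering steps, in case it helps the discussion. Everything here is an assumption for illustration, not the project's code: the function names, the tiny `MODERN_SLANG` list, the equal weighting of the three scores, and the use of a Cyrillic letter ratio as a crude stand-in for a real language ID model. Candidate generation with different decoding settings is left out; the input is just a list of candidate strings for one prompt.

```python
import random
import re

# Hypothetical placeholder lexicon; a real detector would need a curated list.
MODERN_SLANG = {"lol", "кринж", "хайп", "вайб"}


def repetition_score(text, n=3):
    """Share of distinct word n-grams; 1.0 means no repeated n-gram."""
    words = text.lower().split()
    if len(words) < n:
        return 1.0
    ngrams = [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]
    return len(set(ngrams)) / len(ngrams)


def slang_score(text):
    """Share of words that are NOT in the slang lexicon; higher is better."""
    words = re.findall(r"\w+", text.lower())
    if not words:
        return 0.0
    hits = sum(w in MODERN_SLANG for w in words)
    return 1.0 - hits / len(words)


def cyrillic_score(text):
    """Share of letters in the basic Cyrillic block (U+0400..U+04FF).

    Assumption: used here as a cheap proxy for language ID plus
    Cyrillic cleanliness. It does not distinguish between languages
    that share the script.
    """
    letters = [c for c in text if c.isalpha()]
    if not letters:
        return 0.0
    cyr = sum("\u0400" <= c <= "\u04FF" for c in letters)
    return cyr / len(letters)


def auto_score(text):
    """Unweighted mean of the three heuristics (weights are an assumption)."""
    return (repetition_score(text) + slang_score(text) + cyrillic_score(text)) / 3


def keep_top(candidates, k=2):
    """Return the k highest-scoring candidates for one prompt."""
    return sorted(candidates, key=auto_score, reverse=True)[:k]


def sample_for_humans(prompts, fraction=0.1, seed=0):
    """Pick a reproducible random subset of prompts for human ranking."""
    rng = random.Random(seed)
    size = max(1, int(len(prompts) * fraction))
    return rng.sample(list(prompts), size)
```

The kept candidates for the sampled prompts would go to human rankers. The rest could either be dropped or ordered by `auto_score` alone, depending on how much noise in the preference pairs is acceptable.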

salakash

Owner Dec 24, 2025

We will consider this enhancement for inclusion in version 2.0.0.
