Ideas Backlog

  • [P1] Add a Japanese continued-pretraining phase on clean monolingual prose before SFT if SFT-only loss plateaus.
  • [P1] Replace raw translated HH examples with a stricter subset filtered by Japanese character ratio and URL density.
  • [P2] Try a consistent prompt prefix such as 日本語で自然かつ簡潔に答えてください。 ("Please answer naturally and concisely in Japanese.") to bias the adapter toward Japanese-only responses.
  • [P3] Sweep LoRA target depth via --num-layers (8/16/all) once data quality stabilizes.
  • [P1] Filter HH/OASST rows with high URL density or unusually low Japanese ratio after normalization; these are likely noisy translated web answers.
  • [P1] Lower completion_max_chars for a compact-response training subset to see if the adapter learns cleaner Japanese response style faster.
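
A minimal sketch of the Japanese-ratio and URL-density filter mentioned in the P1 items above. Everything here is an assumption for illustration: the function names, the choice of Unicode ranges (hiragana, katakana, CJK unified ideographs), NFKC as the normalization step, and both thresholds are hypothetical and would need tuning against the actual HH/OASST rows.

```python
import re
import unicodedata

# Hiragana, katakana, and CJK unified ideographs (assumed definition of "Japanese character").
_JA_CHAR = re.compile(r"[\u3040-\u309F\u30A0-\u30FF\u4E00-\u9FFF]")
_URL = re.compile(r"https?://\S+")


def japanese_ratio(text: str) -> float:
    """Fraction of non-whitespace characters that are kana or kanji, after NFKC."""
    normalized = unicodedata.normalize("NFKC", text)
    chars = [c for c in normalized if not c.isspace()]
    if not chars:
        return 0.0
    return sum(1 for c in chars if _JA_CHAR.match(c)) / len(chars)


def url_density(text: str) -> float:
    """Number of URLs per 100 characters of text."""
    if not text:
        return 0.0
    return 100.0 * len(_URL.findall(text)) / len(text)


def keep_row(text: str, min_ja_ratio: float = 0.5, max_url_density: float = 1.0) -> bool:
    """Keep a row only if it is mostly Japanese and not URL-heavy (hypothetical thresholds)."""
    return japanese_ratio(text) >= min_ja_ratio and url_density(text) <= max_url_density
```

Running NFKC first folds half-width katakana into the full-width range, so such rows are not undercounted. Punctuation such as 。 is deliberately not counted as Japanese here, which makes the ratio slightly conservative.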
