Add logprobs workaround for harmony channel tokens
#3
by kndtran - opened
No description provided.
kndtran changed pull request status to open
kndtran changed pull request title from Increase max tokens for answerability to Add logprobs workaround for harmony channel tokens
Summary
- Enable `logprobs_workaround: true` in all 4 io.yaml files (citations, hallucination_detection, query_rewrite, answerability)
- Increase `max_completion_tokens` for answerability to account for harmony channel token overhead
Details
gpt-oss models use the Harmony response format, which wraps output in channel tokens (`<|channel|>final<|message|>...<|end|>`). When the inference server fails to strip these tokens from `message.content`, downstream JSON parsing breaks.
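A minimal illustration of the failure mode, assuming a hypothetical response payload whose Harmony framing was not stripped:

```python
import json

# Hypothetical example: the inference server returned the Harmony-wrapped
# output verbatim instead of just the message body.
raw_content = '<|channel|>final<|message|>{"answerable": true}<|end|>'

try:
    json.loads(raw_content)
except json.JSONDecodeError:
    # The leading <|channel|> token makes the string invalid JSON,
    # so any downstream structured parsing fails.
    print("downstream JSON parsing breaks")
```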
The `logprobs_workaround` flag (added in granite-common#127) derives the model's output content from the logprob token sequence instead of trusting `message.content`, since the logprobs are the authoritative record of the tokens the model actually produced.
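The idea can be sketched as follows. This is not the granite-common implementation; the `content_from_logprobs` helper, the token dictionaries, and the regex over channel markers are all illustrative assumptions about how a logprobs-based reconstruction might look:

```python
import json
import re

def content_from_logprobs(logprob_tokens):
    """Rebuild output text from the logprob token sequence, then keep
    only the body of the Harmony 'final' channel if one is present."""
    text = "".join(t["token"] for t in logprob_tokens)
    m = re.search(r"<\|channel\|>final<\|message\|>(.*?)<\|end\|>", text, re.DOTALL)
    return m.group(1) if m else text

# Hypothetical logprob entries for a Harmony-wrapped JSON answer.
tokens = [
    {"token": "<|channel|>"}, {"token": "final"}, {"token": "<|message|>"},
    {"token": '{"answerable"'}, {"token": ": true}"}, {"token": "<|end|>"},
]

print(json.loads(content_from_logprobs(tokens)))  # {'answerable': True}
```

Because the reconstruction starts from the tokens themselves, it does not depend on the inference server having stripped the channel framing from `message.content`.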
frreiss changed pull request status to merged