UQ Changes
#4
by ink-pad - opened
No description provided.
Models Added
uncertainty/lora/granite-4.0-micro- LoRA adapteruncertainty/alora/granite-4.0-micro- aLoRA adapter
How It Works
- Base model generates a response to a user query
- UQ adapter is invoked with
<certainty>token - Model outputs
{"score": "0-9"}indicating confidence level - Score is transformed to calibrated likelihood (0.05-0.95) via
io.yaml
Training Details
- Base model: ibm-granite/granite-4.0-micro
- LoRA config: rank=32, alpha=32, target_modules=[q_proj, k_proj, v_proj]
- Training samples: ~215K examples with certainty labels
- Evaluation: 57 MMLU subsets (14,042 samples)
Files
adapter_config.json- PEFT adapter configurationadapter_model.safetensors- Trained adapter weightsio.yaml- Output schema and transformation rules
ink-pad changed pull request status to open
frreiss changed pull request status to merged