rankalign-v6-gemma-2-9b-it-d0.15-e2-ambigqa-all-tcs-fsx-lo0.1

Fine-tuned checkpoint from the rankalign project.

Training Details

Field	Value
Base model	`google/gemma-2-9b-it`
Version	v6
Task	`ambigqa-all`
Epoch	2
Delta	0.15
Typicality correction	self
Length normalization	False
Preference loss weight	1
NLL validator weight	0
NLL generator weight	0
Validator log-odds	False
Force same-x	True
Semi-supervised ratio	None
Labeled-only ratio	0.1

Reproducibility

Original checkpoint name: v6-google--gemma-2-9b-it-delta0.15-epoch2--ambigqa-all--d2g--random--alpha1.0--tc-self--full-completion--force-same-x--labelonly0.1

To evaluate:

python scripts/eval_by_claude.py \
    --model TAUR-dev/rankalign-v6-gemma-2-9b-it-d0.15-e2-ambigqa-all-tcs-fsx-lo0.1 \
    --task ambigqa-all \
    --split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
    --self-typicality

Downloads last month: 187

Safetensors

Model size

9B params

Tensor type

F32

Model tree for TAUR-dev/rankalign-v6-gemma-2-9b-it-d0.15-e2-ambigqa-all-tcs-fsx-lo0.1

Base model

google/gemma-2-9b

Finetuned

google/gemma-2-9b-it

Finetuned

(403)

this model