Add zero-shot baseline mode: --adapter_path none skips adapter loading, --no_think suppresses Qwen3 thinking" a2801bb verified nraptisss commited on 9 days ago
Add evaluate_v3.py β stratified sampling, layer-aware max tokens, incremental saves, resume support 734da09 verified nraptisss commited on 10 days ago
Fix: flush stdout for nohup, log every sample, add timestamps f34fb3a verified nraptisss commited on 11 days ago
Add evaluate_v2.py β standard-aware KPI checking (fixes 92% false negatives in reliability metric) f1d77cf verified nraptisss commited on 11 days ago
Fix: --max_seq_length β --max_length (matches train.py argparse) f5ecafd verified nraptisss commited on 12 days ago
Add comprehensive scientific documentation with 38 citations 630af96 verified nraptisss commited on 12 days ago
fix: max_seq_length β max_length, warmup_ratio β warmup_steps (TRL 1.3 compat) 2fdbc71 verified nraptisss commited on 12 days ago