Expand eval set to 89 questions; add semantic-fusion weight knob d72272a Beemer Claude Opus 4.7 commited on 5 days ago