Fix 7 architect-identified issues, add eval harness — targeting 9.5+ c970958 umanggarg Claude Sonnet 4.6 commited on Mar 24