A2RBench: An Automatic Paradigm for Formally Verifiable Abstract Reasoning Benchmark Generation Paper • 2605.17278 • Published 3 days ago • 2