Submitted by Qingchuan Ma 2 A2RBench: An Automatic Paradigm for Formally Verifiable Abstract Reasoning Benchmark Generation MAC-AutoML 2 1