initial submission: Research Integrity Gym with Llama 3.3 baseline 62b6842 Bhavishya011 commited on Mar 28