Add MDPBench evaluation results

#50

Add official MDPBench benchmark results.

Benchmark: https://huggingface.co/datasets/Delores-Lin/MDPBench
