28.4 kB
muthuk1's picture
Benchmark: add LLM-as-a-Judge + BERTScore (hackathon 30% accuracy criterion)
5d58764