evaluation
#18
by ldwang - opened
Are there any evaluation tools or repos available for pretrained models and instruction-tuned models? I’d like to evaluate their performance locally.
Thanks a lot.
EleutherAI LM Evaluation Harness: https://github.com/EleutherAI/lm-evaluation-harness (original)
Huggingface modification: https://github.com/huggingface/lm-evaluation-harness/tree/adding_all_changess
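For a local run, here is a rough sketch of how a command for the EleutherAI harness could be put together. It assumes a recent release that installs the `lm_eval` command-line entry point with the `hf` model type; older checkouts (probably including the Huggingface fork linked above) are run through `python main.py` with different model names, so check the README of the version you install. The helper `build_eval_command`, the model ID, and the task names are only illustrative choices, not something from this thread. The snippet only builds and prints the command, it does not run it.

```python
import shlex


def build_eval_command(model_id, tasks, num_fewshot=0, batch_size=8, device="cuda:0"):
    """Build an lm_eval CLI invocation as an argument list (nothing is executed).

    Hypothetical helper for illustration; flag names assume a recent
    lm-evaluation-harness release and may differ in older versions or forks.
    """
    return [
        "lm_eval",
        "--model", "hf",                          # Hugging Face transformers backend
        "--model_args", f"pretrained={model_id}",  # Hub ID or local path of the model
        "--tasks", ",".join(tasks),                # comma-separated task names
        "--num_fewshot", str(num_fewshot),
        "--batch_size", str(batch_size),
        "--device", device,
    ]


# Example placeholder values: swap in your own model and tasks.
cmd = build_eval_command("EleutherAI/pythia-160m", ["hellaswag", "arc_easy"])

# Print a shell-safe version of the command to copy into a terminal.
print(shlex.join(cmd))
```

Once the harness is installed, the printed line can be pasted into a shell, or `cmd` can be passed to `subprocess.run(cmd, check=True)`.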
Seems resolved to me! Closing the issue!
Weyaxi changed discussion status to closed