Spaces:
Running on CPU Upgrade
Running on CPU Upgrade
Long Context Evaluation
#430
by mrfakename - opened
Hi,
Is there any evaluation to test ability to perform well on longer prompts?
I would like this one, too
Hi! If one of you wants to set up the above dataset as a leaderboard, I can give you a hand. (We won't add it to the Open LLM Leaderboard however)
clefourrier changed discussion status to closed
This comment has been hidden