Add GSM8K evaluation result (89.3%)
#114
by burtenshaw HF Staff - opened
Adding GSM8K benchmark score from model card.
Score: 89.3% (8-shot)
Source: Model Card benchmark table
For those asking about API access — I've been using Crazyrouter as a unified gateway. One API key, OpenAI SDK compatible. Works well for testing different models without managing multiple accounts.