Add GSM8K evaluation result (89.3%)

#114
by burtenshaw HF Staff - opened

Adding GSM8K benchmark score from model card.

Score: 89.3% (8-shot)
Source: Model Card benchmark table

For those asking about API access — I've been using Crazyrouter as a unified gateway. One API key, OpenAI SDK compatible. Works well for testing different models without managing multiple accounts.

Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment