gemma4-31B-it-speculator.eagle3
This is a preliminary model release, we will continue to train the model and improve the acceptance rates in the next few days.
Model Overview
- Verifier: google/gemma-4-26B-A4B-it
- Speculative Decoding Algorithm: EAGLE-3
- Model Architecture: Eagle3Speculator
- Release Date: 04/13/2026
- Version: 1.0
- Model Developers: RedHat
This is a speculator model designed for use with google/gemma-4-26B-A4B-it, based on the EAGLE-3 speculative decoding algorithm.
It was trained using the Speculators library on a combination of the Magpie-Align/Magpie-Llama-3.1-Pro-300K-Filtered dataset and the train_sft split of the HuggingFaceH4/ultrachat_200k dataset. Training data used Magpie + UltraChat with responses from the gemma-4-31B-it model (no reasoning).
This model should be used with the google/gemma-4-26B-A4B-it chat template, specifically through the /chat/completions endpoint.
vLLM version
UPDATE: Now supported on vllm-main!
Use with vLLM
vllm serve google/gemma-4-26B-A4B-it \
--tensor-parallel-size 2 \
--speculative-config '{
"model": "RedHatAI/gemma-4-26B-A4B-it-speculator.eagle3",
"num_speculative_tokens": 3,
"method": "eagle3"
}' \
--max-num-seqs 64 \
Evaluations
Model / run:
vLLM: UPDATE: Now supported on vllm-main!
Training data: Magpie + UltraChat; responses from the gemma 4 31B it model (no reasoning).
Use cases
| Use Case | Dataset | Number of Samples |
|---|---|---|
| Coding | HumanEval | 164 |
| Math Reasoning | math_reasoning | 80 |
| Question Answering | qa | 80 |
| MT_bench (Question) | question | 80 |
| RAG | rag | 80 |
| Summarization | summarization | 80 |
| Translation | translation | 80 |
Acceptance lengths (draft length, temperature=default)
| Dataset | k=1 | k=2 | k=3 | k=4 | k=5 |
|---|---|---|---|---|---|
| HumanEval | 1.79 | 2.37 | 2.75 | 3.01 | 3.18 |
| math_reasoning | 1.83 | 2.48 | 2.95 | 3.28 | 3.51 |
| qa | 1.57 | 1.85 | 1.99 | 2.07 | 2.09 |
| question | 1.66 | 2.05 | 2.29 | 2.40 | 2.47 |
| rag | 1.64 | 2.02 | 2.21 | 2.30 | 2.36 |
| summarization | 1.53 | 1.77 | 1.87 | 1.91 | 1.93 |
| translation | 1.61 | 1.95 | 2.13 | 2.23 | 2.31 |
- Downloads last month
- 590