inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt4 0.5B • Updated 21 days ago • 92
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt5 0.5B • Updated 21 days ago • 142
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt0 0.5B • Updated 21 days ago • 85
inference-optimization/gpt-oss-20b-from-gpt-oss-120b-ckpt3-speculator.eagle3 0.9B • Updated 25 days ago • 21
inference-optimization/gpt-oss-20b-from-gpt-oss-120b-ckpt2-speculator.eagle3 0.9B • Updated 25 days ago • 97
inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch3 2B • Updated 26 days ago • 25
inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch2 2B • Updated 26 days ago • 12
inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch1 2B • Updated 26 days ago • 15
inference-optimization/gpt-oss-20b-from-gpt-oss-120b-ckpt1-speculator.eagle3 0.9B • Updated 26 days ago • 48
inference-optimization/gpt-oss-20b-from-gpt-oss-120b-ckpt0-speculator.eagle3 0.9B • Updated 26 days ago • 53
inference-optimization/Qwen3-Next-80B-A3B-Instruct_mtp_speculator Text Generation • 2B • Updated 27 days ago • 42
inference-optimization/Qwen3-32B-from-Qwen3-235B_resps-speculators.eagle3-ckpt0 2B • Updated 28 days ago • 83