speculative decoding acceptance rate 0%

#1
by rumcs - opened

Hi, thank you for providing this model.
I am trying to use it with vLLM, but speculative decoding does not seem to work at all: the acceptance rate is 0%. Was the drafter model fine-tuned as well? Thanks!
