with latest sglang, gibberish output

#6
by cudaoom - opened

speculative_algo: "DFLASH"
speculative_draft_model_path: "/workspace/draft"
speculative_draft_model_quantization: "unquant"
speculative_num_draft_tokens: 8
speculative_dflash_draft_window_size: 4096

Thanks for reporting this issue. Did you install the latest commit from PR 23000? Could you also share the GPU information for the run? That would help us figure out the issue.
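
In case it helps, one possible way to gather the GPU details is a short script like the one below. This is only a sketch: it assumes an NVIDIA GPU with `nvidia-smi` available on the PATH, and the helper names `parse_gpu_csv` and `collect_gpu_info` are made up for illustration, not part of sglang.

```python
import shutil
import subprocess

# Fields passed to `nvidia-smi --query-gpu=...`.
QUERY_FIELDS = ["name", "memory.total", "driver_version"]


def parse_gpu_csv(text):
    """Parse `nvidia-smi --format=csv,noheader` output into a list of dicts."""
    rows = []
    for line in text.strip().splitlines():
        values = [v.strip() for v in line.split(",")]
        # Skip lines that do not have exactly one value per queried field.
        if len(values) == len(QUERY_FIELDS):
            rows.append(dict(zip(QUERY_FIELDS, values)))
    return rows


def collect_gpu_info():
    """Return one dict per GPU, or an empty list if nvidia-smi is unavailable."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=" + ",".join(QUERY_FIELDS),
            "--format=csv,noheader",
        ],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return []
    return parse_gpu_csv(result.stdout)


if __name__ == "__main__":
    for index, gpu in enumerate(collect_gpu_info()):
        print(index, gpu)
```

Pasting the printed lines into the thread, together with the sglang commit you installed, should be enough to start with.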

Hi, could you share the sglang command that previously worked for you? And what GPU are you using?