how to use vllm to speed up inference?

#11
by menglan - opened

can give an example of how to using vllm to speed up?
thanks.

Sign up or log in to comment