vLLM compatibility and concurrency

#1
by chatboo - opened

Can this be run with vLLM / Orpheus speech and concurrent requests? Cheers.

Unsloth AI org

Unsure, unfortunately. You might need to ask the vLLM team or test it 🙏

Thanks, I did a GPTQ export anyway.
