Can vllm or sglang run the weights ?

#1
by nwzjk - opened

I'm working with 4090

baa.ai org

The advantage of MLX is that it supports mixed precision, so you would need to look for an engine that supports that on your platform.

Sign up or log in to comment