how to run gpt-oss-120b by ollama using multi-gpu
I'd recommend you using vLLM
· Sign up or log in to comment