Anybody running this version on DGX Spark ?

by dionode - opened 20 days ago

•

I found this quantization ideal to run on the DGX Spark, however the NVIDIA container registry most updated version still runs vLLM 15.

I tried to build vLLM fro source but faced multiple out of memory or dependency errors.

Someone already running this version on the Spark ? I think, I'll wait for updates on the container registry.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment