can not use the gpu on AMD RYZEN AI MAX+ 395 w/ Radeon 8060S
#3
by HL973 - opened
很奇怪,你这个量化版似乎不能使用我的gpu,但同时我下载了moxin-org/nemotron-3-nano-30b-a3b和unsloth/nemotron-3-nano-30b-a3b,他们都可以完全加载到gpu中,
以下为unsloth/nemotron-3-nano-30b-a3b,和moxin-org/nemotron-3-nano-30b-a3b的基本一样:
load_tensors: loading model tensors, this can take a while... (mmap = false)
load_tensors: offloading 52 repeating layers to GPU
load_tensors: offloading output layer to GPU
load_tensors: offloaded 53/53 layers to GPU
load_tensors: Vulkan0 model buffer size = 31591.35 MiB
load_tensors: Vulkan_Host model buffer size = 357.00 MiB
