runtime error
Exit code: 1. Reason:
88Z level=INFO source=ggml.go:104 msg=system CPU.0.SSE3=1 CPU.0.SSSE3=1 CPU.0.AVX=1 CPU.0.AVX2=1 CPU.0.F16C=1 CPU.0.FMA=1 CPU.0.BMI2=1 CPU.0.AVX512=1 CPU.0.AVX512_VBMI=1 CPU.0.AVX512_VNNI=1 CPU.0.LLAMAFILE=1 CPU.1.LLAMAFILE=1 compiler=cgo(gcc)
time=2026-04-01T14:31:45.481Z level=INFO source=runner.go:1284 msg=load request="{Operation:alloc LoraPath:[] Parallel:1 BatchSize:512 FlashAttention:Enabled KvSize:4096 KvCacheType: NumThreads:2 GPULayers:[] MultiUserCache:false ProjectorPath: MainGPU:0 UseMmap:false}"
time=2026-04-01T14:31:45.965Z level=INFO source=runner.go:1284 msg=load request="{Operation:commit LoraPath:[] Parallel:1 BatchSize:512 FlashAttention:Enabled KvSize:4096 KvCacheType: NumThreads:2 GPULayers:[] MultiUserCache:false ProjectorPath: MainGPU:0 UseMmap:false}"
time=2026-04-01T14:31:45.966Z level=INFO source=ggml.go:482 msg="offloading 0 repeating layers to GPU"
time=2026-04-01T14:31:45.966Z level=INFO source=ggml.go:486 msg="offloading output layer to CPU"
time=2026-04-01T14:31:45.966Z level=INFO source=ggml.go:494 msg="offloaded 0/35 layers to GPU"
time=2026-04-01T14:31:45.966Z level=INFO source=device.go:245 msg="model weights" device=CPU size="3.6 GiB"
time=2026-04-01T14:31:45.966Z level=INFO source=device.go:256 msg="kv cache" device=CPU size="254.0 MiB"
time=2026-04-01T14:31:45.966Z level=INFO source=device.go:267 msg="compute graph" device=CPU size="126.9 MiB"
time=2026-04-01T14:31:45.966Z level=INFO source=device.go:272 msg="total memory" size="3.9 GiB"
time=2026-04-01T14:31:45.966Z level=INFO source=sched.go:561 msg="loaded runners" count=1
time=2026-04-01T14:31:45.966Z level=INFO source=server.go:1352 msg="waiting for llama runner to start responding"
time=2026-04-01T14:31:45.968Z level=INFO source=server.go:1386 msg="waiting for server to become available" status="llm server loading model"
time=2026-04-01T14:31:47.598Z level=INFO source=server.go:1390 msg="llama runner started in 2.51 seconds"
[GIN] 2026/04/01 - 14:31:59 | 200 | 14.638057769s | 127.0.0.1 | POST "/api/generate"