add MCP server, REST API, docstring, gradio[mcp], ~30 min timing b7cce06 Nekochu commited on 7 days ago
preload models into page cache, offload-to-cpu, IQ4_XS text enc, conv-direct, mmap, vae f32, miku theme 00c018e Nekochu commited on 7 days ago
revert to 8-step Q5_0 GGUF (4-step OOMs during build conversion) c9b2c0d Nekochu commited on 7 days ago
fix: use q5_0 quantization type (q5_k_m not supported by sd.cpp) 4393113 Nekochu commited on 7 days ago
4-step distill: download BF16, convert to Q5_K_M GGUF at build time 6fa797e Nekochu commited on 7 days ago
Z-Anime 6B CPU: distill 8-step Q5_0, Qwen3-4B Q8_0, euler_a, beta schedule 736cf48 Nekochu commited on 7 days ago