Nekochu committed on
Commit
736cf48
·
1 Parent(s): 18070fb

Z-Anime 6B CPU: distill 8-step Q5_0, Qwen3-4B Q8_0, euler_a, beta schedule

Browse files
Files changed (3) hide show
  1. Dockerfile +61 -0
  2. README.md +41 -5
  3. app.py +145 -0
Dockerfile ADDED
@@ -0,0 +1,61 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
# ---------------------------------------------------------------------------
# Build stage: compile stable-diffusion.cpp's sd-cli with OpenBLAS
# ---------------------------------------------------------------------------
FROM ubuntu:22.04 AS builder

ENV DEBIAN_FRONTEND=noninteractive

RUN apt-get update && apt-get install -y --no-install-recommends \
    build-essential cmake git ca-certificates libopenblas-dev pkg-config \
    && rm -rf /var/lib/apt/lists/*

WORKDIR /build

RUN git clone --depth 1 https://github.com/leejet/stable-diffusion.cpp.git . \
    && git submodule update --init --depth 1

# -j1 keeps peak build memory low (free CPU Spaces have limited RAM).
RUN mkdir build && cd build \
    && cmake .. -DGGML_BLAS=ON -DSD_BUILD_SHARED_LIBS=OFF \
    && cmake --build . --config Release -j1

# Collect the binary plus any shared libs the build may have produced.
RUN mkdir -p /artifacts \
    && cp /build/build/bin/sd-cli /artifacts/ \
    && (cp -a /build/build/bin/lib*.so* /artifacts/ 2>/dev/null || true)

# ---------------------------------------------------------------------------
# Runtime image
# ---------------------------------------------------------------------------
FROM ubuntu:22.04

ENV DEBIAN_FRONTEND=noninteractive

RUN apt-get update && apt-get install -y --no-install-recommends \
    libopenblas0 libgomp1 ca-certificates curl \
    python3 python3-pip \
    && rm -rf /var/lib/apt/lists/*

WORKDIR /app

COPY --from=builder /artifacts/ /app/
# FIX: previously "/app:${LD_LIBRARY_PATH}" — with LD_LIBRARY_PATH unset in the
# base image this left a trailing colon (an empty entry), which ld.so treats as
# the current working directory. Set only the path we actually need.
ENV LD_LIBRARY_PATH=/app
RUN chmod +x /app/sd-cli

RUN mkdir -p /app/models

# Download Z-Anime distill 8-step Q5_0 GGUF (~4.51GB)
RUN curl -fL --retry 3 --retry-delay 5 -o /app/models/z-anime-8steps-q5_0.gguf \
    "https://huggingface.co/DaNS2025/Z-Anime_8-steps.GGUF/resolve/main/Z-Anime-8steps.q5_0.gguf"

# Download Qwen3-4B text encoder Q8_0 GGUF (~4.28GB)
RUN curl -fL --retry 3 --retry-delay 5 -o /app/models/qwen3_4b_q8_0.gguf \
    "https://huggingface.co/worstplayer/Z-Image_Qwen_3_4b_text_encoder_GGUF/resolve/main/Qwen_3_4b-Q8_0.gguf"

# Download VAE (~168MB)
RUN curl -fL --retry 3 --retry-delay 5 -o /app/models/ae.safetensors \
    "https://huggingface.co/SeeSee21/Z-Anime/resolve/main/vae/ae.safetensors"

# Install Python deps
RUN pip3 install --no-cache-dir gradio Pillow

COPY app.py /app/app.py

EXPOSE 7860

CMD ["python3", "/app/app.py"]
README.md CHANGED
@@ -1,10 +1,46 @@
1
  ---
2
- title: Z Anime CPU
3
- emoji: 👁
4
- colorFrom: purple
5
- colorTo: blue
6
  sdk: docker
7
  pinned: false
 
 
 
 
 
 
 
 
 
 
 
8
  ---
9
 
10
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: Z-Anime Image Generation (CPU)
3
+ emoji: 🎨
4
+ colorFrom: blue
5
+ colorTo: pink
6
  sdk: docker
7
  pinned: false
8
+ license: apache-2.0
9
+ tags:
10
+ - image-generation
11
+ - z-anime
12
+ - gguf
13
+ - cpu
14
+ - anime
15
+ short_description: Z-Anime 6B - CPU anime image generation via sd.cpp
16
+ models:
17
+ - SeeSee21/Z-Anime
18
+ startup_duration_timeout: 1h
19
  ---
20
 
21
+ # Z-Anime Image Generation (CPU)
22
+
23
+ Generate anime images with [Z-Anime 6B](https://huggingface.co/SeeSee21/Z-Anime) (S3-DiT, distill 8-step) via [stable-diffusion.cpp](https://github.com/leejet/stable-diffusion.cpp). Runs on free CPU Spaces.
24
+
25
+ ## Models
26
+
27
+ | Component | File | Size |
28
+ |-----------|------|------|
29
+ | Diffusion (DiT) | Z-Anime-8steps Q5_0 GGUF | 4.51 GB |
30
+ | Text Encoder | Qwen3-4B Q8_0 GGUF | 4.28 GB |
31
+ | VAE | ae.safetensors | 168 MB |
32
+
33
+ ## Settings
34
+
35
+ - **Steps:** 8 (distilled)
36
+ - **CFG:** 1.0
37
+ - **Sampler:** euler_ancestral
38
+ - **Resolution:** 512x512 (recommended on CPU)
39
+
40
+ ## Credits
41
+
42
+ - [Z-Anime](https://huggingface.co/SeeSee21/Z-Anime) by SeeSee21
43
+ - [Z-Image](https://github.com/Tongyi-MAI/Z-Image) by Alibaba Tongyi Lab
44
+ - [stable-diffusion.cpp](https://github.com/leejet/stable-diffusion.cpp) by leejet
45
+ - [Z-Anime 8-step GGUF](https://huggingface.co/DaNS2025/Z-Anime_8-steps.GGUF) by DaNS2025
46
+ - [Qwen3-4B Text Encoder GGUF](https://huggingface.co/worstplayer/Z-Image_Qwen_3_4b_text_encoder_GGUF) by worstplayer
app.py ADDED
@@ -0,0 +1,145 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
"""Z-Anime 6B Image Generation (CPU) via sd-cli binary"""

import os
import subprocess
import tempfile
import threading
import time

import gradio as gr
from PIL import Image

# ---------------------------------------------------------------------------
# Model paths (downloaded at build time)
# ---------------------------------------------------------------------------
DIFFUSION = "/app/models/z-anime-8steps-q5_0.gguf"  # S3-DiT diffusion weights
LLM = "/app/models/qwen3_4b_q8_0.gguf"              # Qwen3-4B text encoder
VAE = "/app/models/ae.safetensors"                  # autoencoder weights

# Fixed generation settings for the 8-step distilled model.
RESOLUTIONS = ["512x512", "768x512", "512x768"]
STEPS = 8
CFG = 1.0
TIMEOUT = 10800  # seconds allowed per sd-cli invocation

# The currently running sd-cli process (if any), guarded by a lock so the
# Gradio unload callback can kill it when the user disconnects.
_active_proc = None
_proc_lock = threading.Lock()
+
26
+ def generate(prompt, negative_prompt, resolution, seed):
27
+ global _active_proc
28
+
29
+ if not prompt or not prompt.strip():
30
+ raise gr.Error("Please enter a prompt.")
31
+
32
+ prompt = prompt.strip()[:500]
33
+ w, h = (int(x) for x in resolution.split("x"))
34
+ seed = int(seed or -1) if seed is not None else -1
35
+
36
+ with tempfile.NamedTemporaryFile(suffix=".png", delete=False) as f:
37
+ output_path = f.name
38
+
39
+ cmd = [
40
+ "/app/sd-cli",
41
+ "--diffusion-model", DIFFUSION,
42
+ "--llm", LLM,
43
+ "--vae", VAE,
44
+ "-p", prompt,
45
+ "-n", negative_prompt or "",
46
+ "-W", str(w),
47
+ "-H", str(h),
48
+ "--steps", str(STEPS),
49
+ "--cfg-scale", str(CFG),
50
+ "--sampling-method", "euler_a",
51
+ "--schedule", "beta",
52
+ "-o", output_path,
53
+ "--diffusion-fa",
54
+ "--vae-tiling",
55
+ "-v",
56
+ ]
57
+ if seed >= 0:
58
+ cmd += ["-s", str(seed)]
59
+
60
+ print(f"[gen] {w}x{h} steps={STEPS} seed={seed} prompt={prompt[:80]}")
61
+ t0 = time.time()
62
+
63
+ try:
64
+ proc = subprocess.Popen(cmd, stdout=subprocess.PIPE, stderr=subprocess.PIPE)
65
+ with _proc_lock:
66
+ _active_proc = proc
67
+
68
+ try:
69
+ stdout, stderr = proc.communicate(timeout=TIMEOUT)
70
+ except subprocess.TimeoutExpired:
71
+ proc.kill()
72
+ proc.wait()
73
+ raise
74
+
75
+ elapsed = time.time() - t0
76
+
77
+ with _proc_lock:
78
+ _active_proc = None
79
+
80
+ if proc.returncode != 0:
81
+ err = stderr.decode(errors="replace")[-500:] if stderr else "Unknown error"
82
+ if proc.returncode == -9:
83
+ raise gr.Error("Out of memory (killed by OS). Try 512x512.")
84
+ raise gr.Error(f"sd-cli failed (code {proc.returncode}): {err}")
85
+
86
+ if not os.path.exists(output_path) or os.path.getsize(output_path) == 0:
87
+ raise gr.Error("No output image generated")
88
+
89
+ img = Image.open(output_path)
90
+ status = f"Generated in {elapsed:.1f}s ({w}x{h}, {STEPS} steps)"
91
+ print(f"[gen] {status}")
92
+ return img, status
93
+
94
+ except subprocess.TimeoutExpired:
95
+ with _proc_lock:
96
+ _active_proc = None
97
+ raise gr.Error(f"Generation timed out ({TIMEOUT//60} min limit)")
98
+ except gr.Error:
99
+ with _proc_lock:
100
+ _active_proc = None
101
+ raise
102
+ except Exception as e:
103
+ with _proc_lock:
104
+ _active_proc = None
105
+ raise gr.Error(f"Error: {e}")
106
+
107
+ # ---------------------------------------------------------------------------
108
+ # Gradio UI
109
+ # ---------------------------------------------------------------------------
110
+ with gr.Blocks(title="Z-Anime (CPU)") as demo:
111
+ gr.Markdown(
112
+ "**[Z-Anime 6B](https://huggingface.co/SeeSee21/Z-Anime)** S3-DiT Q5_0 GGUF "
113
+ "(distill 8-step) via [sd.cpp](https://github.com/leejet/stable-diffusion.cpp) | "
114
+ "Free CPU inference"
115
+ )
116
+ with gr.Row():
117
+ with gr.Column():
118
+ prompt_input = gr.Textbox(label="Prompt", lines=3,
119
+ placeholder="anime girl with silver hair, fantasy armor, dramatic lighting")
120
+ neg_input = gr.Textbox(label="Negative Prompt", lines=2,
121
+ value="lowres, bad anatomy, bad hands, text, error, worst quality, blurry")
122
+ with gr.Row():
123
+ res_input = gr.Dropdown(choices=RESOLUTIONS, value="512x512",
124
+ label="Resolution")
125
+ seed_input = gr.Number(value=-1, label="Seed (-1=random)", precision=0)
126
+ gen_btn = gr.Button("Generate (8 steps, CFG 1)", variant="primary", size="lg")
127
+ with gr.Column():
128
+ output_img = gr.Image(type="pil", label="Output")
129
+ status_box = gr.Textbox(label="Status", interactive=False)
130
+
131
+ gen_btn.click(fn=generate,
132
+ inputs=[prompt_input, neg_input, res_input, seed_input],
133
+ outputs=[output_img, status_box],
134
+ concurrency_limit=1)
135
+
136
+ def _on_unload():
137
+ with _proc_lock:
138
+ proc = _active_proc
139
+ if proc and proc.poll() is None:
140
+ print("[cleanup] User disconnected, killing sd-cli process")
141
+ proc.kill()
142
+
143
+ demo.unload(_on_unload)
144
+
145
+ demo.launch(server_name="0.0.0.0", server_port=7860, show_error=True)