Instructions to use MeiGen-AI/GenEvolve with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use MeiGen-AI/GenEvolve with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="MeiGen-AI/GenEvolve")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText

processor = AutoProcessor.from_pretrained("MeiGen-AI/GenEvolve")
model = AutoModelForImageTextToText.from_pretrained("MeiGen-AI/GenEvolve")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Local Apps

vLLM

How to use MeiGen-AI/GenEvolve with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "MeiGen-AI/GenEvolve"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "MeiGen-AI/GenEvolve",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'
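
The same request can also be sent from Python. The sketch below uses only the standard library and assumes the server started above is reachable at `http://localhost:8000` and returns the usual OpenAI-style `choices[0].message.content` field. The helper names `build_chat_request` and `chat` are illustrative and not part of vLLM or GenEvolve.

```python
import json
import urllib.request


def build_chat_request(base_url, model, prompt, image_url):
    """Build a POST request for an OpenAI-compatible chat completions endpoint."""
    payload = {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
    }
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def chat(base_url, model, prompt, image_url, timeout=120):
    """Send the request and return the assistant's reply text.

    Assumes the server is already running and answers in the OpenAI
    chat completions response format.
    """
    request = build_chat_request(base_url, model, prompt, image_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]
```

With the server running, calling `chat("http://localhost:8000", "MeiGen-AI/GenEvolve", "Describe this image in one sentence.", image_url)` should print-ready return the model's reply. Pointing `base_url` at `http://localhost:30000` would target an SGLang server started on that port instead.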

Use Docker

docker model run hf.co/MeiGen-AI/GenEvolve

SGLang

How to use MeiGen-AI/GenEvolve with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "MeiGen-AI/GenEvolve" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "MeiGen-AI/GenEvolve",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "MeiGen-AI/GenEvolve" \
        --host 0.0.0.0 \
        --port 30000
# Call the server with the same curl request shown in the pip-install steps above (port 30000).

Docker Model Runner
How to use MeiGen-AI/GenEvolve with Docker Model Runner:
```
docker model run hf.co/MeiGen-AI/GenEvolve
```

Ephemeral182 committed 3 days ago

Commit 562355d (verified), 1 parent: a81278f

README: sync Quick Start with latest GitHub (separate Qwen-Image-Edit FastAPI service, MeiGen-AI/GenEvolve as MODEL_PATH, install order)

Files changed (1)

README.md +34 -5

@@ -111,27 +111,56 @@ For a user request, the agent samples a multi-turn trajectory of tool calls befo
 ## 🚀 Quick Start
-The deployed checkpoint is the **student policy** — it consumes a user prompt and returns a JSON `gen_prompt + reference_images` program through a `<think>/<tool_call>/<answer>` loop. The end-to-end runtime (vLLM/SGLang server + agent loop + tools + Qwen/Nano renderers) lives in the [GitHub repo](https://github.com/Ephemeral182/GenEvolve).
 ```bash
 git clone https://github.com/Ephemeral182/GenEvolve.git
 cd GenEvolve
 conda create -n genevolve python=3.11 -y && conda activate genevolve
 pip install torch==2.8.0 torchvision==0.23.0 --index-url https://download.pytorch.org/whl/cu128
-pip install --no-build-isolation -r requirements.txt && pip install -e .
-# Serve the policy (TP/DP knobs scale across GPUs)
 MODEL_PATH=MeiGen-AI/GenEvolve PORT=8000 TP=1 DP=8 bash scripts/serve_vllm.sh
-# End-to-end example (Nano backend)
 export SERPER_API_KEY=<your_key>      # required for search / image_search
-export GOOGLE_API_KEY=<your_key>      # only for the Nano Banana Pro backend
 python examples/quickstart.py \
     --backend nano-banana-pro \
     --base-url http://localhost:8000/v1 \
     --model GenEvolve \
     --prompt "A 1990s travel-magazine cover of two backpackers in front of the Eiffel Tower at golden hour, the title \"PARIS\" in bold serif." \
     --output paris.png
 ```
 The agent's final `<answer>` is a JSON object:

 ## 🚀 Quick Start
+The deployed checkpoint is the **student policy** — it consumes a user prompt and returns a JSON `gen_prompt + reference_images` program through a `<think>/<tool_call>/<answer>` loop. The end-to-end runtime (vLLM serving + agent loop + tools + Qwen/Nano renderers) lives in the [GitHub repo](https://github.com/Ephemeral182/GenEvolve); the snippet below mirrors its installation and usage.
+### 1. Install the main GenEvolve runtime
 ```bash
 git clone https://github.com/Ephemeral182/GenEvolve.git
 cd GenEvolve
 conda create -n genevolve python=3.11 -y && conda activate genevolve
+pip install -U pip setuptools wheel packaging psutil ninja
 pip install torch==2.8.0 torchvision==0.23.0 --index-url https://download.pytorch.org/whl/cu128
+pip install --no-build-isolation -r requirements.txt
+pip install -e .
+```
+Qwen-Image-Edit rendering runs as a **separate FastAPI service** (kept out of the vLLM environment to avoid CUDA/diffusers conflicts). Set up that service from the GitHub README when you want to use `--backend qwen-image-edit-service`.
+### 2. Serve the agent policy
+```bash
+# Single GPU / single replica.
+MODEL_PATH=MeiGen-AI/GenEvolve PORT=8000 TP=1 DP=1 bash scripts/serve_vllm.sh
+# Higher throughput on one 8-GPU node (8 replicas, 1 GPU each).
 MODEL_PATH=MeiGen-AI/GenEvolve PORT=8000 TP=1 DP=8 bash scripts/serve_vllm.sh
+```
+`TP` shards one model replica across multiple GPUs; `DP` launches multiple replicas; total GPU usage is `TP × DP`.
+### 3. End-to-end example
+```bash
 export SERPER_API_KEY=<your_key>      # required for search / image_search
+export GOOGLE_API_KEY=<your_key>      # only for --backend nano-banana-pro
+# Nano Banana Pro renderer
 python examples/quickstart.py \
     --backend nano-banana-pro \
     --base-url http://localhost:8000/v1 \
     --model GenEvolve \
     --prompt "A 1990s travel-magazine cover of two backpackers in front of the Eiffel Tower at golden hour, the title \"PARIS\" in bold serif." \
     --output paris.png
+# Qwen-Image-Edit renderer (point at your Qwen-Image-Edit FastAPI service)
+python examples/quickstart.py \
+    --backend qwen-image-edit-service \
+    --service-url http://your-qwen-service:8001 \
+    --base-url http://localhost:8000/v1 \
+    --model GenEvolve \
+    --output paris_qwen.png
 ```
 The agent's final `<answer>` is a JSON object: