Instructions for using Glanty/Capybara with libraries, inference providers, notebooks, and local apps. Follow the links below to get started.
- Libraries
- Diffusers
How to use Glanty/Capybara with Diffusers:

```shell
pip install -U diffusers transformers accelerate
```

```python
import torch
from diffusers import DiffusionPipeline

# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("Glanty/Capybara", dtype=torch.bfloat16, device_map="cuda")

prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt).images[0]
```

- Notebooks
- Google Colab
- Kaggle
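The Diffusers snippet above hard-codes `device_map="cuda"`, with a comment suggesting `"mps"` on Apple devices. A small helper can pick whichever backend is available at runtime; this is a hedged sketch, and `pick_device` is an illustrative name, not part of Diffusers or the Capybara repo:

```python
# Hypothetical helper to choose a torch device string for the pipeline above.
try:
    import torch
except ImportError:  # let the sketch degrade gracefully without torch
    torch = None


def pick_device() -> str:
    """Return "cuda", "mps", or "cpu" depending on what is available."""
    if torch is not None and torch.cuda.is_available():
        return "cuda"
    mps = getattr(getattr(torch, "backends", None), "mps", None) if torch else None
    if mps is not None and mps.is_available():
        return "mps"
    return "cpu"
```

With this, the snippet could call `DiffusionPipeline.from_pretrained(..., device_map=pick_device())` instead of hard-coding the device.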
Update README.md about ComfyUI
README.md
CHANGED
````diff
@@ -39,10 +39,11 @@ The framework leverages advanced diffusion models and transformer architectures
 
 ## 🔥 News
 
-* **[2026.02.
+* **[2026.02.20]** 🎨 Added [ComfyUI support](#-comfyui-support) with custom nodes for all task types (T2I, T2V, TI2I, TV2V).
+* **[2026.02.17]** 🚀 Initial release v0.1 of the Capybara inference framework supporting generation and instruction-based editing tasks (T2I, T2V, TI2I, TV2V).
 
 ## 📋 TODO List
-- [
+- [x] Add support for ComfyUI.
 - [ ] Release our unified creation model.
 - [ ] Release training code.
 
@@ -158,7 +159,7 @@ python inference.py \
 python inference.py \
     --pretrained_model_name_or_path ./ckpts \
     --media_path ./assets/examples/video1.mp4 \
-    --prompt "Replace the monkey with Ultraman.
+    --prompt "Replace the monkey with Ultraman. Keep the Ultraman's motion matched the original running pose and motion of monkey." \
     --output_path ./results/test_single_output/tv2v \
     --num_inference_steps 50 \
     --num_frames 81 \
@@ -251,6 +252,24 @@ accelerate launch --config_file acc_config/accelerate_config.yaml --num_processe
     --rewrite_instruction
 ```
 
+## 🎨 ComfyUI Support
+
+Capybara provides custom ComfyUI nodes for all task types (T2V, T2I, TI2I, TV2V).
+
+```bash
+ln -s /path/to/Capybara /path/to/ComfyUI/custom_nodes/Capybara
+```
+
+| Node | Description |
+| --- | --- |
+| **Capybara Load Pipeline** | Load all model components with automatic attention backend selection |
+| **Capybara Generate** | Main generation / editing node for all task types |
+| **Capybara Load Video** | Load a video file as IMAGE frames + fps |
+| **Capybara Load Rewrite Model** | Load Qwen3-VL for prompt rewriting |
+| **Capybara Rewrite Instruction** | Expand short prompts into detailed instructions |
+
+A sample workflow is provided in [`comfyui/examples/`](comfyui/examples/). For setup details and node documentation, see the [ComfyUI README](comfyui/README.md).
+
 ## ⚙️ Configuration Details
 
 ### Task Types
````
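For context on why the symlink into `custom_nodes/` is all the installation the diff needs: ComfyUI scans that directory for Python packages exporting a `NODE_CLASS_MAPPINGS` dict and builds the node UI from each class's declared inputs and outputs. The sketch below shows that convention with a deliberately trivial, hypothetical node; it is not one of the real Capybara nodes, and a real node body would invoke the pipeline:

```python
class CapybaraPromptNode:
    """Illustrative ComfyUI custom node (hypothetical, not part of Capybara)."""

    @classmethod
    def INPUT_TYPES(cls):
        # ComfyUI reads this to build the node's input sockets/widgets.
        return {"required": {"prompt": ("STRING", {"default": "", "multiline": True})}}

    RETURN_TYPES = ("STRING",)  # one output socket of type STRING
    FUNCTION = "run"            # method ComfyUI calls when the node executes
    CATEGORY = "Capybara"       # menu category in the ComfyUI node browser

    def run(self, prompt):
        # Trivial body for illustration; a real node would run generation/editing.
        return (prompt.strip(),)


# ComfyUI scans custom_nodes/ for modules exporting these mappings.
NODE_CLASS_MAPPINGS = {"CapybaraPrompt": CapybaraPromptNode}
NODE_DISPLAY_NAME_MAPPINGS = {"CapybaraPrompt": "Capybara Prompt (example)"}
```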