Glanty
/

Capybara

Any-to-Any

Diffusers

Safetensors

Model card Files Files and versions

xet

Community

RainCCH commited on Feb 20

Commit

8884191

1 Parent(s): 381691b

update README.md

Browse files

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -47,7 +47,7 @@ The framework leverages advanced diffusion models and transformer architectures
 - [ ] Release our unified creation model.
 - [ ] Release training code.
-## 🏞️ Show Caeses
 **Results of generation tasks.** We show two generation tasks under our unified model. The top section presents text-to-image results, illustrating high-fidelity synthesis across diverse styles. The bottom rows show text-to-video results, demonstrating temporally coherent generation with natural motion for both realistic and stylized content.
 <p align="center">
 <img src="./assets/misc/gen_teaser.png" style="width: 100%; height: auto;"/>
@@ -63,7 +63,7 @@ The framework leverages advanced diffusion models and transformer architectures
 <img src="./assets/misc/videoedit_teaser5.png" style="width: 100%; height: auto;"/>
 </p>
-**Results of in-context visual creation.** We show in-context generation and in-context editing results , including subject-conditioned generation (S2V/S2I), conditional generation(C2V), image-to-video(I2V), reference-driven editing (II2I/IV2V).
 <p align="center">
 <img src="./assets/misc/incontext_teaser2.png" style="width: 100%; height: auto;"/>
 </p>
@@ -212,8 +212,8 @@ For editing tasks (TI2I / TV2V), prepare a CSV with `img_path`/`video_path` and
 ```csv
 img_path,instruction
-img1.jpeg,insturction1.
-img2.jpeg,insturction2.
 ```
 > The path column holds relative paths to media files (images or videos) under the data root directory.
@@ -268,7 +268,7 @@ ln -s /path/to/Capybara /path/to/ComfyUI/custom_nodes/Capybara
 | **Capybara Load Rewrite Model** | Load Qwen3-VL for prompt rewriting |
 | **Capybara Rewrite Instruction** | Expand short prompts into detailed instructions |
-A sample workflow is provided in [`comfyui/examples/`](comfyui/examples/). For setup details and node documentation, see the [ComfyUI README](comfyui/README.md).
 ## ⚙️ Configuration Details

 - [ ] Release our unified creation model.
 - [ ] Release training code.
+## 🏞️ Show Cases
 **Results of generation tasks.** We show two generation tasks under our unified model. The top section presents text-to-image results, illustrating high-fidelity synthesis across diverse styles. The bottom rows show text-to-video results, demonstrating temporally coherent generation with natural motion for both realistic and stylized content.
 <p align="center">
 <img src="./assets/misc/gen_teaser.png" style="width: 100%; height: auto;"/>
 <img src="./assets/misc/videoedit_teaser5.png" style="width: 100%; height: auto;"/>
 </p>
+**Results of in-context visual creation.** We show in-context generation and in-context editing results, including subject-conditioned generation (S2V/S2I), conditional generation (C2V), image-to-video (I2V), reference-driven editing (II2I/IV2V).
 <p align="center">
 <img src="./assets/misc/incontext_teaser2.png" style="width: 100%; height: auto;"/>
 </p>
 ```csv
 img_path,instruction
+img1.jpeg,instruction1.
+img2.jpeg,instruction2.
 ```
 > The path column holds relative paths to media files (images or videos) under the data root directory.
 | **Capybara Load Rewrite Model** | Load Qwen3-VL for prompt rewriting |
 | **Capybara Rewrite Instruction** | Expand short prompts into detailed instructions |
+A sample workflow is provided in [`comfyui/examples/`](https://github.com/xgen-universe/Capybara/tree/main/comfyui/examples). For setup details and node documentation, see the [ComfyUI README](https://github.com/xgen-universe/Capybara/tree/main/comfyui).
 ## ⚙️ Configuration Details