Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -24,7 +24,12 @@ tags:
|
|
| 24 |
|
| 25 |
<table>
|
| 26 |
<tr>
|
| 27 |
-
<td align="center"><img src="./assets/effects/
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 28 |
</tr>
|
| 29 |
</table>
|
| 30 |
|
|
@@ -41,7 +46,7 @@ Unlike conventional frame-level enhancement pipelines, this model family operate
|
|
| 41 |
|
| 42 |
Project links: [GitHub project](https://github.com/Valiant-Cat/LTX2-ICEdit-Insight) | [JoyFox on Hugging Face](https://huggingface.co/joyfox)
|
| 43 |
|
| 44 |
-
## Model Files
|
| 45 |
|
| 46 |
| File | Purpose |
|
| 47 |
| --- | --- |
|
|
@@ -51,7 +56,7 @@ Project links: [GitHub project](https://github.com/Valiant-Cat/LTX2-ICEdit-Insig
|
|
| 51 |
| `ltx2.3-ic-watermark-remove-general.safetensors` | Watermark removal and occlusion-aware reconstruction |
|
| 52 |
| `ltx2.3-ic-subtitles-remove-general.safetensors` | Subtitle removal and text overlay cleanup |
|
| 53 |
|
| 54 |
-
## Showcase
|
| 55 |
|
| 56 |
<table>
|
| 57 |
<tr>
|
|
@@ -72,7 +77,7 @@ Project links: [GitHub project](https://github.com/Valiant-Cat/LTX2-ICEdit-Insig
|
|
| 72 |
</tr>
|
| 73 |
</table>
|
| 74 |
|
| 75 |
-
## Script Usage
|
| 76 |
|
| 77 |
Run all scripts from the project root.
|
| 78 |
|
|
@@ -86,7 +91,7 @@ bash run_subtitle_rm.sh
|
|
| 86 |
bash run_subtitle_rm.sh /path/to/input.mp4
|
| 87 |
```
|
| 88 |
|
| 89 |
-
## Command Examples
|
| 90 |
|
| 91 |
### Video Restoration
|
| 92 |
|
|
@@ -156,7 +161,7 @@ python run_pipeline.py \
|
|
| 156 |
--lora ./models/loras/ltx2.3-train/ltx2.3-ic-subtitles-remove-general.safetensors
|
| 157 |
```
|
| 158 |
|
| 159 |
-
## Key Improvements
|
| 160 |
|
| 161 |
### Task-Aware IC-Edit Framework
|
| 162 |
|
|
@@ -226,7 +231,7 @@ This is especially helpful for recovering:
|
|
| 226 |
- typography-like fine structures
|
| 227 |
- natural image clarity
|
| 228 |
|
| 229 |
-
## Inference Notes
|
| 230 |
|
| 231 |
- Single-stage inference is recommended for most editing tasks.
|
| 232 |
- Two-stage refinement can improve visual polish but may weaken task-specific LoRA constraints.
|
|
@@ -237,7 +242,7 @@ This is especially helpful for recovering:
|
|
| 237 |
- Frame count should follow the `8k + 1` rule.
|
| 238 |
- Output height and width should be multiples of `32` in single-stage inference.
|
| 239 |
|
| 240 |
-
## Training
|
| 241 |
|
| 242 |
This model family was trained and optimized by **JoyFox Lab** (**Chengdu Xuanhu Technology Co., Ltd.**).
|
| 243 |
|
|
@@ -251,13 +256,13 @@ The training pipeline includes:
|
|
| 251 |
- high-frequency perceptual enhancement
|
| 252 |
- instruction-guided video editing optimization
|
| 253 |
|
| 254 |
-
## Contact
|
| 255 |
|
| 256 |
For research collaboration, commercial licensing, or workflow integration, contact:
|
| 257 |
|
| 258 |
- `z@vvicat.com`
|
| 259 |
|
| 260 |
-
## License
|
| 261 |
|
| 262 |
Licensed under **Apache 2.0**.
|
| 263 |
|
|
|
|
| 24 |
|
| 25 |
<table>
|
| 26 |
<tr>
|
| 27 |
+
<td align="center"><img src="./assets/effects/output_004.webp" alt="Video restoration preview" width="420"/></td>
|
| 28 |
+
<td align="center"><img src="./assets/effects/视频高清对比效果.webp" alt="Video HD enhancement preview" width="420"/></td>
|
| 29 |
+
</tr>
|
| 30 |
+
<tr>
|
| 31 |
+
<td align="center"><img src="./assets/effects/去水印对比效果.webp" alt="Watermark removal preview" width="420"/></td>
|
| 32 |
+
<td align="center"><img src="./assets/effects/去字幕对比效果.webp" alt="Subtitle removal preview" width="420"/></td>
|
| 33 |
</tr>
|
| 34 |
</table>
|
| 35 |
|
|
|
|
| 46 |
|
| 47 |
Project links: [GitHub project](https://github.com/Valiant-Cat/LTX2-ICEdit-Insight) | [JoyFox on Hugging Face](https://huggingface.co/joyfox)
|
| 48 |
|
| 49 |
+
## 📦 Model Files
|
| 50 |
|
| 51 |
| File | Purpose |
|
| 52 |
| --- | --- |
|
|
|
|
| 56 |
| `ltx2.3-ic-watermark-remove-general.safetensors` | Watermark removal and occlusion-aware reconstruction |
|
| 57 |
| `ltx2.3-ic-subtitles-remove-general.safetensors` | Subtitle removal and text overlay cleanup |
|
| 58 |
|
| 59 |
+
## 🎬 Showcase
|
| 60 |
|
| 61 |
<table>
|
| 62 |
<tr>
|
|
|
|
| 77 |
</tr>
|
| 78 |
</table>
|
| 79 |
|
| 80 |
+
## 🚀 Script Usage
|
| 81 |
|
| 82 |
Run all scripts from the project root.
|
| 83 |
|
|
|
|
| 91 |
bash run_subtitle_rm.sh /path/to/input.mp4
|
| 92 |
```
|
| 93 |
|
| 94 |
+
## 💻 Command Examples
|
| 95 |
|
| 96 |
### Video Restoration
|
| 97 |
|
|
|
|
| 161 |
--lora ./models/loras/ltx2.3-train/ltx2.3-ic-subtitles-remove-general.safetensors
|
| 162 |
```
|
| 163 |
|
| 164 |
+
## ✨ Key Improvements
|
| 165 |
|
| 166 |
### Task-Aware IC-Edit Framework
|
| 167 |
|
|
|
|
| 231 |
- typography-like fine structures
|
| 232 |
- natural image clarity
|
| 233 |
|
| 234 |
+
## 🧠 Inference Notes
|
| 235 |
|
| 236 |
- Single-stage inference is recommended for most editing tasks.
|
| 237 |
- Two-stage refinement can improve visual polish but may weaken task-specific LoRA constraints.
|
|
|
|
| 242 |
- Frame count should follow the `8k + 1` rule.
|
| 243 |
- Output height and width should be multiples of `32` in single-stage inference.
|
| 244 |
|
| 245 |
+
## 🏗️ Training
|
| 246 |
|
| 247 |
This model family was trained and optimized by **JoyFox Lab** (**Chengdu Xuanhu Technology Co., Ltd.**).
|
| 248 |
|
|
|
|
| 256 |
- high-frequency perceptual enhancement
|
| 257 |
- instruction-guided video editing optimization
|
| 258 |
|
| 259 |
+
## 📬 Contact
|
| 260 |
|
| 261 |
For research collaboration, commercial licensing, or workflow integration, contact:
|
| 262 |
|
| 263 |
- `z@vvicat.com`
|
| 264 |
|
| 265 |
+
## 📜 License
|
| 266 |
|
| 267 |
Licensed under **Apache 2.0**.
|
| 268 |
|