Instructions to use zghhui/OmniNFT with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use zghhui/OmniNFT with PEFT:
Task type is invalid.
- Notebooks
- Google Colab
- Kaggle
metadata
license: apache-2.0
base_model:
- Lightricks/LTX-2
- Lightricks/LTX-2.3
pipeline_tag: any-to-any
library_name: peft
tags:
- lora
- reinforcement-learning
- GRPO
- T2AV
π¬ OmniNFT RL-LoRA for LTX Video
π Modality-wise Omni Diffusion Negative-aware Fine-Tuning for LTX-2 and LTX-2.3~
π Model Files
| Folder | Base Model | Description |
|---|---|---|
LTX-2-RL-Lora/ |
Lightricks/LTX-2 | RL-LoRA for LTX-2 |
LTX-2.3-RL-Lora/ |
Lightricks/LTX-2.3 | RL-LoRA for LTX-2.3 |
π οΈ Quick Start
1οΈβ£ Clone the Code
git clone https://github.com/zghhui/OmniNFT.git
cd OmniNFT
pip install -r requirements.txt
2οΈβ£ Download Base Models
Download the base checkpoints from HuggingFace:
# LTX-2
huggingface-cli download Lightricks/LTX-2 --local-dir ./checkpoints/LTX-2
# LTX-2.3
huggingface-cli download Lightricks/LTX-2.3 --local-dir ./checkpoints/LTX-2.3
3οΈβ£ Download RL-LoRA Weights
# LTX-2 LoRA
huggingface-cli download zghhui/OmniNFT \
--include "LTX-2-RL-Lora/*" \
--local-dir ./checkpoints/
# LTX-2.3 LoRA
huggingface-cli download zghhui/OmniNFT \
--include "LTX-2.3-RL-Lora/*" \
--local-dir ./checkpoints/
4οΈβ£ Merge LoRA into Base Model
# π Merge LTX-2 + RL-LoRA
python scripts/merge_lora.py \
--checkpoint-path ./checkpoints/LTX-2/transformer/model.safetensors \
--lora-dir ./checkpoints/LTX-2-RL-Lora \
--output-path ./merged_model_ltx2.safetensors \
--dtype bf16
# π Merge LTX-2.3 + RL-LoRA
python scripts/merge_lora.py \
--checkpoint-path ./checkpoints/LTX-2.3/transformer/model.safetensors \
--lora-dir ./checkpoints/LTX-2.3-RL-Lora \
--output-path ./merged_model_ltx23.safetensors \
--dtype bf16
5οΈβ£ Inference π₯
python scripts/inference.py \
--model_path ./merged_model_ltx2.safetensors \
--gemma_path $GEMMA_MODEL_PATH \
--prompt "A man plays acoustic guitar on a wooden stage, warm applause from the audience" \
--seed 42 \
--output_dir ./results
π Citation
If you find this work helpful, please consider citing:
@article{zhang2026omninft,
title={OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation},
author={Zhang, Guohui and Ma, XiaoXiao and Huang, Jie and Xu, Hang and Yu, Hu and Fu, Siming and Li, Yuming and Xue, Zeyue and Song, Lin and Huang, Haoyang and Duan, Nan and Zhao, Feng},
journal={arXiv preprint arXiv:2605.12480},
year={2026}
}
π Acknowledgements
- π¬ Lightricks for the base models
- π€ HuggingFace PEFT for LoRA support
π License
This project is licensed under the Apache 2.0 License.