
EDEN OS V2 — Credits & Acknowledgements

Hallo — Portrait Image Animation

EDEN OS V2's face animation system is powered by Hallo, developed by the Fudan University Generative Vision Lab (fudan-generative-vision).

We are deeply grateful to the Hallo team for their groundbreaking research in audio-driven portrait animation.

Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization

Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks

Citation

@misc{cui2025hallo4,
    title={Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization},
    author={Jiahao Cui and Baoyou Chen and Mingwang Xu and Hanlin Shang and
            Yuxuan Chen and Yun Zhan and Zilong Dong and Yao Yao and
            Jingdong Wang and Siyu Zhu},
    year={2025},
    eprint={2505.23525},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

@misc{cui2024hallo3,
    title={Hallo3: Highly Dynamic and Realistic Portrait Image Animation
           with Diffusion Transformer Networks},
    author={Jiahao Cui and Hui Li and Yun Zhang and Hanlin Shang and
            Kaihui Cheng and Yuqi Ma and Shan Mu and Hang Zhou and
            Jingdong Wang and Siyu Zhu},
    year={2024},
    eprint={2412.00733},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Additional Technologies

  • Edge TTS — Microsoft Edge Text-to-Speech for Eve's voice
  • AvatarForcing — One-step streaming talking avatars (arXiv:2603.14331)
  • Wav2Vec2 — Facebook's audio encoder (facebook/wav2vec2-base-960h)
  • WAN2.1 — Base video generation model (Wan-AI/Wan2.1-T2V-1.3B)
  • MediaPipe — Google's face mesh detection

License

Hallo4 is a derivative of WAN2.1-1.3B, governed by the WAN LICENSE. Hallo3 is a derivative of CogVideo-5B, released under the MIT license.