ERNIE-Image Collection The series of image generation models, including text2img and img2img. • 2 items • Updated 2 days ago • 17
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 3 days ago • 24
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published 3 days ago • 64
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 85 items • Updated 4 days ago • 525
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 4 items • Updated 3 days ago • 39
MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 4 items • Updated 1 day ago • 11
Gemma 4 Collection Gemma 4 is Google's new model family, including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated 4 days ago • 139