HDR Video Generation via Latent Alignment with Logarithmic Encoding Paper • 2604.11788 • Published 7 days ago • 4
ERNIE-Image Collection The serieas of image generation models, including text2img、img2img. • 2 items • Updated 6 days ago • 22
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 7 days ago • 28
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published 7 days ago • 69
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 86 items • Updated 3 days ago • 533
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 4 items • Updated 7 days ago • 40
MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 5 items • Updated 3 days ago • 35
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated 3 days ago • 148