bytedance-research/HuMo
Image-to-Video • Updated • 70 • 267
UMO based on OmniGen2
Inpaint images using Qwen Image with inpainting ControlNet
Chat with AI using the ERNIE-4.5 model
Detect objects in images and videos
Transcribe audio files to text with language detection
Generate images from text prompts