google-research-datasets/conceptual_captions
Viewer • Updated • 5.34M • 16.9k • 108
Edit images using text instructions
Edit images using natural-language instructions
Generate music from a text description and optional melody
Combine voice cloning and portrait lipsync animation
Generate a talking face video from an image and audio