facebook/mask2former-swin-large-cityscapes-semantic Image Segmentation • 0.2B • Updated Sep 7, 2023 • 247k • • 37
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models Paper • 2306.07691 • Published Jun 13, 2023 • 13