microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 319k • 1.57k
jq/whisper-large-v3-salt-plus-xog-myx-kin-swa Automatic Speech Recognition • 2B • Updated Feb 4, 2025 • 2 • 1
MMAudio: generating synchronized audio from video/text 🔊 • Generate audio from video and text prompts • Running on Zero • Featured • 927