Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
blanchefort 's Collections
Medical
VLA models
Audio
Translate
OCR
OmniModels
Edge models
Video encoders
Judge
Datasets for Embodied
Ru text encoders
Text2Image
VLMs

Audio

updated 22 days ago
Upvote
-

  • nvidia/audio-flamingo-3-hf

    Audio-Text-to-Text • Updated Jan 27 • 180k • 174

  • facebook/sam-audio-large

    Updated Dec 30, 2025 • 40.5k • 374

  • google/medasr

    Automatic Speech Recognition • Updated Jan 26 • 40.8k • 290

  • FunAudioLLM/Fun-CosyVoice3-0.5B-2512

    Text-to-Speech • Updated Feb 3 • 6.31k • 470

  • facebook/sam-audio-large-tv

    Updated Dec 30, 2025 • 636 • 24

  • Qwen/Qwen3-TTS-12Hz-0.6B-Base

    Text-to-Speech • Updated Jan 29 • 264k • 188

  • MTUCI/spectra_0

    Audio Classification • Updated 22 days ago • 6
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs