Speech Recognition - a shail-2512 Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

shail-2512 's Collections

MultiModal (Any-to-Any)

ALMs (Audio Language Models)

Reasoning (LRMs)

Image Generation

Video Generation

Speech Recognition

Dataset to fine-tune Embeddings

Reranking Models

Embedding Models

Speech Recognition

updated Dec 2, 2024

nvidia/canary-1b

Automatic Speech Recognition • Updated Dec 3, 2025 • 2.59k • 457
facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 75.9k • 984
nyrahealth/CrisperWhisper

Automatic Speech Recognition • 2B • Updated Apr 7 • 50.1k • 332
openai/whisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.98M • • 3.04k

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs