AudioDataset
BAAI/SeniorTalk • Viewer • Updated Jan 18 • 60.1k • 923 • 33
BAAI/ChildMandarin • Viewer • Updated May 19, 2025 • 40.7k • 89 • 35
TTS
nari-labs/Dia-1.6B • Text-to-Speech • Updated Jun 1, 2025 • 113k • 2.83k
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models • Paper • 2406.02430 • Published Jun 4, 2024 • 38
stepfun-ai/Step-Audio-TTS-3B • Text-to-Speech • 4B • Updated Feb 17, 2025 • 47 • 196
SparkAudio/Spark-TTS-0.5B • Text-to-Speech • Updated Mar 7, 2025 • 884 • 726
ASR
BELLE-2/Belle-whisper-large-v3-zh-punct • Automatic Speech Recognition • 2B • Updated Apr 16, 2025 • 3.16k • 46
nyrahealth/CrisperWhisper • Automatic Speech Recognition • Updated Dec 19, 2024 • 40.1k • 325
icefall-ASR
luomingshuang/icefall_asr_wenetspeech_pruned_transducer_stateless2 • Updated Oct 13, 2022 • 4
pkufool/icefall-asr-zipformer-wenetspeech-20230615 • Updated Jul 7, 2023 • 3
csukuangfj/sherpa-onnx-streaming-zipformer-bilingual-zh-en-2023-02-20 • Updated May 28, 2024 • 1
marcoyang/sherpa-ncnn-streaming-zipformer-zh-14M-2023-02-23 • Updated Mar 7, 2024 • 5